Generate lip-sync videos from uploaded videos and audio files
Transcribe audio or YouTube video to text
Identify human poses in images
Generate images from text prompts