Whisper
Transcribe audio files into text instantly
Transcribe audio files into text instantly
Upscale low‑resolution images to high‑resolution with AI
Translate text between 200 languages instantly
Generate depth map from any photo
Generate videos from text prompts and optional images
Powerful Watermark Removal API
Extend images to custom dimensions with AI outpainting
Generate images from text prompts
The agent using over 9000 vision models from the HF Hub.
Execute custom code from environment variable
Audio Conditioned LipSync with Latent Diffusion Models
Separate vocals and accompaniment from audio
Versatile Single-Image 3D Face Reconstruction
NSFW Uncensored text/Image to image for AI Limits
Translate text between languages
Expressive Zeroshot TTS
Video deep fake
Flexible Photo Recrafting While Preserving Your Identity