A unified multimodal understanding and generation model.
Edit images using instructions or masks
Generate app code from your idea
Generate code instantly from natural language prompts
Generate images from your text description
Upscale low‑resolution images to higher resolution
Create animated GIFs from text prompts
Text-to-Video