Generate realistic audio from text
Generate audio from text using a reference voice
Generate speech in a cloned voice
Generate text using open-source models
Describe and highlight entities in images