IndexTTS 2 Demo
π’
781
Generate expressive speech audio from text with emotion control
Generate speech from text using a reference audio
Generate speech from text using a reference voice
Expressive Zeroshot TTS
An Agentic Framework with Tools for Complex Reasoning
Conversational speech generation
Restore and enhance faces in photos
Audio-based video editing using AI-generated transcription