TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment



A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment.


Text-Acoustic Dual-Alignment Large Language Model

By pairing a novel tokenizer with a matching architectural design, TADA achieves high-fidelity speech synthesis and generation at a fraction of the computational cost of traditional models.
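To make the 1:1 alignment idea concrete, here is a minimal illustrative sketch (not the actual TADA API; the token names and the helper function are hypothetical): each text token is paired with the acoustic token covering the same time span, and the pairs are interleaved into a single stream that one language model can process autoregressively.

```python
# Illustrative sketch of 1:1 text-acoustic alignment.
# NOTE: toy tokens and function names are hypothetical, not TADA's real API.

def interleave_1to1(text_tokens, speech_tokens):
    """Interleave aligned text and acoustic tokens into one stream.

    A strict 1:1 alignment means both sequences have the same length,
    so every text token is immediately followed by its acoustic partner.
    """
    assert len(text_tokens) == len(speech_tokens), "1:1 alignment needs equal lengths"
    stream = []
    for t, s in zip(text_tokens, speech_tokens):
        stream.append(t)  # text token for this time span
        stream.append(s)  # acoustic code for the same span
    return stream

text = ["HE", "LLO"]            # toy text tokens
speech = ["<a17>", "<a42>"]     # toy acoustic codes from the codec
print(interleave_1to1(text, speech))
# ['HE', '<a17>', 'LLO', '<a42>']
```

Because the two modalities share one stream, the model needs no separate cross-attention path between a text branch and a speech branch, which is where the computational savings come from.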

⭐️ arxiv: https://arxiv.org/abs/2602.23068
⭐️ demo: https://huggingface.co/spaces/HumeAI/tada
⭐️ github: https://github.com/HumeAI/tada
⭐️ blog post:
