OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published 1 day ago • 29
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 16 days ago • 7.34k • 416
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 22 days ago • 60
view article Article How to make NeuTTS-air generate over 200 seconds of audio in a single second. Nov 21, 2025 • 22
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published Dec 15, 2025 • 63