Submitted by akhaliq 38 Seed-TTS: A Family of High-Quality Versatile Speech Generation Models · 46 authors 1.53k 2
Submitted by akhaliq 18 I4VGen: Image as Stepping Stone for Text-to-Video Generation · 4 authors 24 3
Submitted by akhaliq 12 RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots · 8 authors 1.14k 1
Submitted by akhaliq 11 V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation · 10 authors 2.37k 2
Submitted by akhaliq 10 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation · 7 authors 4