Rethinking Video Generation Model for the Embodied World • Paper • 2601.15282 • Published 17 days ago • 43
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head • Paper • 2601.07832 • Published 26 days ago • 51