Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published 5 days ago • 11
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 25
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? Paper • 2410.01623 • Published Oct 2, 2024 • 4