Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation Paper • 2602.12125 • Published 9 days ago • 57
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing Paper • 2602.03560 • Published 18 days ago • 44
GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation Paper • 2512.17495 • Published Dec 19, 2025 • 20
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper • 2510.14943 • Published Oct 16, 2025 • 40
DCA: Diversified Co-Attention towards Informative Live Video Commenting Paper • 1911.02739 • Published Nov 7, 2019
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding Paper • 2310.19060 • Published Oct 29, 2023