OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 5 days ago • 96
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 8 days ago • 100
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 10 days ago • 76
InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 5 days ago • 77
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 6 days ago • 110
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 19 days ago • 193
ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood Paper • 2605.29257 • Published 19 days ago • 10
Is Position Bias in Dense Retrievers Built In-or Learned from Data? Paper • 2605.26578 • Published 21 days ago • 20