VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation • Paper • 2605.16079 • Published 10 days ago • 27
MolmoAct2: Action Reasoning Models for Real-world Deployment • Paper • 2605.02881 • Published 21 days ago • 336
GUI-G^2: Gaussian Reward Modeling for GUI Grounding • Paper • 2507.15846 • Published Jul 21, 2025 • 135