VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection Paper • 2511.19436 • Published Nov 24, 2025
ReMoT: Reinforcement Learning with Motion Contrast Triplets Paper • 2603.00461 • Published Mar 20 • 1
Trajectory-Diversity-Driven Robust Vision-and-Language Navigation Paper • 2603.15370 • Published Mar 16
ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs Paper • 2605.25524 • Published May 25
DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams Paper • 2606.21337 • Published 7 days ago • 70
DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams Paper • 2606.21337 • Published 7 days ago • 70