Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models Paper • 2604.06912 • Published 5 days ago • 4
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 10 days ago • 349
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 216
Experience Transfer for Multimodal LLM Agents in Minecraft Game Paper • 2604.05533 • Published 6 days ago • 13
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 14 days ago • 339