Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams Paper • 2606.01770 • Published 6 days ago • 12
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 10 days ago • 192
Why Do Reasoning Models Lose Coverage? The Role of Data and Forks in the Road Paper • 2605.17026 • Published 22 days ago • 4
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 19 days ago • 186
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 243
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published Apr 8 • 38
LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models Paper • 2603.28301 • Published Mar 30 • 82
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 352
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published Mar 17 • 109
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 249