WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 8 days ago • 99
DEMON: Diffusion Engine for Musical Orchestrated Noise Paper • 2605.28657 • Published 20 days ago • 11
From Model Scaling to System Scaling: Scaling the Harness in Agentic AI Paper • 2605.26112 • Published 22 days ago • 9
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 20 days ago • 423
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 27 days ago • 83
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published 26 days ago • 169