WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 9 days ago • 100
DEMON: Diffusion Engine for Musical Orchestrated Noise Paper • 2605.28657 • Published 21 days ago • 11
From Model Scaling to System Scaling: Scaling the Harness in Agentic AI Paper • 2605.26112 • Published 23 days ago • 9
DCAgent3/medagentbench_g1_diverse_tezos_top4_3160_8b_20260602_100923 Viewer • Updated 14 days ago • 1.49k • 33 • 1
Data-Gouv-ML/diagnostic-sur-le-patrimoine-arbore-gere-par-la-ville-de-rennes Viewer • Updated 2 days ago • 95.1k • 60 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 21 days ago • 423
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 28 days ago • 83
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published 27 days ago • 169