AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing Paper • 2602.17607 • Published Feb 19
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents Paper • 2504.16918 • Published Jan 21
PerspectiveGap: A Benchmark for Multi-Agent Orchestration Prompting Paper • 2606.08878 • Published 6 days ago • 1
PerspectiveGap Benchmark Collection Paper, dataset, and leaderboard for multi-agent orchestration prompting. • 3 items • Updated 2 days ago
Running Agents PerspectiveGap Leaderboard 🧠Explore AI model rankings on the PerspectiveGap benchmark
Running Agents PerspectiveGap Leaderboard 🧠Explore AI model rankings on the PerspectiveGap benchmark
PerspectiveGap: A Benchmark for Multi-Agent Orchestration Prompting Paper • 2606.08878 • Published 6 days ago • 1