MAXS: Meta-Adaptive Exploration with LLM Agents • Paper • 2601.09259 • Published 10 days ago • 92
A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation • Paper • 2601.09274 • Published 10 days ago • 83
OpenMathReasoning • Collection • Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 4 days ago • 46
WizardLMTeam/WizardLM_evol_instruct_V2_196k • Updated Mar 10, 2024 • 143k • 1.07k • 246
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters • Paper • 2408.03314 • Published Aug 6, 2024 • 63