Policy and World Modeling Co-Training for Language Agents Paper • 2606.02388 • Published 8 days ago • 11
AHD Agent: Agentic Reinforcement Learning for Automatic Heuristic Design Paper • 2605.08756 • Published about 1 month ago • 23