Policy and World Modeling Co-Training for Language Agents Paper • 2606.02388 • Published 5 days ago • 11
AHD Agent: Agentic Reinforcement Learning for Automatic Heuristic Design Paper • 2605.08756 • Published 28 days ago • 23
Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles Paper • 2505.23590 • Published May 29, 2025 • 25