AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Paper • 2402.15506 • Published Feb 23, 2024 • 18
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs Paper • 2402.15491 • Published Feb 23, 2024 • 15
Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks Paper • 2510.12635 • Published Oct 14, 2025 • 17
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment Paper • 2605.19577 • Published 4 days ago • 55
SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents Paper • 2605.21384 • Published 3 days ago • 4
OpenComputer: Verifiable Software Worlds for Computer-Use Agents Paper • 2605.19769 • Published 4 days ago • 56