SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 5 days ago • 174
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 7 days ago • 201
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 9 days ago • 74