- SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
  Paper • 2604.02268 • Published • 92
- ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning
  Paper • 2603.05863 • Published • 6
- GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning
  Paper • 2604.02721 • Published • 347
- GLM-5: from Vibe Coding to Agentic Engineering
  Paper • 2602.15763 • Published • 139
glenn ba
glennba
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 minutes ago
AI Can Learn Scientific Taste
upvoted an article about 14 hours ago
Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs
upvoted an article about 14 hours ago
How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs
Organizations
None yet