P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 20 days ago • 132
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Paper • 2501.05040 • Published Jan 9 • 15
Can Knowledge Editing Really Correct Hallucinations? Paper • 2410.16251 • Published Oct 21, 2024 • 55