Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b Viewer • Updated 10 days ago • 306k • 15.5k • 251
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published Mar 25, 2025 • 29