Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 8 days ago • 82
The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage Paper • 2508.09603 • Published Aug 13, 2025 • 2
Amulet: Putting Complex Multi-Turn Conversations on the Stand with LLM Juries Paper • 2505.20451 • Published May 26, 2025
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text Paper • 2410.04265 • Published Oct 5, 2024
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning Paper • 2505.20161 • Published May 26, 2025 • 1
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30, 2025 • 143
Tulu V1 Suite Collection The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". • 34 items • Updated Mar 4, 2025 • 3