Article • Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • 15 days ago • 78
Article • Ulysses Sequence Parallelism: Training with Million-Token Contexts • 16 days ago • 23
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 • 208 • Explore synthetic data experiments as an interactive bookshelf
Article • 🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models • Jan 20 • 37
Scaling Laws for Code: Every Programming Language Matters Paper • 2512.13472 • Published Dec 15, 2025 • 15