Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published 6 days ago • 12
Article • Re-understanding KL Approximation from an RL-for-LLM Lens: Notes on “Approximating KL Divergence” • Aug 11, 2025 • 8