view article Article Distribution Matching Prevents Mode Collapse in Training Reasoning Models 6 days ago • 2
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 6