Running 86 Unlocking On-Policy Distillation for Any Model Family π 86 Visualize on-policy distillation for any model family
Running on CPU Upgrade Featured 3.01k The Smol Training Playbook π 3.01k The secrets to building world-class LLMs
view article Article ChatML vs Harmony: Understanding the new Format from OpenAI π Aug 9, 2025 β’ 53
meituan-longcat/LongCat-Flash-Chat Text Generation β’ 562B β’ Updated Sep 24, 2025 β’ 24.2k β’ 526
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated Dec 10, 2025 β’ 338k β’ 1.57k
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation β’ 24B β’ Updated Apr 20, 2025 β’ 19 β’ β’ 59
bartowski/DeepSeek-R1-Distill-Qwen-32B-abliterated-GGUF Text Generation β’ 33B β’ Updated Jan 25, 2025 β’ 8.15k β’ 132