Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated Dec 15, 2025 • 8
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 274
Running 3.67k The Ultra-Scale Playbook 🌌 3.67k The ultimate guide to training LLM on large GPU Clusters