INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Paper • 2510.25602 • Published Oct 29, 2025 • 80
Running on CPU Upgrade Featured 3.12k The Smol Training Playbook 📚 3.12k The secrets to building world-class LLMs
Towards a Unified View of Large Language Model Post-Training Paper • 2509.04419 • Published Sep 4, 2025 • 76
Running 338 LLM Embeddings Explained: A Visual and Intuitive Guide 🚀 338 How Language Models Turn Text into Meaning, From Traditional
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published Feb 18, 2025 • 41
Running Agents Featured 2.08k Wan2.1 💻 2.08k Wan: Open and Advanced Large-Scale Video Generative Models
Running 3.8k The Ultra-Scale Playbook 🌌 3.8k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade 193 LLM Hallucination Leaderboard 🚀 193 View and filter LLM hallucination leaderboard