Running 3.57k The Ultra-Scale Playbook 🌌 3.57k The ultimate guide to training LLM on large GPU Clusters
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19, 2024 • 59
Running on CPU Upgrade 13.7k Open LLM Leaderboard 🏆 13.7k Track, rank and evaluate open LLMs and chatbots