naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B Text Generation • 11B • Updated 21 days ago • 5.16k • 180
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation • 33B • Updated 21 days ago • 33.1k • 391
Running on CPU Upgrade Featured 2.93k The Smol Training Playbook 📚 2.93k The secrets to building world-class LLMs
Learning Dynamics in Continual Pre-Training for Large Language Models Paper • 2505.07796 • Published May 12, 2025 • 19
Learning Dynamics in Continual Pre-Training for Large Language Models Paper • 2505.07796 • Published May 12, 2025 • 19 • 4