Featured • The Smol Training Playbook • 2.95k • The secrets to building world-class LLMs
Shaping capabilities with token-level data filtering Paper • 2601.21571 • Published 8 days ago • 25
Article • We Got Claude to Build CUDA Kernels and teach open models! • +2 • 9 days ago • 122