view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 β’ 202
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 β’ 323
Running Featured 103 Qwen3 WebGPU π 103 A hybrid reasoning model that runs locally in your browser.
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 β’ 126
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 β’ 175
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ Aug 25, 2023 β’ 38