Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 10 days ago • 61
Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 18 days ago • 23
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 17 days ago • 79
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 2 days ago • 241
Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 22 days ago • 45
Article Follow the White Rabbit: Using Embeddings So You Never Get Lost in Translation Feb 23 • 8
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 490
Real-time Vision Models Collection A collection of real-time detectors. • 20 items • Updated Feb 18 • 23