Nemotron-Labs-Diffusion Collection A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 11 days ago • 49
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 25 days ago • 122
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published May 14 • 91
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 8 days ago • 221
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 62
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 8 days ago • 161
daggr HF Spaces Collection Explore the collection of dagger apps on HF Spaces • 14 items • Updated Jan 30 • 11
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 23 items • Updated 11 days ago • 329