view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 63
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 483
view post Post 4772 Qwen 3 can launch very soon. 👀https://github.com/ggml-org/llama.cpp/pull/12828 See translation 3 replies · 🔥 16 16 👀 9 9 ❤️ 8 8 + Reply