GPT-OSS General (4.2B to 20B) Collection Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13 • 10
GPT-OSS Pruned Experts (4.2B-20B) [IF, Science, Math, etc.] Collection Complete collection of domain-specialized GPT-OSS models (1-32 experts) optimized for science, math, medicine, law, safety, and instruction following. • 8 items • Updated Aug 13 • 10
Roleplaying Collection Creativity at the cost of context & knowledge • 5 items • Updated 13 days ago • 14
Kimi-Linear-A3B Collection Moonshot's experimental MoE model with Kimi Delta Attention • 3 items • Updated Nov 1 • 18
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated Nov 14 • 162
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 24 days ago • 80
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 26 items • Updated about 5 hours ago • 128
Quantized Olmo 3 Collection Verified models. All compatible with vLLM for very fast inference. Use the 3.1 models as they are more recent. • 23 items • Updated 11 days ago • 3
AI PC: Text Generation Collection Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. • 186 items • Updated Aug 28, 2024 • 12
Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 371 items • Updated 17 days ago • 18
OpenVINO NPU Collection Models specifically tested on Intel's NPU with OpenVINO • 16 items • Updated Oct 29 • 2
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10 • 192