Models

1,364

Active filters: nvfp4

nota-ai/Solar-Open2-250B-Nota-NVFP4

Text Generation • 145B • Updated 9 days ago • 22.4k • 151

nota-ai/Solar-Open2-250B-Nota-NVFP4-GlobalPruned

Text Generation • 117B • Updated about 23 hours ago • 296 • 34

DreamFast/Qwen3-VL-4b-Heretic-ComfyUI

Image-Text-to-Text • Updated 15 days ago • 50

nota-ai/Solar-Open-100B-NotaMoEQuant-NVFP4

Text Generation • 59B • Updated Mar 11 • 207 • 22

PassingByPixels/Qwen3.6-27B-Architect-Polaris2-Fable-B-F451-NVFP4

Image-Text-to-Text • 15B • Updated 4 days ago • 2.48k • 12

DreamFast/gemma-3-12b-it-heretic-v2

Text Generation • 12B • Updated 15 days ago • 11.1k • 77

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated Jun 15 • 1.25M • 82

jarrelscy/GLM-5.2-NVFP4-AQLM-hybrid

Image-Text-to-Text • 208B • Updated 2 days ago • 3.09k • 14

sakamakismile/Huihui-ThinkingCap-Qwen3.6-27B-abliterated-NVFP4

Image-Text-to-Text • 17B • Updated 20 days ago • 9.78k • 33

0xSero/Laguna-S-2.1-Hybrid-3.25bpw

Text Generation • Updated 6 days ago • 212 • 6

PassingByPixels/Qwen3.6-27B-Architect-Polaris2-Fable-B-F451-NVFP4-MTP

Image-Text-to-Text • 15B • Updated 4 days ago • 672 • 6

bottlecapai/ThinkingCap-Qwen3.6-27B-NVFP4

Image-Text-to-Text • 17B • Updated 3 days ago • 3.06k • 6

michaelw9999/Qwen3.6-27B-NVFP4-MTP-GGUF

27B • Updated Jun 6 • 40.4k • 55

protoLabsAI/ThinkingCap-Qwen3.6-27B-MTP-GGUF

0.5B • Updated 22 days ago • 48.1k • 58

sakamakismile/KAT-Coder-V2.5-Dev-NVFP4

20B • Updated 8 days ago • 1.78k • 11

protoLabsAI/ThinkingCap-Qwen3.6-27B-heretic-MTP-GGUF

Text Generation • 27B • Updated 4 days ago • 2.35k • 5

nvidia/MiniMax-M3-NVFP4

Text Generation • 247B • Updated Jun 26 • 576k • 71

rdtand/Qwen3.6-27B-PrismaAURA-5.5bit-vllm

20B • Updated Jun 25 • 42.7k • 26

s-batman/Ornith-1.0-35B-NVFP4-MTP-GGUF

Text Generation • 36B • Updated Jun 29 • 37.7k • 38

nvidia/Nemotron-3-Embed-1B-NVFP4

Sentence Similarity • 0.7B • Updated 1 day ago • 22.4k • 72

jcbtc/Laguna-S-2.1-NVFP4-GGUF

Text Generation • 118B • Updated 9 days ago • 1.06k • 5

sakamakismile/Qwen3.6-27B-Fable-Fusion-MTP-NVFP4

Image-Text-to-Text • 17B • Updated 5 days ago • 477 • 5

scottgl/MiniMax-M2.7-REAP-172B-A10B-NVFP4-GB10

Text Generation • 98B • Updated Apr 16 • 3.26k • 6

RedHatAI/Qwen3.6-35B-A3B-NVFP4

20B • Updated 19 days ago • 1.85M • 166

nvidia/Gemma-4-26B-A4B-NVFP4

Text Generation • 14B • Updated May 11 • 1.26M • 124

nvidia/Qwen3.5-122B-A10B-NVFP4

Text Generation • 65B • Updated Jun 2 • 249k • 47

nvidia/diffusiongemma-26B-A4B-it-NVFP4

Text Generation • 14B • Updated 28 days ago • 1.68M • 112

s-batman/Ornith-1.0-9B-NVFP4-MTP-GGUF

Text Generation • 9B • Updated Jun 29 • 6.83k • 7

SergiusFlavius/Qwen3-VL-4B-Instruct-heretic-NVFP4

Image-Text-to-Text • Updated about 1 month ago • 11

CodeFault/Nvidia-Qwen3.6-27B-NVFP4-GGUF

Text Generation • 27B • Updated 15 days ago • 38.1k • 24