-
-
-
-
-
-
Inference Providers
Active filters:
nvfp4
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
58.3k
•
40
GadflyII/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
•
34.7k
•
13
Image-Text-to-Text
•
62B
•
Updated
•
9.21k
•
6
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
•
18B
•
Updated
•
325k
•
61
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
21.2k
•
23
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4
Text Generation
•
120B
•
Updated
•
162
•
3
vincentzed-hf/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
•
1.39k
•
3
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
•
17B
•
Updated
•
222
•
1
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
•
17B
•
Updated
•
148
•
1
mratsim/Behemoth-X-123B-v2-NVFP4
Text Generation
•
69B
•
Updated
•
146
•
2
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
•
4B
•
Updated
•
26
•
2
nvidia/DeepSeek-V3.2-NVFP4
Text Generation
•
394B
•
Updated
•
7.62k
•
5
GadflyII/MiniMax-M2.1-NVFP4
Text Generation
•
Updated
•
4.8k
•
6
JEILDLWLRMA/Qwen3-VL-4B-Instruct-NVFP4
Image-to-Text
•
3B
•
Updated
•
24
•
1
apolloparty/Qwen3-4B-NVFP4A16
2B
•
Updated
•
1
cortecs/Qwen3-8B-NVFP4A16
5B
•
Updated
•
3
5B
•
Updated
•
9
cortecs/Qwen3-8B-clean-sparse
6B
•
Updated
•
2
cortecs/Qwen3-8B-clean-sparse-nvfp4a16
5B
•
Updated
cortecs/Qwen3-8B-clean-sparse-finetuned-0.01-nvfp4a16
5B
•
Updated
•
1
cortecs/Qwen3-8B-clean-sparse-finetuned-0.1-nvfp4a16
5B
•
Updated
•
1
llmat/Mistral-Small-24B-Instruct-2501-NVFP4
Text Generation
•
14B
•
Updated
•
57
llmat/Qwen3-30B-A3B-Instruct-2507-NVFP4
Text Generation
•
17B
•
Updated
•
292
•
2
llmat/Qwen3-4B-Instruct-2507-NVFP4
Text Generation
•
3B
•
Updated
•
172
•
1
llmat/Qwen3-30B-A3B-NVFP4
Text Generation
•
17B
•
Updated
•
23
Text Generation
•
19B
•
Updated
•
2
Text Generation
•
9B
•
Updated
•
18
Text Generation
•
5B
•
Updated
•
4
Text Generation
•
3B
•
Updated
•
31
Text Generation
•
1B
•
Updated
•
2