YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Qwen3-32B β GGUF Quantized
This repo contains Qwen3-32B quantized weights for llama.cpp.
Included formats
Qwen3-32B-f16.ggufQwen3-32B-Q8_0.ggufQwen3-32B-Q6_K.ggufQwen3-32B-Q5_K_M.ggufQwen3-32B-Q4_K_M.gguf
Usage
./llama-cli -m Qwen3-32B-Q4_K_M.gguf -p "Hello"
Notes
- quantized using llama.cpp
- original model: https://huggingface.co/keypa/Qwen3-32B
- Downloads last month
- 32
Hardware compatibility
Log In
to view the estimation
4-bit
5-bit
6-bit
8-bit
16-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support