JGOS-31B-Think

Korean reasoning-focused LLM (31B, Gemma4) from VIDRAFT. Native step-by-step think reasoning.

Docker Deployment (K-AI Evaluation)

Image: vidraft/jgos-31b-think:01.03
vLLM 0.22.0 (model baked-in), Port 8000 (OpenAI-compatible API).

Memory-tuned for single eval GPU: max-model-len 8192, vision disabled (text eval), max-num-seqs 16, gpu-memory-utilization 0.90. bf16 weights ~59GB (needs a single GPU >= ~64GB; for smaller GPUs use FP8).

License

Gemma license (inherited from base).

Downloads last month
-
Safetensors
Model size
31B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for VIDraft/JGOS-31B-Think

Quantizations
2 models