Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 16
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • 15B • Updated Feb 24, 2025 • 737k • 608
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated Jan 30 • 189k • 515
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21, 2025 • 74.6k • 1.3k
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 3.95k • 172