mradermacher/Qwen3-0.6B-Dakota-Grammar-RL-GGUF Reinforcement Learning • 0.8B • Updated Nov 10, 2025 • 163
HarleyCooper/Qwen3-30B-ThinkingMachines-Dakota1890 Reinforcement Learning • Updated Nov 23, 2025 • 10