mradermacher/MathSmith-Hard-Problem-Synthesizer-Qwen3-8B-i1-GGUF 8B • Updated 20 days ago • 1.21k • 1
asatheesh/deepmath-qwen3-4b-instruct-grpo-lora-eagle3-spec2 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-grpo-lora-eagle3-spec4 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-grpo-lora-ngram-spec4 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-rloo-lora-eagle3-spec5 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-drgrpo-lora-eagle3-spec5 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-grpo-lora-eagle3-spec5 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-rloo-lora-ngram-spec5 Reinforcement Learning • Updated 9 days ago
asatheesh/deepmath-qwen3-4b-instruct-drgrpo-lora-ngram-spec5 Reinforcement Learning • Updated 9 days ago