-
jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
Text Generation • 4B • Updated • 984 -
jaygala24/Qwen3-4B-GRPO-math-reasoning
Text Generation • 4B • Updated • 842 -
jaygala24/Qwen3-4B-ReMax-math-reasoning
Text Generation • 4B • Updated • 789 -
jaygala24/Qwen3-1.7B-GRPO-KL-math-reasoning
Text Generation • 2B • Updated • 792
Jay Gala
jaygala24
·
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
updated a model 3 days ago
jaygala24/Qwen3-4B-ReMax-math-reasoning updated a model 3 days ago
jaygala24/Qwen3-4B-GRPO-math-reasoning updated a model 3 days ago
jaygala24/Qwen3-4B-GRPO-KL-math-reasoning