VladShash/deepseek-math-7B-lean-prover-dpo-300k-mistral-150k-olmo Text Generation • 7B • Updated May 16 • 226
VladShash/deepseek-math-7B-lean-prover-grpo-olmo-weighed Text Generation • 7B • Updated May 2 • 134 • 1