formalmathatepfl/deepseek-prover-v2-grpo-800 Reinforcement Learning • 7B • Updated 29 days ago • 1.29k
formalmathatepfl/deepseek-prover-v2-cpt-sft-feedback-1e Text Generation • 7B • Updated about 1 month ago • 1.45k