Lux0926/MetaMath-Mistral-7B-CGPO • 7B • 6
Lux0926/Qwen1.5-32B-SFT-CGPO • 33B • 5
Lux0926/MetaMath-Llama-8B-CGPO • 8B • 5
Lux0926/Qwen2-7B-SFT-CGPO • 8B • 5
Lux0926/MetaMath-Mistral-7B-Step-DPO • 7B • 4
Lux0926/MetaMath-Llama-8B-Step-DPO • 8B • 5
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO • 7B • 3
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO • 7B • 1
7B • 3
7B • 6
Lux0926/ASPRM-Training-Evaluation-Environment
Lux0926/ASPRM-MATHCODE-DeepSeek • 7B • 2
Lux0926/ASPRM-MATHCODE-Mistral • 7B • 1
7B • 1
8B • 2 • 1
7B • 2 • 1
Lux0926/metamath_mistral_7b
Lux0926/MetaMath-LLaMA-8B • 8B • 3 • 1