Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
wang
wzx111
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
5 days ago
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
published
a model
5 days ago
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
updated
a model
5 days ago
wzx111/14B-Aggressive-GSPO-LR2e-6-G32
View all activity
Organizations
None yet
spaces
2
Sort: Recently updated
pinned
Sleeping
My Argilla
✍
好
Runtime error
Chatweb
📊
models
10
Sort: Recently updated
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
Updated
5 days ago
wzx111/14B-Aggressive-GSPO-LR2e-6-G32
Updated
5 days ago
wzx111/Qwen3-1.7B-GRPO-math
Updated
Nov 29, 2025
wzx111/Qwen3-1.7B-Open-R1-ADPO
Text Generation
•
2B
•
Updated
Nov 23, 2025
•
2
wzx111/Qwen3-1.7B-Open-R1-GRPO-Baseline
Text Generation
•
2B
•
Updated
Nov 22, 2025
•
1
wzx111/Qwen3-1.7B-Open-R1-GRPO
2B
•
Updated
May 14, 2025
wzx111/Qwen3-1.7B-Open-R1-GDPO-epcoh_
Text Generation
•
2B
•
Updated
May 14, 2025
wzx111/Qwen3-1.7B-MATH-GDPO-EPOCH2
Text Generation
•
2B
•
Updated
May 2, 2025
•
1
wzx111/Qwen3-1.7B-MATH-GDPO
Text Generation
•
2B
•
Updated
May 1, 2025
•
6
•
1
wzx111/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Apr 28, 2025
datasets
3
Sort: Recently updated
wzx111/MATH-lighteval-level3
Viewer
•
Updated
Dec 9, 2025
•
2.72k
•
13
wzx111/MATH-lighteval-level-middlehigh
Viewer
•
Updated
Nov 24, 2025
•
5.63k
•
12
wzx111/MATH-lighteval-level-middle
Viewer
•
Updated
Nov 24, 2025
•
7.87k
•
13