1 2 31

Zhenghao Xu

zhenghaoxu

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

updated a dataset 5 days ago

zhenghaoxu/aime-beyond

updated a dataset 5 days ago

zhenghaoxu/aime-amc23

View all activity

Organizations

commented a paper 4 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 16 days ago • 185 •

updated 2 datasets 5 days ago

zhenghaoxu/aime-beyond

Viewer • Updated 5 days ago • 100 • 125

zhenghaoxu/aime-amc23

Viewer • Updated 5 days ago • 40 • 75

published a dataset 5 days ago

zhenghaoxu/aime-amc23

Viewer • Updated 5 days ago • 40 • 75

updated 4 datasets 5 days ago

updated a dataset 14 days ago

zhenghaoxu/dapo-math-17k

Viewer • Updated 14 days ago • 17.4k • 172

published 5 datasets 14 days ago

zhenghaoxu/dapo-math-17k

Viewer • Updated 14 days ago • 17.4k • 172

zhenghaoxu/aime-beyond

Viewer • Updated 5 days ago • 100 • 125

zhenghaoxu/aime-2026

Viewer • Updated 5 days ago • 30 • 188

zhenghaoxu/aime-2025

Viewer • Updated 5 days ago • 30 • 174

zhenghaoxu/aime-2024

Viewer • Updated 5 days ago • 30 • 174

published a dataset 15 days ago

zhenghaoxu/math-aime-eval

Viewer • Updated 5 days ago • 230 • 36

upvoted a paper 21 days ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Paper • 2602.05933 • Published 22 days ago • 5

upvoted a paper 24 days ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published 26 days ago • 41

liked 2 models 3 months ago

inclusionAI/LLaDA2.0-flash

Text Generation • Updated Dec 19, 2025 • 167 • 67

inclusionAI/LLaDA2.0-mini

Text Generation • Updated 18 days ago • 30.1k • 58

liked a model 4 months ago

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated Dec 19, 2025 • 174 • 90

Zhenghao Xu

AI & ML interests

Recent Activity

Organizations

zhenghaoxu's activity