Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Wang
VictorZheng
Follow
0 followers
·
2 following
ZKBig
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Hidden States as Early Signals: Step-level Trace Evaluation and Pruning for Efficient Test-Time Scaling
upvoted
a
paper
4 months ago
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning
authored
a paper
4 months ago
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning
View all activity
Organizations
VictorZheng
's models
4
Sort: Recently updated
VictorZheng/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
2B
•
Updated
Jul 28, 2025
VictorZheng/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Jul 28, 2025
VictorZheng/Qwen2.5-1.5B-Instruct-GRPO
Updated
Jul 28, 2025
VictorZheng/qwen-2.5-3b-r1-countdown
Updated
Jul 18, 2025