Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
25
1
Runpeng Dai
PRO
Leo-Dai
Follow
TongZheng1999's profile picture
1 follower
·
3 following
AI & ML interests
None yet
Recent Activity
authored
a paper
about 11 hours ago
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing
upvoted
a
paper
about 20 hours ago
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
upvoted
a
paper
about 20 hours ago
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing
View all activity
Organizations
Leo-Dai
's models
17
Sort: Recently updated
Leo-Dai/PPO_BL_250_critic
4B
•
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_200_critic
Updated
Aug 15, 2025
•
2
Leo-Dai/PPO_BL_300_actor
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_250_actor
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_300_critic
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_40
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_30
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_20
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_400
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_10
4B
•
Updated
Aug 15, 2025
•
2
Leo-Dai/GRPO_BL_350
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_200
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_150
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_100
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_300
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_250
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_50
4B
•
Updated
Aug 13, 2025