Hugging Face
Yikun Jiang
code1phoenix
AI & ML interests
None yet
Recent Activity
published a model 7 days ago: code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
published a model 7 days ago: code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
updated a model 10 days ago: code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Organizations
None yet
models
15
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k • Updated 10 days ago • 10
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k • Updated 11 days ago • 9
code1phoenix/pixelcopter • Reinforcement Learning • Updated 15 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated 15 days ago
code1phoenix/ppo-LunarLander-v2 • Reinforcement Learning • Updated 15 days ago • 38
code1phoenix/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated 15 days ago • 25
code1phoenix/ppo-pyramid • Reinforcement Learning • Updated 15 days ago • 20
code1phoenix/ppo-SnowballTarget • Reinforcement Learning • Updated 15 days ago • 273
code1phoenix/cartpole-1 • Reinforcement Learning • Updated 15 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated 15 days ago • 41
datasets
0
None public yet