haiquan
chenhaiquan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
new activity
7 months ago
Qwen/Qwen3-8B-Base:qwen3其他模型的eos_token应该是什么?
Organizations
None yet