Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
siyuanzhu's picture
5 17

siyuanzhu

siyuan-zhu
·

AI & ML interests

reinforcement learning

Recent Activity

upvoted a paper 3 days ago
GAGPO: Generalized Advantage Grouped Policy Optimization
authored a paper 3 days ago
GAGPO: Generalized Advantage Grouped Policy Optimization
authored a paper 5 months ago
Context-Picker: Dynamic context selection using multi-stage reinforcement learning
View all activity

Organizations

Sun Yat-sen University's profile picture Sun Yat-Sen University's profile picture

Papers 2

arxiv:2605.13217
arxiv:2512.14465

models 0

None public yet

datasets 3

siyuan-zhu/gsm8k-python

Viewer • Updated May 28, 2025 • 1.2k • 3 • 1

siyuan-zhu/kk-difficulty

Viewer • Updated Mar 24, 2025 • 6.9k • 6 • 1

siyuan-zhu/gsm8k-doubao-lite-difficulties

Viewer • Updated Mar 24, 2025 • 8.79k • 7 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs