dong zi's picture

In a Training Loop 🔄

2 3

dong zi

shenyao

·

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning

authored a paper 2 days ago

RAVE: Re-Allocating Visual Attention in Large Multimodal Models

authored a paper 2 days ago

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Paper • 2606.18844 • Published 4 days ago • 9

upvoted a paper 17 days ago

ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning

Paper • 2512.13095 • Published Dec 15, 2025 • 2