XYX
xuyd16
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago
TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning authored a paper about 2 months ago
Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-TrainingOrganizations
None yet