Yukai Wang's picture

Yukai Wang

defu2596

·

wonderNefelibata

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper 1 day ago

Rubric-based On-policy Distillation

liked a dataset 7 months ago

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 7 days ago • 94

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published 6 days ago • 36

liked a dataset 7 months ago

cais/hle

Benchmark • Updated Jan 20 • 2.5k • 48.8k • 797

upvoted a paper 8 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

liked a dataset 8 months ago

data-is-better-together/10k_prompts_ranked

Viewer • Updated Mar 7, 2024 • 10.3k • 881 • 168

New activity in meta-llama/Llama-3.2-11B-Vision-Instruct about 1 year ago

Request rejected

#109 opened about 1 year ago by

New activity in Fancy-MLLM/R1-Onevision-7B-RL about 1 year ago

Model Selection

#1 opened about 1 year ago by