
yanan chen

yananchen

https://github.com/yanan1116

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

yananchen/robocasa_lerobot

updated a dataset about 2 months ago

yananchen/skillrl_sft_alfworld_prompt_completion

published a dataset about 2 months ago

yananchen/skillrl_sft_alfworld_prompt_completion


Organizations

None yet

upvoted an article 6 months ago

Article

LeRobot v0.4.0: Supercharging OSS Robot Learning

imstevenpmwork, aractingi, pepijn223, CarolinePascal, jadechoghari, fracapuano, AdilZtn, nepyope, thomwolf • Oct 24, 2025 • 50

upvoted an article 10 months ago

Article

Visualize and understand GPU memory in PyTorch

qgallouedec • Dec 24, 2024 • 272

upvoted 3 articles 11 months ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

ariG23498 • Jan 19, 2025 • 50

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr • Feb 7, 2025 • 294

Article

Proximal Policy Optimization (PPO)

ThomasSimonini • Aug 5, 2022 • 87

upvoted an article almost 2 years ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

mlabonne • Jul 29, 2024 • 372

upvoted a paper about 2 years ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26, 2024 • 16
