Dawn's picture

3

Dawn

LegendaryDawn

·

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

LegendaryDawn/SDRL-freq-Qwen3-8B-Base-majority_n8_l4096-DAPO_n8_bs256_long12-yarn2-step200

published a model 2 days ago

LegendaryDawn/SDRL-freq-Qwen3-8B-Base-majority_n8_l4096-DAPO_n8_bs256_long12-yarn2-step200

updated a model 4 days ago

LegendaryDawn/SDRL-freq-Qwen3-8B-Base-majority_n8_l4096-DAPO_n8_bs256_long12-yarn2-step125

View all activity

Organizations

None yet

upvoted 2 papers 8 days ago

Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

Paper • 2601.22297 • Published 21 days ago • 2

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published 8 days ago • 51

upvoted a paper 3 months ago

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

Paper • 2511.04800 • Published Nov 6, 2025 • 1