Huining Yuan's picture

6

Huining Yuan

HuiningYuan

·

HuiningYuan

AI & ML interests

Reinforcement learning, LLM Agents, World models

Recent Activity

upvoted a paper about 20 hours ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

upvoted a paper 5 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

upvoted a collection 2 months ago

View all activity

Organizations

upvoted a paper about 20 hours ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 17

upvoted a paper 5 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 6 days ago • 89

upvoted a collection 2 months ago

MARSHAL

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs • 6 items • Updated Dec 5, 2025 • 2

upvoted a paper 2 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 154

updated 5 models 2 months ago

nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 3

nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 1

nics-efc/MARSHAL-Generalist-Qwen3-8B

Text Generation • 8B • Updated Dec 4, 2025 • 4

nics-efc/MARSHAL-Generalist-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 1

nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 4

updated a collection 2 months ago

MARSHAL

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs • 6 items • Updated Dec 5, 2025 • 2

published 5 models 2 months ago

nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 1

nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 3

nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 4

nics-efc/MARSHAL-Generalist-Qwen3-8B

Text Generation • 8B • Updated Dec 4, 2025 • 4

nics-efc/MARSHAL-Generalist-Qwen3-4B

Text Generation • 4B • Updated Dec 4, 2025 • 1