xzxuan

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 17

upvoted a paper 7 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 7 days ago • 91

updated a collection 8 days ago

WideSeek-R1

Collection

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning • 4 items • Updated 7 days ago

updated a dataset 8 days ago

RLinf/WideSeek-R1-Corpus

Updated 7 days ago • 261

published a dataset 8 days ago

RLinf/WideSeek-R1-Corpus

Updated 7 days ago • 261

updated a model 8 days ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated 7 days ago • 55 • 1

published a model 8 days ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated 7 days ago • 55 • 1

upvoted a paper 3 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 108

New activity in RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood 4 months ago

Update README.md

#2 opened 4 months ago by HillFir

New activity in RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood 4 months ago

Update README.md

#2 opened 4 months ago by HillFir

published a dataset 5 months ago

xzxuan/VS-Bench

Updated Sep 23, 2025 • 1

published 2 models 5 months ago

RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 3

RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 3

updated 2 models 5 months ago

RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 3

RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood

Reinforcement Learning • 8B • Updated Oct 10, 2025 • 3

upvoted a paper 8 months ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

liked a dataset 8 months ago

zelaix/VS-Bench

Viewer • Updated Jun 4, 2025 • 3.2k • 24 • 2

authored a paper 8 months ago

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3, 2025 • 58
