1 3 1

UCLA_WHX

willhx

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness

upvoted a paper about 1 month ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

submitted a paper about 1 month ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

View all activity

Organizations

upvoted a paper about 24 hours ago

HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness

Paper • 2606.12882 • Published 2 days ago • 8

upvoted a paper about 1 month ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

submitted a paper to Daily Papers about 1 month ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

updated a collection about 1 month ago

T2PO

Collection

2 items • Updated May 1

updated a model about 1 month ago

willhx/Qwen3-4B-rft-webshop-5

4B • Updated May 1 • 406

published a model about 1 month ago

willhx/Qwen3-4B-rft-webshop-5

4B • Updated May 1 • 406

updated a model about 1 month ago

willhx/Qwen3-4B-rft-alfworld-e5

4B • Updated May 1 • 2

published a model about 1 month ago

willhx/Qwen3-4B-rft-alfworld-e5

4B • Updated May 1 • 2

updated a model about 2 months ago

willhx/Qwen3-30B-A3B_base_math_search

Text Generation • 31B • Updated Apr 17 • 667

published a model about 2 months ago

willhx/Qwen3-30B-A3B_base_math_search

Text Generation • 31B • Updated Apr 17 • 667

updated a dataset 2 months ago

willhx/sft_swe

Viewer • Updated Apr 8 • 8.35k • 11

published a dataset 2 months ago

willhx/sft_swe

Viewer • Updated Apr 8 • 8.35k • 11

updated a model 3 months ago

willhx/Qwen3-4B-alfworld-finished

4B • Updated Mar 25 • 2

published a model 3 months ago

willhx/Qwen3-4B-alfworld-finished

4B • Updated Mar 25 • 2

authored a paper 4 months ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published Feb 25 • 26

upvoted a paper 4 months ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published Feb 25 • 26

updated a dataset 7 months ago

willhx/SkyRL-SQL-Reproduction

Viewer • Updated Nov 20, 2025 • 5.91k • 19

published a dataset 7 months ago

willhx/SkyRL-SQL-Reproduction

Viewer • Updated Nov 20, 2025 • 5.91k • 19

liked a dataset over 1 year ago

chen-yingfa/CFDBench

Updated Sep 4, 2024 • 104 • 3

UCLA_WHX

AI & ML interests

Recent Activity

Organizations

willhx's activity