Alvin

AZH04

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

AgentOdyssey: Open-Ended Long-Horizon Text Game Generation for Test-Time Continual Learning Agents

Paper • 2606.24893 • Published May 29 • 6

updated a model 4 days ago

ssrm/gb300-selftrain

Updated 4 days ago

published a model 4 days ago

ssrm/gb300-selftrain

Updated 4 days ago

upvoted a paper about 1 month ago

Steered LLM Activations are Non-Surjective

Paper • 2604.09839 • Published May 7 • 15

upvoted a paper 9 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9, 2025 • 41

upvoted a paper 11 months ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13, 2025 • 53

updated a model over 1 year ago

AZH04/Taxi-v3

Reinforcement Learning • Updated Feb 17, 2025

published a model over 1 year ago

AZH04/Taxi-v3

Reinforcement Learning • Updated Feb 17, 2025

updated a model over 1 year ago

AZH04/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Feb 17, 2025

published a model over 1 year ago

AZH04/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Feb 17, 2025

updated a model over 1 year ago

AZH04/ppo-LunarLander-v2

Reinforcement Learning • Updated Dec 27, 2024

upvoted a paper over 1 year ago

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1, 2024 • 35
