LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 28 days ago • 43
STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media Paper • 2605.25162 • Published May 24 • 4
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published about 1 month ago • 431