Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling • Paper • 2606.03102 • Published 4 days ago • 13
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding • Paper • 2605.29707 • Published 9 days ago • 139
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation • Paper • 2605.27366 • Published 11 days ago • 26
InstructSAM: Segment Any Instance with Any Instructions • Paper • 2605.26102 • Published 12 days ago • 17