Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 9 days ago • 79
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24 • 49
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning Paper • 2509.22576 • Published Sep 26 • 134
WoW: Towards a World omniscient World model Through Embodied Interaction Paper • 2509.22642 • Published Sep 26 • 14
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper • 2509.21268 • Published Sep 25 • 104
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning Paper • 2509.20712 • Published Sep 25 • 19
VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models Paper • 2509.19803 • Published Sep 24 • 120
AGP Collection [ECAI 2025 Spotlight] Adaptive Graph Pruning for Multi-Agent Communication • 3 items • Updated Sep 26 • 2
STEVE Collection See and Think: Embodied Agent in Virtual Environment • 6 items • Updated Sep 22, 2024 • 1