Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published 3 days ago • 63
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 2 days ago • 82
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 3 days ago • 12
AceFF: A State-of-the-Art Machine Learning Potential for Small Molecules Paper • 2601.00581 • Published 7 days ago • 1
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing Paper • 2601.01720 • Published 4 days ago • 4
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 2 days ago • 24
CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving Paper • 2601.01874 • Published 3 days ago • 17
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 4 days ago • 27
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published 2 days ago • 36
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence Paper • 2512.22334 • Published 13 days ago • 32
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 3 days ago • 20
GARDO: Reinforcing Diffusion Models without Reward Hacking Paper • 2512.24138 • Published 9 days ago • 28
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 3 days ago • 26
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published 3 days ago • 30
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer Paper • 2601.01425 • Published 4 days ago • 43