UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving Paper โข 2604.02190 โข Published 8 days ago โข 25
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper โข 2601.03252 โข Published Jan 6 โข 104
SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper โข 2512.20617 โข Published Dec 23, 2025 โข 43
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper โข 2512.13687 โข Published Dec 15, 2025 โข 106
Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers Paper โข 2510.07316 โข Published Oct 8, 2025 โข 3