BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities Paper • 2510.08759 • Published Oct 9, 2025
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Paper • 2501.07542 • Published Jan 13, 2025