Prediction with Action: Visual Policy Learning via Joint Denoising Process Paper • 2411.18179 • Published Nov 27, 2024
HiRT: Enhancing Robotic Control with Hierarchical Robot Transformers Paper • 2410.05273 • Published Sep 12, 2024 • 1
UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent Paper • 2501.18867 • Published Jan 31, 2025
Improving Vision-Language-Action Model with Online Reinforcement Learning Paper • 2501.16664 • Published Jan 28, 2025 • 1
Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution Paper • 2505.17673 • Published May 23, 2025
Robix: A Unified Model for Robot Interaction, Reasoning and Planning Paper • 2509.01106 • Published Sep 1, 2025 • 52
UniCoD: Enhancing Robot Policy via Unified Continuous and Discrete Representation Learning Paper • 2510.10642 • Published Oct 12, 2025
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models Paper • 2601.03309 • Published Jan 6, 2026 • 1
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations Paper • 2412.14803 • Published Dec 19, 2024 • 1