robot - a PandaQQ Collection

updated Aug 17, 2025

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50
Humanoid Policy ~ Human Policy

Paper • 2503.13441 • Published Mar 17, 2025
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20, 2025 • 42
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Paper • 2503.19757 • Published Mar 25, 2025 • 51
Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published Mar 25, 2025 • 29
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos

Paper • 2503.17973 • Published Mar 23, 2025 • 8
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation

Paper • 2503.10546 • Published Mar 13, 2025 • 3
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16, 2025 • 68
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes

Paper • 2503.13435 • Published Mar 17, 2025 • 18
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning

Paper • 2503.21860 • Published Mar 27, 2025 • 4
TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published Apr 29, 2025 • 22
CaRL: Learning Scalable Planning Policies with Simple Rewards

Paper • 2504.17838 • Published Apr 24, 2025 • 4
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Paper • 2505.02835 • Published May 5, 2025 • 28
Interactive Post-Training for Vision-Language-Action Models

Paper • 2505.17016 • Published May 22, 2025 • 6
ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems

Paper • 2505.17295 • Published May 22, 2025 • 9
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Paper • 2506.07961 • Published Jun 9, 2025 • 11
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

Paper • 2506.10600 • Published Jun 12, 2025 • 8
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Paper • 2507.23682 • Published Jul 31, 2025 • 23
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Paper • 2508.08113 • Published Aug 11, 2025 • 11
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Paper • 2508.05635 • Published Aug 7, 2025 • 73