robot
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper
•
2503.15558
•
Published
•
50
Humanoid Policy ~ Human Policy
Paper
•
2503.13441
•
Published
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
Paper
•
2503.16408
•
Published
•
42
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Paper
•
2503.19757
•
Published
•
51
Gemini Robotics: Bringing AI into the Physical World
Paper
•
2503.20020
•
Published
•
29
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos
Paper
•
2503.17973
•
Published
•
8
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
Paper
•
2503.10546
•
Published
•
3
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills
Paper
•
2503.12533
•
Published
•
68
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
Paper
•
2503.13435
•
Published
•
18
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
Paper
•
2503.21860
•
Published
•
4
TesserAct: Learning 4D Embodied World Models
Paper
•
2504.20995
•
Published
•
22
CaRL: Learning Scalable Planning Policies with Simple Rewards
Paper
•
2504.17838
•
Published
•
4
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
Paper
•
2505.02835
•
Published
•
28
Interactive Post-Training for Vision-Language-Action Models
Paper
•
2505.17016
•
Published
•
6
ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems
Paper
•
2505.17295
•
Published
•
9
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models
Paper
•
2506.07961
•
Published
•
11
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence
Paper
•
2506.10600
•
Published
•
8
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
Paper
•
2507.23682
•
Published
•
23
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies
Paper
•
2508.08113
•
Published
•
11
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Paper
•
2508.05635
•
Published
•
73