Beyond Prompts: Unconditional 3D Inversion for Out-of-Distribution Shapes Paper • 2604.14914 • Published 10 days ago • 5
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 4 days ago • 227
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 17 days ago • 240
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published 19 days ago • 16
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 17 days ago • 285
EgoSim: Egocentric World Simulator for Embodied Interaction Generation Paper • 2604.01001 • Published 25 days ago • 38
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Paper • 2604.00886 • Published 24 days ago • 6
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model Paper • 2604.00007 • Published Mar 9 • 19
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 27 days ago • 340
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published Mar 22 • 35
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published Feb 12 • 37
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published Mar 17 • 109
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248