LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 4 days ago • 228
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment Paper • 2604.12012 • Published 13 days ago • 10
Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 28 days ago • 52
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 27 days ago • 340