VQQA: An Agentic Approach for Video Evaluation and Quality Improvement Paper • 2603.12310 • Published 4 days ago • 6
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement Paper • 2603.12310 • Published 4 days ago • 6
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents Paper • 2603.12634 • Published 3 days ago • 2
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space Paper • 2603.12648 • Published 3 days ago • 6
daVinci-Env: Open SWE Environment Synthesis at Scale Paper • 2603.13023 • Published 3 days ago • 17 • 2
Meta-Reinforcement Learning with Self-Reflection for Agentic Search Paper • 2603.11327 • Published 5 days ago • 6
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size Paper • 2603.07988 • Published 7 days ago • 2
Mobile-GS: Real-time Gaussian Splatting for Mobile Devices Paper • 2603.11531 • Published 4 days ago • 8
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers Paper • 2603.12245 • Published 4 days ago • 16
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published 4 days ago • 78
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 6 days ago • 151
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers Paper • 2603.12245 • Published 4 days ago • 16
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper • 2603.11593 • Published 4 days ago • 20
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge Paper • 2603.11665 • Published 4 days ago • 3