Terminal Agents Suffice for Enterprise Automation Paper • 2604.00073 • Published 3 days ago • 72
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 9 days ago • 167
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 4 days ago • 299
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 14 days ago • 305
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 4 days ago • 52
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 6 days ago • 137
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Paper • 2603.24800 • Published 8 days ago • 65
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 8 days ago • 55
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 8 days ago • 125
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 8 days ago • 116
EVA: Efficient Reinforcement Learning for End-to-End Video Agent Paper • 2603.22918 • Published 10 days ago • 42
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published 9 days ago • 93
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 12 days ago • 34
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM Paper • 2603.23386 • Published 10 days ago • 40