-
Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution
Paper • 2605.15301 • Published • 20 -
MMSkills: Towards Multimodal Skills for General Visual Agents
Paper • 2605.13527 • Published • 114 -
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design
Paper • 2605.15871 • Published • 13 -
Look Before You Leap: Autonomous Exploration for LLM Agents
Paper • 2605.16143 • Published • 7
Collections
Discover the best community collections!
Collections including paper arxiv:2605.13527
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147
-
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 104 -
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Paper • 2605.15128 • Published • 60 -
Orchard: An Open-Source Agentic Modeling Framework
Paper • 2605.15040 • Published • 18 -
MMSkills: Towards Multimodal Skills for General Visual Agents
Paper • 2605.13527 • Published • 114
-
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
Paper • 2603.26648 • Published • 43 -
OpenGame: Open Agentic Coding for Games
Paper • 2604.18394 • Published • 81 -
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
Paper • 2603.24440 • Published • 98 -
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
Paper • 2604.27419 • Published • 13
-
PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
Paper • 2601.09636 • Published • 8 -
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark
Paper • 2504.13805 • Published • 11 -
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
Paper • 2604.11784 • Published • 143 -
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
Paper • 2603.24533 • Published • 47
-
Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution
Paper • 2605.15301 • Published • 20 -
MMSkills: Towards Multimodal Skills for General Visual Agents
Paper • 2605.13527 • Published • 114 -
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design
Paper • 2605.15871 • Published • 13 -
Look Before You Leap: Autonomous Exploration for LLM Agents
Paper • 2605.16143 • Published • 7
-
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 104 -
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Paper • 2605.15128 • Published • 60 -
Orchard: An Open-Source Agentic Modeling Framework
Paper • 2605.15040 • Published • 18 -
MMSkills: Towards Multimodal Skills for General Visual Agents
Paper • 2605.13527 • Published • 114
-
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
Paper • 2603.26648 • Published • 43 -
OpenGame: Open Agentic Coding for Games
Paper • 2604.18394 • Published • 81 -
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
Paper • 2603.24440 • Published • 98 -
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
Paper • 2604.27419 • Published • 13
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147
-
PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
Paper • 2601.09636 • Published • 8 -
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark
Paper • 2504.13805 • Published • 11 -
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
Paper • 2604.11784 • Published • 143 -
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
Paper • 2603.24533 • Published • 47