- Agent Learning via Early Experience
  Paper • 2510.08558 • Published • 276
- Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
  Paper • 2510.08002 • Published • 24
- Self-Improving LLM Agents at Test-Time
  Paper • 2510.07841 • Published • 10
- The Denario project: Deep knowledge AI agents for scientific discovery
  Paper • 2510.26887 • Published • 8
Collections
Collections including paper arxiv:2511.16043
- Learning to Continually Learn via Meta-learning Agentic Memory Designs
  Paper • 2602.07755 • Published • 8
- Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
  Paper • 2511.16043 • Published • 110
- DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
  Paper • 2601.22153 • Published • 75
- OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
  Paper • 2604.10866 • Published • 66

- Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
  Paper • 2507.01352 • Published • 60
- A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models
  Paper • 2507.13563 • Published • 53
- Scaling Laws for Optimal Data Mixtures
  Paper • 2507.09404 • Published • 38
- Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation
  Paper • 2511.14993 • Published • 234

- Guided Self-Evolving LLMs with Minimal Human Supervision
  Paper • 2512.02472 • Published • 55
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 147
- Video Reasoning without Training
  Paper • 2510.17045 • Published • 9
- Agent Learning via Early Experience
  Paper • 2510.08558 • Published • 276

- Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
  Paper • 2511.16043 • Published • 110
- General Agentic Memory Via Deep Research
  Paper • 2511.18423 • Published • 170
- Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
  Paper • 2511.19900 • Published • 49
- MobiAgent: A Systematic Framework for Customizable Mobile Agents
  Paper • 2509.00531 • Published • 8

- SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
  Paper • 2506.04180 • Published • 34
- AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
  Paper • 2506.10540 • Published • 37
- AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
  Paper • 2506.10974 • Published • 19
- SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
  Paper • 2507.15245 • Published • 11

- AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
  Paper • 2602.17100 • Published • 4
- GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
  Paper • 2603.01059 • Published • 1
- Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
  Paper • 2603.00618 • Published
- Heterogeneous Agent Collaborative Reinforcement Learning
  Paper • 2603.02604 • Published • 197

- Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
  Paper • 2510.04618 • Published • 133
- Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
  Paper • 2511.16043 • Published • 110
- Memory in the Age of AI Agents
  Paper • 2512.13564 • Published • 157
- Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale
  Paper • 2512.10398 • Published • 14

- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
  Paper • 2508.13167 • Published • 129
- Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
  Paper • 2512.01374 • Published • 106
- Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
  Paper • 2511.16043 • Published • 110
- Agentic Entropy-Balanced Policy Optimization
  Paper • 2510.14545 • Published • 108