collection1 - a gee2001 Collection

updated about 14 hours ago

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Paper • 2601.22027 • Published 10 days ago • 74

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 3 days ago • 18

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published 6 days ago • 25

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Paper • 2602.02474 • Published 6 days ago • 46