Supersede: Diagnosing and Training the Memory-Update Gap in LLM Agents
Paper • 2606.27472 • Published 9 days ago
Supersede: Memory-Update Gap in LLM Agents
Collection • 4 items • Updated 4 days ago
An open RL environment in which the reward is temporal fact-currency. A GRPO-trained Qwen2.5-3B LoRA lifts held-out supersession from 9.0 to 16.7 percent.