Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published 16 days ago • 91
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published 9 days ago • 141
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Paper • 2602.12116 • Published Feb 12 • 4
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Paper • 2602.12116 • Published Feb 12 • 4
Understanding Generalization in Role-Playing Models via Information Theory Paper • 2512.17270 • Published Dec 19, 2025 • 1
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published Dec 15, 2025 • 109
MOA: Multi-Objective Alignment for Role-Playing Agents Paper • 2512.09756 • Published Dec 10, 2025 • 5
MOA: Multi-Objective Alignment for Role-Playing Agents Paper • 2512.09756 • Published Dec 10, 2025 • 5
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 106