Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players • Paper • 2605.28816 • Published 11 days ago • 420
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence • Paper • 2605.12882 • Published 25 days ago • 270
longtermrisk/Qwen3-8B-selectivesftjob-b2c5751cecde-plain-g0.0-b0.0-d0.0-s1234 • Updated 24 days ago • 1
PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors • Paper • 2605.06455 • Published May 7 • 3
Leveraging Verifier-Based Reinforcement Learning in Image Editing • Paper • 2604.27505 • Published Apr 30 • 57
Context-Value-Action Architecture for Value-Driven Large Language Model Agents • Paper • 2604.05939 • Published Apr 7 • 10
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval • Paper • 2604.04734 • Published Apr 6 • 13