The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation Paper • 2605.21856 • Published 17 days ago • 8
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 11 days ago • 420
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation Paper • 2605.23271 • Published 16 days ago • 79