Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 18 days ago • 192
JLT: Clean-Latent Prediction in Latent Diffusion Transformers Paper • 2605.27102 • Published 20 days ago • 33
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 21 days ago • 102
Forecasting Scientific Progress with Artificial Intelligence Paper • 2605.22681 • Published 25 days ago • 45
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection Paper • 2605.16865 • Published about 1 month ago • 8
ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop Paper • 2605.18746 • Published 28 days ago • 6
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 168
The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment Paper • 2604.06377 • Published Apr 7 • 7
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 327
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published Apr 6 • 236
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 343
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks Paper • 2603.24755 • Published Mar 25 • 30