Training Large Language Models to Predict Clinical Events Paper • 2605.12817 • Published 25 days ago • 17
Semantic Generative Tuning for Unified Multimodal Models Paper • 2605.18714 • Published 19 days ago • 11
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 24 days ago • 270
Continual Harness: Online Adaptation for Self-Improving Foundation Agents Paper • 2605.09998 • Published 26 days ago • 17
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published about 1 month ago • 233
When to Think, When to Speak: Learning Disclosure Policies for LLM Reasoning Paper • 2605.03314 • Published May 6 • 2
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published Apr 30 • 57
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 22