OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification Paper • 2606.01476 • Published 5 days ago • 8
Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates Paper • 2606.03029 • Published 3 days ago • 5
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Paper • 2503.01739 • Published Mar 3, 2025 • 10
Synthetic Sandbox for Training Machine Learning Engineering Agents Paper • 2604.04872 • Published Apr 6 • 14
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents Paper • 2601.18217 • Published Jan 26 • 13
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Paper • 2511.21662 • Published Nov 26, 2025 • 11