Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization Paper • 2605.28615 • Published 10 days ago • 2
TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation Paper • 2605.22355 • Published 16 days ago • 177
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 19 days ago • 78
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 25 days ago • 195
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 24 days ago • 270
openai/clip-vit-large-patch14 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 22.7M • 2.03k