MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping • Paper • 2604.08364 • Published 7 days ago • 95
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks • Paper • 2604.08539 • Published 7 days ago • 46
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled • Image-Text-to-Text • 28B • Updated 9 days ago • 589k • 2.66k
CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions • Paper • 2603.26174 • Published 19 days ago • 5