Ovis-U1 Collection An unified model for multimodal understanding, text-to-image generation, and image editing. β’ 3 items β’ Updated Jul 2, 2025 β’ 6
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning β’ 5 items β’ Updated Aug 19, 2025 β’ 57
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published May 5, 2025 β’ 80
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) β’ 15 items β’ Updated Mar 25, 2025 β’ 65