TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 73
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published Dec 8, 2025 • 46
VecGlypher: Unified Vector Glyph Generation with Language Models Paper • 2602.21461 • Published 8 days ago • 11
VecGlypher: Unified Vector Glyph Generation with Language Models Paper • 2602.21461 • Published 8 days ago • 11
MedVLThinker: Simple Baselines for Multimodal Medical Reasoning Paper • 2508.02669 • Published Aug 4, 2025
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs Paper • 2510.25867 • Published Oct 29, 2025 • 7
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models Paper • 2504.00869 • Published Apr 1, 2025 • 10
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Paper • 2410.06244 • Published Oct 8, 2024 • 20
Efficient Meshy Neural Fields for Animatable Human Avatars Paper • 2303.12965 • Published Mar 23, 2023
VoCo-LLaMA: Towards Vision Compression with Large Language Models Paper • 2406.12275 • Published Jun 18, 2024 • 31