Proact-VL: A Proactive VideoLLM for Real-Time AI Companions Paper • 2603.03447 • Published 9 days ago • 31
Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension Paper • 2602.13310 • Published about 1 month ago • 8
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published about 1 month ago • 50
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published Jan 15 • 36
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published Jan 15 • 36 • 6
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published Jan 15 • 36