TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Paper • 2512.14698 • Published Dec 16, 2025 • 27
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers Paper • 2606.13289 • Published 3 days ago • 25
InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning Paper • 2606.12195 • Published 4 days ago • 20
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published Mar 26 • 133
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning Paper • 2510.11606 • Published Oct 13, 2025 • 6