Extract+Think Data and Models for Extract+Think as part of Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models markendo/Visual-Extraction-Tuning-382K Viewer • Updated Nov 25, 2025 • 382k • 51 markendo/llava-extract-qwen3-0.6B Image-Text-to-Text • 1.0B • Updated Nov 25, 2025 markendo/llava-extract-qwen3-1.7B Image-Text-to-Text • 2B • Updated Nov 25, 2025 • 14 markendo/llava-extract-from-scratch-qwen3-0.6B Image-Text-to-Text • 1.0B • Updated Nov 25, 2025 • 4
Extract+Think Data and Models for Extract+Think as part of Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models markendo/Visual-Extraction-Tuning-382K Viewer • Updated Nov 25, 2025 • 382k • 51 markendo/llava-extract-qwen3-0.6B Image-Text-to-Text • 1.0B • Updated Nov 25, 2025 markendo/llava-extract-qwen3-1.7B Image-Text-to-Text • 2B • Updated Nov 25, 2025 • 14 markendo/llava-extract-from-scratch-qwen3-0.6B Image-Text-to-Text • 1.0B • Updated Nov 25, 2025 • 4