SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 149
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 38 items • Updated 2 days ago • 356
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 Jun 24, 2024 • 206
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 95
💫 StarCoder2 Collection StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 91
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated 2 days ago • 575