Running on Zero 9 Qwen3-VL Multimodal Search Engine 🔥 9 Cross-modal text-image search powered by Qwen3-VL
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated Image-Text-to-Text • 9B • Updated Dec 15, 2025 • 6.25k • 142
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 64.2k • 298
huihui-ai/Huihui-MiniCPM-V-4_5-abliterated Image-Text-to-Text • 9B • Updated Sep 8, 2025 • 6.55k • 27
Running on Zero Featured 899 Joy Caption Beta One 🖼 899 Generate captions for images with various styles and options
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text • 0.5B • Updated Apr 8, 2025 • 93.3k • 117
Runtime error Featured 198 Better Florence 2 🔥 198 Analyze images to detect objects, generate captions, or perform OCR