mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated 18 days ago β’ 802k β’ 757
Running on Zero Featured 1.77k Qwen3-TTS Demo π 1.77k Generate speech audio via voice design, cloning, or preset speakers
Configuration error Featured 131 Ministral WebGPU β‘ 131 Frontier multimodal AI, running entirely in your browser.
Running on Zero MCP 404 Multimodal OCR π 404 Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ Updated Sep 17, 2025 β’ 64.7k β’ 1.61k
Running on Zero Featured 1.76k Dia 1.6B π― 1.76k Generate realistic dialogue from a script, using Dia!