microsoft/Phi-4-reasoning-vision-15B Image-Text-to-Text β’ 15B β’ Updated 6 days ago β’ 24.4k β’ 156
Running on Zero MCP Featured 87 GLM OCR Demo π 87 Multimodal OCR model for complex document understanding.
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated 13 days ago β’ 739k β’ 723