Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
6,120
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
shreydan/SmolVLM-256M-Detection
Image-Text-to-Text
•
0.3B
•
Updated
May 17
•
9
•
2
John6666/llama-joycaption-beta-one-hf-llava-nf4
Image-Text-to-Text
•
3B
•
Updated
May 17
•
307
•
4
Mike522/Qwen2.5-VL-3B-sft-LaTeX
Image-Text-to-Text
•
4B
•
Updated
May 17
•
8
iqbalamo93/gemma-3-12b-it-GGUF-q8_0
Image-Text-to-Text
•
12B
•
Updated
May 17
•
58
•
1
Mungert/UI-TARS-1.5-7B-GGUF
Image-Text-to-Text
•
8B
•
Updated
Sep 24
•
491
•
11
unsloth/InternVL3-1B
Image-Text-to-Text
•
0.9B
•
Updated
May 18
•
38
unsloth/InternVL3-2B
Image-Text-to-Text
•
2B
•
Updated
May 18
•
110
•
2
rootonchair/EraX-VL-7B-V1.0-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 19
•
123
FlashVL/FlashVL-2B-Dynamic
Image-Text-to-Text
•
3B
•
Updated
May 19
•
10
•
1
Ricky06662/TaskRouter-1.5B
Image-Text-to-Text
•
2B
•
Updated
Jun 12
•
219
•
2
unsloth/InternVL3-1B-GGUF
Image-Text-to-Text
•
0.6B
•
Updated
May 18
•
427
•
5
unsloth/InternVL3-2B-GGUF
Image-Text-to-Text
•
2B
•
Updated
May 18
•
413
•
1
unsloth/InternVL3-8B
Image-Text-to-Text
•
8B
•
Updated
May 18
•
2.3k
•
2
unsloth/InternVL3-8B-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 18
•
599
•
4
TIGER-Lab/PixelReasoner-RL-v1
Image-Text-to-Text
•
8B
•
Updated
Jun 11
•
677
•
9
unsloth/InternVL3-14B-GGUF
Image-Text-to-Text
•
15B
•
Updated
May 18
•
283
•
1
unsloth/InternVL3-38B
Image-Text-to-Text
•
38B
•
Updated
May 18
•
12
Mungert/SmolVLM-500M-Instruct-GGUF
Image-Text-to-Text
•
0.4B
•
Updated
Sep 24
•
479
unsloth/InternVL3-38B-GGUF
Image-Text-to-Text
•
33B
•
Updated
May 18
•
436
•
3
Mungert/Vintern-1B-v3_5-GGUF
Image-Text-to-Text
•
0.6B
•
Updated
Sep 24
•
48
unsloth/InternVL3-78B-GGUF
Image-Text-to-Text
•
73B
•
Updated
May 19
•
388
•
1
lordChipotle/nutrition-label-detector
Image-Text-to-Text
•
9B
•
Updated
May 19
•
12
unsloth/InternVL3-1B-Instruct
Image-Text-to-Text
•
0.9B
•
Updated
May 19
•
29
unsloth/InternVL3-1B-Instruct-GGUF
Image-Text-to-Text
•
0.6B
•
Updated
May 19
•
150
unsloth/InternVL3-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
May 19
•
23
unsloth/InternVL3-2B-Instruct-GGUF
Image-Text-to-Text
•
2B
•
Updated
May 19
•
219
unsloth/InternVL3-8B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 19
•
307
•
2
unsloth/InternVL3-14B-Instruct
Image-Text-to-Text
•
15B
•
Updated
May 19
•
13
unsloth/InternVL3-14B-Instruct-GGUF
Image-Text-to-Text
•
15B
•
Updated
May 19
•
522
•
4
TienAnh/Finetune_OCR_1B
Image-Text-to-Text
•
0.9B
•
Updated
May 22
•
28
•
1
Previous
1
...
98
99
100
Next