MBZUAI/CoME-VL
Mohamed Bin Zayed University of Artificial Intelligence
Image-Text-to-Text
Transformers
English
multimodal
charts
diagrams
pointing
localization
CoME-VL
arxiv: 2604.03231
License: apache-2.0
Branch: main
Path: CoME-VL/assets
Total size: 335 kB
3 contributors
History: 1 commit
Latest commit: ankanmbz, "Added the assets", 5be1187 (verified), 6 days ago
main_arct.png, 161 kB, xet, "Added the assets", 6 days ago
teaser_fig.png, 174 kB, xet, "Added the assets", 6 days ago