Chuntao Dan
p051tr0n
·
AI & ML interests
all kinds
Organizations
Voice
Multimodal
-
Salesforce/blip-itm-base-coco
Updated • 22.2k • 28 -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 2.55M • 844 -
Salesforce/blip-vqa-base
Visual Question Answering • 0.4B • Updated • 673k • 187 -
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 6.34M • 1.97k
Agentic
Voice
Vision
Multimodal
-
Salesforce/blip-itm-base-coco
Updated • 22.2k • 28 -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 2.55M • 844 -
Salesforce/blip-vqa-base
Visual Question Answering • 0.4B • Updated • 673k • 187 -
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 6.34M • 1.97k
Robot