Chuntao Dan
p051tr0n
·
AI & ML interests
all kinds
Organizations
Voice
Multimodal
-
Salesforce/blip-itm-base-coco
Updated • 110k • 28 -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 2.47M • 859 -
Salesforce/blip-vqa-base
Visual Question Answering • 0.4B • Updated • 262k • 194 -
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 25.4M • 2.02k
Agentic
Voice
Vision
Multimodal
-
Salesforce/blip-itm-base-coco
Updated • 110k • 28 -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 2.47M • 859 -
Salesforce/blip-vqa-base
Visual Question Answering • 0.4B • Updated • 262k • 194 -
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 25.4M • 2.02k
Robot