visheratin/laion-coco-nllb
Viewer β’ Updated β’ 894k β’ 1.18k β’ 44
How to use visheratin/nllb-clip-large-oc with OpenCLIP:
import open_clip
model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:visheratin/nllb-clip-large-oc')
tokenizer = open_clip.get_tokenizer('hf-hub:visheratin/nllb-clip-large-oc')NLLB-CLIP is a model that combines a text encoder from the NLLB model and an image encoder from the LAION CLIP. This allows us to extend the model capabilities to 201 languages of the Flores-200. NLLB-CLIP sets state-of-the-art on the Crossmodal-3600 dataset by performing very well on low-resource languages. You can find more details about the model in the paper.
I thank ML Collective for providing Google Cloud compute resources to train the OpenCLIP-compatible version of NLLB-CLIP.