Models, datasets, spaces, docs, etc. on Text Summarization (within Natural Language Processing, NLP)
Alejandro Rodriguez Pascual
alexrodpas
AI & ML interests
Applied ML/AI
Organizations
None yet
Computer Vision (CV)
Collection of useful computer vision models to fine-tune.
-
microsoft/resnet-50
Image Classification • 25.6M • Updated • 312k • • 471 -
google/vit-base-patch16-224-in21k
Image Feature Extraction • 86.4M • Updated • 1.49M • 391 -
google/vit-base-patch32-224-in21k
Image Feature Extraction • 88M • Updated • 5.78k • 19 -
facebook/dinov2-large
Image Feature Extraction • 0.3B • Updated • 3.03M • 98
CV - Object Detection
Object Detection Models
-
facebook/detr-resnet-50
Object Detection • 41.6M • Updated • 1.72M • • 916 -
facebook/detr-resnet-101-dc5
Object Detection • 60.7M • Updated • 1.29k • 19 -
facebook/detr-resnet-50-dc5
Object Detection • 41.6M • Updated • 1.47k • 6 -
google/owlvit-base-patch32
Zero-Shot Object Detection • 0.2B • Updated • 102k • 143
CV - Zero-shot Image Classification Models
Zero-shot Image Classification models
-
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 8.18M • 1.94k -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 16M • 828 -
laion/CLIP-ViT-bigG-14-laion2B-39B-b160k
Zero-Shot Image Classification • Updated • 92k • 302 -
kakaobrain/align-base
Zero-Shot Image Classification • Updated • 9.73k • 28
Generative AI - Image-to-Image
Collection of image to image editing, and image enhancement adapter models.
Generative AI - Image-to-Text
-
Salesforce/blip-image-captioning-large
Image-to-Text • 0.5B • Updated • 1.22M • 1.44k -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 1.75M • 828 -
microsoft/trocr-base-handwritten
Image-to-Text • 0.3B • Updated • 141k • 469 -
microsoft/git-large-coco
Image-to-Text • 0.4B • Updated • 1.83k • 104
Large Language Models (LLM)
Collection of useful Large Language Models (LLM) models and HF spaces.
CV - Image Classification
Image Classification models
-
facebook/deit-base-distilled-patch16-384
Image Classification • 87.6M • Updated • 441 • 7 -
facebook/convnextv2-base-1k-224
Image Classification • 88.7M • Updated • 494 • 4 -
facebook/deit-base-distilled-patch16-224
Image Classification • Updated • 10.7k • • 31 -
google/vit-base-patch32-384
Image Classification • 88.3M • Updated • 6.77k • • 23
CV - Image Segmentation
Image Segmentation models
-
facebook/maskformer-swin-large-coco
Image Segmentation • 0.2B • Updated • 1.35k • • 27 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 197k • • 178 -
facebook/detr-resnet-50-dc5-panoptic
Image Segmentation • 43M • Updated • 35 • 3 -
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 87.7k • • 36
Video Classification
Video Classification models
-
microsoft/xclip-base-patch32
Video Classification • 0.2B • Updated • 205k • 106 -
facebook/timesformer-base-finetuned-k400
Video Classification • Updated • 51.4k • 42 -
facebook/timesformer-base-finetuned-k600
Video Classification • Updated • 6.01k • 12 -
google/vivit-b-16x2
Video Classification • Updated • 6.51k • 11
Generative AI - Text-to-Image
NLP - Summarization
Models, datasets, spaces, docs, etc. on Text Summarization (within Natural Language Processing, NLP)
Large Language Models (LLM)
Collection of useful Large Language Models (LLM) models and HF spaces.
Computer Vision (CV)
Collection of useful computer vision models to fine-tune.
-
microsoft/resnet-50
Image Classification • 25.6M • Updated • 312k • • 471 -
google/vit-base-patch16-224-in21k
Image Feature Extraction • 86.4M • Updated • 1.49M • 391 -
google/vit-base-patch32-224-in21k
Image Feature Extraction • 88M • Updated • 5.78k • 19 -
facebook/dinov2-large
Image Feature Extraction • 0.3B • Updated • 3.03M • 98
CV - Image Classification
Image Classification models
-
facebook/deit-base-distilled-patch16-384
Image Classification • 87.6M • Updated • 441 • 7 -
facebook/convnextv2-base-1k-224
Image Classification • 88.7M • Updated • 494 • 4 -
facebook/deit-base-distilled-patch16-224
Image Classification • Updated • 10.7k • • 31 -
google/vit-base-patch32-384
Image Classification • 88.3M • Updated • 6.77k • • 23
CV - Object Detection
Object Detection Models
-
facebook/detr-resnet-50
Object Detection • 41.6M • Updated • 1.72M • • 916 -
facebook/detr-resnet-101-dc5
Object Detection • 60.7M • Updated • 1.29k • 19 -
facebook/detr-resnet-50-dc5
Object Detection • 41.6M • Updated • 1.47k • 6 -
google/owlvit-base-patch32
Zero-Shot Object Detection • 0.2B • Updated • 102k • 143
CV - Image Segmentation
Image Segmentation models
-
facebook/maskformer-swin-large-coco
Image Segmentation • 0.2B • Updated • 1.35k • • 27 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 197k • • 178 -
facebook/detr-resnet-50-dc5-panoptic
Image Segmentation • 43M • Updated • 35 • 3 -
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 87.7k • • 36
CV - Zero-shot Image Classification Models
Zero-shot Image Classification models
-
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 8.18M • 1.94k -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 16M • 828 -
laion/CLIP-ViT-bigG-14-laion2B-39B-b160k
Zero-Shot Image Classification • Updated • 92k • 302 -
kakaobrain/align-base
Zero-Shot Image Classification • Updated • 9.73k • 28
Video Classification
Video Classification models
-
microsoft/xclip-base-patch32
Video Classification • 0.2B • Updated • 205k • 106 -
facebook/timesformer-base-finetuned-k400
Video Classification • Updated • 51.4k • 42 -
facebook/timesformer-base-finetuned-k600
Video Classification • Updated • 6.01k • 12 -
google/vivit-b-16x2
Video Classification • Updated • 6.51k • 11
Generative AI - Image-to-Image
Collection of image to image editing, and image enhancement adapter models.
Generative AI - Text-to-Image
Generative AI - Image-to-Text
-
Salesforce/blip-image-captioning-large
Image-to-Text • 0.5B • Updated • 1.22M • 1.44k -
Salesforce/blip-image-captioning-base
Image-to-Text • Updated • 1.75M • 828 -
microsoft/trocr-base-handwritten
Image-to-Text • 0.3B • Updated • 141k • 469 -
microsoft/git-large-coco
Image-to-Text • 0.4B • Updated • 1.83k • 104