Miscellaneous - a GayatriValley Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

GayatriValley 's Collections

Miscellaneous

updated 16 days ago

Build error

Featured

794

Unique3D

⚡

794

Create a 1M faces 3D colored model from an image!
Runtime error

53

Paligemma Doc

📚

53

Try PaliGemma on document understanding tasks
wangfuyun/PCM_Weights

Text-to-Image • Updated Oct 30, 2024 • 89 • 99
Running on Zero

464

Stable Audio Open Zero

🔥

464

Generate immersive audio from text prompts
Paused

Featured

314

PaliGemma Demo

🤲

314

Annotate and describe images with text prompts
atcsecure/dolphin-2.9.2-qwen72b-8.0bpw-h8-exl2

Text Generation • Updated Jun 9, 2024 • 2 • 2
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 204k • 3.25k
DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • 8B • Updated Aug 13, 2024 • 8.91k • 42
SakanaAI/DiscoPOP-zephyr-7b-gemma

Text Generation • 9B • Updated Jun 13, 2024 • 28 • 36
madebyollin/taesd3

Updated Jun 14, 2024 • 1.58k • 38
hpcai-tech/OpenSora-VAE-v1.2

0.4B • Updated Jun 17, 2024 • 7.16k • 57
Sleeping

Featured

84

NaRCan

💊

84

Edit your video with text prompts and style control
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF

Text Generation • 73B • Updated Aug 2, 2024 • 105 • 13
Build error

Featured

93

DiffIR2VR

👌

93

Video upscaler/restorer
CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 69 • 482
dphn/dolphin-vision-72b

Text Generation • 73B • Updated Jul 16, 2024 • 105 • 133
Running on Zero

Featured

72

Florence-2 for Videos

🎬

72

Annotate videos with object boxes and labels using captions
Running on Zero

132

FLUX.1-dev + Captioner

🐨

132

Generate images from prompts or images
Runtime error

Featured

367

Video Transcription Smart Summary

⚡

367

Generate summaries from YouTube videos or uploaded videos
qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 217 • 112
Runtime error

Featured

124

nanoLLaVA-1.5

🚀

124

Chat about images by uploading them
zai-org/codegeex4-all-9b

Text Generation • 9B • Updated Jul 18, 2024 • 4.65k • 265
Sleeping

10

Langflow Crewai

💻

10

Build and run language models visually
Running on Zero

Featured

979

Tile Upscaler

🚀

979

Enhance and upscale images with AI controlnet
Running

Featured

220

Whisper Timestamped

🕒

220

In-browser speech recognition w/ word-level timestamps
Running on Zero

Featured

2.08k

IDM VTON

👕

2.08k

High-fidelity Virtual Try-on
deepseek-ai/DeepSeek-V2-Chat-0628

Text Generation • 236B • Updated Jul 18, 2024 • 3.3k • 177
TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF

27B • Updated Jul 14, 2024 • 641 • 73
fal/AuraFlow

Text-to-Image • Updated Jul 18, 2024 • 290 • • 655
xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 108k • 1.7k
TheBloke/MythoMax-L2-13B-GPTQ

Text Generation • 13B • Updated Sep 27, 2023 • 364 • 219
Gryphe/MythoMax-L2-13b

Text Generation • Updated Apr 21, 2024 • 9.48k • • 374
Gryphe/Pantheon-RP-1.0-8b-Llama-3

Text Generation • 8B • Updated May 13, 2024 • 21 • • 51
Gryphe/Tiamat-8b-1.2-Llama-3-DPO

Text Generation • 8B • Updated May 3, 2024 • 4 • 6
BeaverLegacy/Smegmma-9B-v1

Text Generation • 10B • Updated Jul 13, 2024 • 59 • 51
mradermacher/Nymph_8B-i1-GGUF

8B • Updated Aug 2, 2024 • 52 • 2
Runtime error

29

MusiConGen

🪩

29
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Text Generation • 8B • Updated Sep 14, 2024 • 5.35k • • 197
FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 2.46k • 366
Running on Zero

MCP

25

Video-to-Audio Ldm

🎧

25

Video-to-Audio Generation with Hidden Alignment
CofeAI/Tele-FLM-1T

Text Generation • Updated Jan 10 • 159 • 82
maxin-cn/Cinemo

Image-to-Video • Updated Aug 14, 2024 • 14 • 32
Running on Zero

Featured

204

Cinemo

🎥

204

Multimodal Image-to-Video
Running

20

Mms Zeroshot

🌍

20

Transcribe audio in any language using text data
Running on Zero

Featured

56

AccDiffusion

🏆

56

Generate high‑quality images from text prompts
Running on Zero

Featured

185

Artist

🎨

185

Aesthetically Controllable Text-Driven Stylization w/o Train
Runtime error

95

EchoMimic

🐨

95

Generate lifelike video animations from images and audio
HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 161k • 302
parler-tts/parler-tts-mini-v1

Text-to-Speech • 0.9B • Updated Nov 25, 2024 • 8.43k • 152
parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 10.7k • 272
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • Updated Nov 20, 2024 • 11k • 165
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 745k • • 12.5k
Runtime error

215

CatVTON

🐈

215

Try on clothes virtually with images
wanglab/ecg-fm

Updated May 5, 2025 • 15
XLabs-AI/flux-lora-collection

Text-to-Image • Updated Aug 14, 2024 • 582
Runtime error

58

Vgg Heads

🖼

58
migtissera/Tess-3-Mistral-Nemo-12B

12B • Updated Sep 4, 2024 • 16 • 13
nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 75 • 106
DAMO-NLP-SG/VideoLLaMA2-72B

Visual Question Answering • 75B • Updated Aug 14, 2024 • 49 • 10
answerdotai/answerai-colbert-small-v1

33.4M • Updated Feb 14 • 1.27M • 160
mlabonne/Hermes-3-Llama-3.1-8B-lorablated-GGUF

8B • Updated Aug 16, 2024 • 1.09k • 31
labotollama3/lobotollama-5.5b

Text Generation • 6B • Updated Apr 22, 2024 • 4
Mozilla/whisperfile

Updated Oct 2, 2024 • 917 • 256
Runtime error

45

FAI Fuzer Medium v0.3

🎨

45

Generate enhanced images by blending foreground with custom backgrounds
ZhengPeng7/BiRefNet

Image Segmentation • 0.2B • Updated Feb 4 • 787k • 541
Runtime error

10k

Kolors Virtual Try-On

👕

10k

Try on clothes on a person image
fal/AuraFace-v1

Updated Aug 26, 2024 • 144
dphn/dolphin-2.9.4-gemma2-2b

3B • Updated Aug 27, 2024 • 48 • 38
pzc163/MiniCPMv2_6-prompt-generator

Updated Aug 24, 2024 • 39 • 49
Running on Zero

1.03k

CogVideoX-5B

🎥

1.03k

Text-to-Video
yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • 4B • Updated Sep 6, 2024 • 15 • 129
InstantX/FLUX.1-dev-Controlnet-Union

Updated Aug 26, 2024 • 9.36k • 471
Running on Zero

Featured

87

Qwen2-VL-2B

🔥

87

Generate text from images or videos
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • Updated Jan 12, 2025 • 3.24M • 496
Running

Featured

58

Groq Gradio Voice Assistant

👁

58

Turn spoken words into AI chat responses
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 25 • 29
facebook/sapiens

Updated Sep 20, 2024 • 113 • 244
Running on Zero

28

Tb Ocr

📈

28

Convert image text to markdown format
YuWangX/memoryllm-8b-chat

10B • Updated Nov 17, 2024 • 95 • 20
Running

211

HivisionIDPhotos

🌖

211

Generate passport‑ready ID photos from a portrait
virtuals-protocol/mario-videogamegen

Updated Sep 6, 2024 • 13
Running on Zero

266

Qwen2-VL-7B

🔥

266

Answer questions about your images
Running on Zero

Featured

283

Latent Navigation

🪐

283

Travel through the model latent space
mattshumer/Reflection-Llama-3.1-70B

Text Generation • 71B • Updated Sep 24, 2024 • 292 • 1.71k
Configuration error

Featured

115

ViewCrafter

🐨

115

Create a video from an image with camera motion
Runtime error

18

Text Image Analyzer

💻

18

Analyse any image with Llama3.2
vidore/colqwen2-v0.1

Visual Document Retrieval • Updated Mar 21, 2025 • 24.1k • 193
Runtime error

13

Llama 3.2 Vision Free

🐢

13
facebook/Self-taught-evaluator-llama3.1-70B

Updated Sep 30, 2024 • 42
openai/clip-vit-large-patch14-336

Zero-Shot Image Classification • Updated Oct 4, 2022 • 6.11M • 288
jasperai/Flux.1-dev-Controlnet-Upscaler

Image-to-Image • Updated Mar 22, 2025 • 2.78k • 859
Running on Zero

Featured

325

Diffusers Image Fill

🏃

325

Fill and edit images using masks
Running

37

PDF to Page Images Dataset

📂

37

Convert PDFs to individual page images
Running on Zero

Featured

72

ColPali fine-tuning Query Generator

🔍

72

Generate document retrieval queries from a page image
Runtime error

10

Vision Pipeline

🌍

10

Answer questions about uploaded images and documents
nvidia/NVLM-D-72B

Image-Text-to-Text • Updated Jan 14, 2025 • 129k • 775
Running on Zero

1.01k

Whisper Turbo

🤯

1.01k

Transcribe or translate audio and YouTube videos to text
davanstrien/ufo-ColPali

Viewer • Updated Sep 23, 2024 • 2.24k • 188 • 25
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 26 • 73
Build error

214

OpenMusic

🎶

214

Generate music from text descriptions
Running

458

PDF2Audio

📚

458

Generate spoken‑style scripts from documents
Running on Zero

239

Ultrapixel-demo

😻

239

Ultra-high resolution image synthesis
PleIAs/OCRonos-Vintage

Text Generation • 0.1B • Updated Aug 8, 2024 • 371 • 83
Running on Zero

275

EzAudio

🟣

275

Generate or edit realistic audio from text prompts
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Feb 4, 2025 • 20.8k • 1.53k
Running on CPU Upgrade

1k

Open VLM Leaderboard

🌎

1k

VLMEvalKit Evaluation Results Collection
Build error

64

ArxivCopilot

🏢

64

Generate personalized research profiles and chat with Arxiv Copilot
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 2 • 438
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 1.32k • 381
ICTNLP/Llama-3.1-8B-Omni

Updated Nov 14, 2024 • 111 • 418
fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 647 • 456
bartowski/Reflection-Llama-3.1-70B-GGUF

Text Generation • 71B • Updated Sep 7, 2024 • 596 • 52
lelapa/InkubaLM-0.4B

Text Generation • Updated Sep 5, 2024 • 233 • 58
Running

144

Qwen 2.5 Code Interpreter

🐍

144

Run code and get answers with AI
Runtime error

311

Virtual Try On

👕

311

High-fidelity Virtual Try-on
Runtime error

36

Ferret Demo

📚

36

Describe image contents with prompts
Running

64

ColPali 🤝 Vespa - Visual Retrieval

👀

64

Visual Retrieval with ColPali and Vespa
oxyapi/oxy-1-small

Text Generation • 15B • Updated Apr 30, 2025 • 283 • • 84
QuantFactory/MN-Chunky-Lotus-12B-GGUF

12B • Updated Dec 4, 2024 • 46 • 4
Running

25

ScholarCopilot

📊

25

Using RAG LLM to assist your academic writing
Running on Zero

613

Leffa

👗

613

Generate new person images with swapped clothes or poses
Lightricks/LTX-Video

Image-to-Video • Updated Jul 16, 2025 • 377k • • 2.13k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs