Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
SDewittCLathrop3PhD 's Collections
FINANCE
SPEECH TO TEXT
AGENTS
CHARACTER AI
RESEARCH ARXIV
TTS
PERSONALIZATION
VISION
GPT-OSS
DOCUMENT WRITER
PLAYGROUND
SPREADSHEET
LORAS
EMBEDDING
LAW
SEARCH
LEADERBOARD
HEALTH
VIDEO
WRITE
HARDWARE, VRAM
MODELS
SONGS
TRAINING
IMAGE EXPLANATION
IMAGES
OCR
SPACES

SPEECH TO TEXT

updated Nov 8
Upvote
-

  • Running
    Featured
    231

    Qwen3 ASR Demo

    👀
    231

    Convert audio to text with context and language options


  • Runtime error
    Featured
    2.63k

    Whisper

    📉
    2.63k

    Transcribe audio files or YouTube videos into text


  • openai/whisper-large-v3

    Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 5.41M • • 5.21k

  • Running
    49

    Qwen3 Omni Captioner Demo

    🐠
    49

    Generate captions from audio


  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22 • 24.4k • 180

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated 18 days ago • 73.2k • 453

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated 11 days ago • 2.63k • 314

  • Running
    Featured
    1.21k

    Whisper Web

    🎤
    1.21k

    Convert spoken words into text

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs