Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
biki96 's Collections
image-text-to-video
I2I
Face Swap
Embedding
A2A
TTS
Text2Image
LLM
IT3D
OCR
I2V
STT
diffusion

STT

updated Feb 5
Upvote
-

  • Running on CPU Upgrade
    Agents
    Featured
    1.35k

    Open ASR Leaderboard

    🏆
    1.35k

    Explore and compare speech recognition model benchmarks


  • nvidia/canary-qwen-2.5b

    Automatic Speech Recognition • 3B • Updated 29 days ago • 73.5k • 427

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • 0.6B • Updated about 4 hours ago • 317k • 855

  • nvidia/parakeet-tdt-0.6b-v2

    Automatic Speech Recognition • Updated Apr 13 • 200k • 1.47k

  • stabilityai/stable-video-diffusion-img2vid

    Image-to-Video • Updated Jul 10, 2024 • 56.3k • 1.03k

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated Mar 27 • 284 • 347

  • mistralai/Voxtral-Mini-4B-Realtime-2602

    Automatic Speech Recognition • 4B • Updated Mar 11 • 1.43M • 855
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs