coqui-ai/STT-models
Open models for Coqui STT
This project provides pre-trained speech-to-text models that can convert spoken audio into written text. It takes audio recordings as input and produces transcribed text as output. This is useful for anyone who needs to quickly and accurately convert voice data into a searchable or editable text format, such as journalists, researchers, or content creators.
152 stars. No commits in the last 6 months.
Use this if you need to convert audio files into text transcripts without building a speech recognition system from scratch.
Not ideal if you require highly specialized transcription for unique accents or very noisy environments without any custom training.
Stars
152
Forks
46
Language
—
License
—
Category
Last pushed
May 09, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/coqui-ai/STT-models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...