coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
This is a toolkit for developers who need to convert spoken audio into written text. You feed it audio files, and it produces text transcripts. It's designed for software engineers and data scientists building applications that require speech recognition capabilities.
2,572 stars. No commits in the last 6 months.
Use this if you are a developer looking to integrate speech-to-text functionality into your applications or research projects.
Not ideal if you are not a developer and simply need to transcribe audio, as this project is no longer actively maintained and has a learning curve for non-technical users.
Stars
2,572
Forks
302
Language
C++
License
MPL-2.0
Category
Last pushed
Mar 11, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/coqui-ai/STT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...