coqui-ai/STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

/ 100

Emerging

This is a toolkit for developers who need to convert spoken audio into written text. You feed it audio files, and it produces text transcripts. It's designed for software engineers and data scientists building applications that require speech recognition capabilities.

2,572 stars. No commits in the last 6 months.

Use this if you are a developer looking to integrate speech-to-text functionality into your applications or research projects.

Not ideal if you are not a developer and simply need to transcribe audio, as this project is no longer actively maintained and has a learning curve for non-technical users.

speech-recognition audio-processing natural-language-processing machine-learning-development

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

2,572

Forks

302

Language

C++

License

MPL-2.0

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights