coqui-ai/STT-models

Open models for Coqui STT

/ 100

Emerging

This project provides pre-trained speech-to-text models that can convert spoken audio into written text. It takes audio recordings as input and produces transcribed text as output. This is useful for anyone who needs to quickly and accurately convert voice data into a searchable or editable text format, such as journalists, researchers, or content creators.

152 stars. No commits in the last 6 months.

Use this if you need to convert audio files into text transcripts without building a speech recognition system from scratch.

Not ideal if you require highly specialized transcription for unique accents or very noisy environments without any custom training.

audio-transcription voice-to-text content-creation research-analysis data-entry-automation

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 22 / 25

How are scores calculated?

Stars

152

Forks

Language

—

License

—

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights