mozilla-ai/speech-to-text-finetune

Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language

/ 100

Emerging

This tool helps you accurately transcribe spoken audio into text, especially for languages or accents that general speech-to-text tools might struggle with. You provide your own audio recordings and their correct transcriptions to create a specialized speech recognition model. It's designed for language experts, researchers, or content creators who need highly accurate transcriptions for specific audio.

Use this if you need to create a high-accuracy speech-to-text model tailored to a unique language, dialect, or specialized vocabulary, and you have access to example audio and text pairs.

Not ideal if you just need to transcribe common languages with standard accuracy and don't want to invest time in creating a custom dataset or training a model.

language-transcription audio-analysis content-localization voice-recognition linguistic-research

No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

TuananhCR/Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

thinhlpg/vixtts-demo

A Vietnamese Voice Cloning Text-to-Speech Model ✨

NTT123/vietTTS

Vietnamese Text to Speech library

ekwek1/soprano-factory

Soprano-Factory: Train your own 2000x realtime text-to-speech model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights