mozilla-ai/speech-to-text-finetune

Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language

44
/ 100
Emerging

This tool helps you accurately transcribe spoken audio into text, especially for languages or accents that general speech-to-text tools might struggle with. You provide your own audio recordings and their correct transcriptions to create a specialized speech recognition model. It's designed for language experts, researchers, or content creators who need highly accurate transcriptions for specific audio.

Use this if you need to create a high-accuracy speech-to-text model tailored to a unique language, dialect, or specialized vocabulary, and you have access to example audio and text pairs.

Not ideal if you just need to transcribe common languages with standard accuracy and don't want to invest time in creating a custom dataset or training a model.

language-transcription audio-analysis content-localization voice-recognition linguistic-research
No Package No Dependents
Maintenance 6 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

63

Forks

9

Language

Python

License

Apache-2.0

Last pushed

Oct 23, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mozilla-ai/speech-to-text-finetune"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.