mozilla/deepspeech-playbook

A crash course for training speech recognition models using DeepSpeech.

/ 100

Emerging

This playbook provides a complete guide for training your own custom speech recognition models using DeepSpeech. You'll learn how to prepare your audio data, configure the model, and train it to convert spoken words into text. It's for developers or technical users who want to build custom transcription, voice control, or keyword spotting applications.

No commits in the last 6 months.

Use this if you need to build a custom speech-to-text solution for a specific language, accent, or domain where off-the-shelf models don't perform well.

Not ideal if you are looking for a plug-and-play API to transcribe audio without any model training or development work.

speech-to-text voice-applications audio-transcription machine-learning-engineering natural-language-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights