robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
This project helps machine learning engineers and researchers to quickly experiment with different model architectures for Automatic Speech Recognition (ASR). You provide audio data, and it outputs trained speech-to-text models. It's designed for individuals working on developing and refining speech recognition systems.
243 stars. No commits in the last 6 months.
Use this if you are an ML researcher or engineer looking to rapidly prototype and test various deep learning models for speech-to-text conversion.
Not ideal if you need a ready-to-use speech recognition application without delving into model architecture experimentation and training.
Stars
243
Forks
76
Language
Python
License
AGPL-3.0
Category
Last pushed
Mar 17, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/robmsmt/KerasDeepSpeech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...
githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)