robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

/ 100

Emerging

This project helps machine learning engineers and researchers to quickly experiment with different model architectures for Automatic Speech Recognition (ASR). You provide audio data, and it outputs trained speech-to-text models. It's designed for individuals working on developing and refining speech recognition systems.

243 stars. No commits in the last 6 months.

Use this if you are an ML researcher or engineer looking to rapidly prototype and test various deep learning models for speech-to-text conversion.

Not ideal if you need a ready-to-use speech recognition application without delving into model architecture experimentation and training.

speech-to-text ASR-model-development deep-learning-research audio-processing machine-learning-engineering

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

243

Forks

Language

Python

License

AGPL-3.0

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights