ai-adv-lab/deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

/ 100

Emerging

This project helps machine learning engineers build powerful Speech-to-Text (STT) models using the MXNet framework. You provide audio files paired with their corresponding text transcripts, along with a configuration file, and it outputs a trained STT model that can convert spoken language into written text. This is designed for practitioners who need to create custom speech recognition systems for various applications.

No commits in the last 6 months.

Use this if you are a machine learning engineer or data scientist looking to train a custom speech-to-text model using MXNet with advanced neural network architectures.

Not ideal if you need a ready-to-use speech-to-text solution without custom model training, or if you prefer a different deep learning framework.

Speech Recognition Natural Language Processing Machine Learning Engineering Deep Learning Model Training

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights