HawkAaron/RNN-Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Emerging

This project provides an MXNet implementation of the RNN-Transducer model for building end-to-end speech recognition systems. It takes sequences of audio features as input and outputs transcribed text. It is aimed at machine learning engineers and researchers working on automatic speech recognition.

139 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer building a custom speech-to-text system and need an implementation of the RNN-Transducer model.

Not ideal if you are an end-user looking for a ready-to-use speech recognition application or a pre-trained model for immediate transcription.

speech-recognition automatic-transcription audio-processing natural-language-processing machine-learning-engineering

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 20 / 25

Stars 139

Language Python

License —

Higher-rated alternatives

TensorSpeech/TensorFlowASR

TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project
