HawkAaron/RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
This project implements the RNN Transducer model for building end-to-end speech recognition systems. It takes audio features as input and outputs transcribed text. It is aimed at machine learning engineers and researchers working on automatic speech recognition.
139 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer building a custom speech-to-text system and need an implementation of the RNN-Transducer model.
Not ideal if you are an end-user looking for a ready-to-use speech recognition application or a pre-trained model for immediate transcription.
Stars: 139
Forks: 31
Language: Python
License: —
Category:
Last pushed: Jun 07, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/HawkAaron/RNN-Transducer"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
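The same request can be sketched in Python using only the standard library. This is a minimal sketch under assumptions: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single curl example above, and the response is assumed to be JSON (the page does not document its format). How an API key is passed is not stated, so the sketch covers keyless access only.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL (path pattern inferred from the one example)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the listing data; assumes (unverified) that the body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("voice-ai", "HawkAaron", "RNN-Transducer")
```

Keeping URL construction separate from the network call makes the path logic easy to check without spending any of the 100 daily requests.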
Higher-rated alternatives
TensorSpeech/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project