noahchalifour/rnnt-speech-recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

51
/ 100
Established

This project helps developers build custom speech recognition systems that can convert spoken audio into written text. You provide audio files (like MP3s or WAVs) and corresponding text transcripts, and it produces a trained model capable of transcribing new audio. This tool is for machine learning engineers or AI researchers who need to develop or fine-tune their own speech-to-text engines.

249 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer looking to implement or experiment with a custom RNN-Transducer based speech recognition model from scratch.

Not ideal if you need an out-of-the-box speech-to-text solution without extensive programming or model training.

speech-to-text voice-recognition natural-language-processing audio-processing AI-development
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

249

Forks

79

Language

Python

License

MIT

Last pushed

Jul 15, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/noahchalifour/rnnt-speech-recognition"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.