msalhab96/RNN-Transducer

PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper

/ 100

Experimental

This project helps convert spoken words into written text. You provide audio files and their corresponding transcripts, and it learns to accurately transcribe the speech. This is useful for researchers and developers building speech recognition systems.

No commits in the last 6 months.

Use this if you are a researcher or developer who needs to train a custom speech-to-text model on your own specific audio data.

Not ideal if you need an out-of-the-box speech recognition tool for immediate use, as it currently lacks an inference module and demo.

speech-recognition audio-transcription natural-language-processing machine-learning-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights