iamjanvijay/rnnt_decoder_cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

/ 100

Emerging

This project helps developers integrate a highly efficient speech-to-text decoding process into their applications. It takes raw speech audio features and a defined vocabulary, and rapidly produces the most probable text transcripts. It is designed for engineers building real-time speech recognition systems that demand fast and accurate transcription.

Use this if you are a software engineer or machine learning engineer building a production-level speech recognition system and need to quickly convert speech into text.

Not ideal if you are a non-developer seeking a ready-to-use speech-to-text application or a general-purpose natural language processing tool.

speech-recognition real-time-transcription voice-ai audio-processing

No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Cuda

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights