upskyy/Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

/ 100

Emerging

This project offers a specific architecture for building streaming automatic speech recognition (ASR) systems. It takes raw audio features and processes them to generate text transcriptions in real-time. It's designed for machine learning engineers and researchers who are building and evaluating advanced speech-to-text models.

113 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer working on developing or enhancing real-time speech recognition models and want to incorporate a Transformer-based architecture.

Not ideal if you are looking for a complete, out-of-the-box speech-to-text application or a solution that handles the entire training pipeline.

speech-to-text automatic-speech-recognition real-time-transcription machine-learning-engineering audio-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

113

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights