upskyy/Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
This project provides a Transformer Transducer architecture for building streaming automatic speech recognition (ASR) systems. It takes audio features as input and generates text transcriptions in real time. It is aimed at machine learning engineers and researchers who build and evaluate speech-to-text models.
113 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer developing or improving real-time speech recognition models and want to incorporate a Transformer-based architecture.
Not ideal if you are looking for a complete, out-of-the-box speech-to-text application or a solution that handles the entire training pipeline.
Stars: 113
Forks: 20
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 27, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/upskyy/Transformer-Transducer"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
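The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. It assumes the three trailing path segments in the curl URL are a category, a repository owner, and a repository name, and that the endpoint returns JSON; neither is confirmed on this page. The names `build_quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    single curl example; the page does not document the URL scheme.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError and the caller should inspect the raw body.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("voice-ai", "upskyy", "Transformer-Transducer"))
```

With the keyless limit of 100 requests/day stated above, a script that polls many repositories would likely need to cache responses or use a key.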
Higher-rated alternatives
TensorSpeech/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project