sooftware/End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
This is a collection of pre-built speech recognition models that help turn spoken language into text. It provides the core technology for understanding speech, which can then be integrated into various applications. This is designed for developers who are building systems that need to convert audio into written words, such as voice assistants, transcription services, or call center analytics.
No commits in the last 6 months.
Use this if you are a developer looking for readily available, well-known automatic speech recognition model architectures to integrate into your projects.
Not ideal if you need a complete, ready-to-use speech-to-text application, including training data or audio preprocessing tools, as this repository focuses only on the model structures.
Stars
38
Forks
5
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 10, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sooftware/End-to-End-Speech-Recognition-Models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project