sooftware/End-to-End-Speech-Recognition-Models

PyTorch implementation of automatic speech recognition models.

/ 100

Emerging

This is a collection of pre-built speech recognition models that help turn spoken language into text. It provides the core technology for understanding speech, which can then be integrated into various applications. This is designed for developers who are building systems that need to convert audio into written words, such as voice assistants, transcription services, or call center analytics.

No commits in the last 6 months.

Use this if you are a developer looking for readily available, well-known automatic speech recognition model architectures to integrate into your projects.

Not ideal if you need a complete, ready-to-use speech-to-text application, including training data or audio preprocessing tools, as this repository focuses only on the model structures.

speech-to-text voice-interfaces audio-transcription natural-language-processing AI-development

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights