1ytic/open_stt_e2e

PyTorch end-to-end speech recognition

/ 100

Emerging

This project offers a set of tools to build custom speech recognition systems. It takes raw audio recordings and their corresponding text transcripts as input and produces highly accurate acoustic and language models. These models can then convert spoken audio into written text, benefiting researchers or developers working with speech-to-text applications, especially for the Russian language.

No commits in the last 6 months.

Use this if you need to train or fine-tune speech recognition models for the Russian language using your own audio datasets.

Not ideal if you are looking for a pre-built, ready-to-use speech-to-text application without needing to train custom models.

speech-recognition natural-language-processing audio-transcription voice-to-text machine-learning-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights