freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

/ 100

Emerging

Espresso helps researchers and developers build advanced automatic speech recognition (ASR) systems. It takes audio data and language model configurations as input and produces highly accurate text transcripts from spoken language. This toolkit is ideal for speech scientists and machine learning engineers developing new speech-to-text technologies.

940 stars. No commits in the last 6 months.

Use this if you are a researcher or developer who needs to train state-of-the-art, end-to-end neural speech recognition models on large datasets.

Not ideal if you are looking for an out-of-the-box speech-to-text application for immediate use, rather than a development toolkit.

speech-to-text voice-AI-development audio-processing natural-language-processing machine-learning-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

940

Forks

116

Language

Python

License

—

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights