jindongwang/EasyEspnet

Making Espnet easier to use

/ 100

Emerging

This project simplifies building and deploying automatic speech recognition (ASR) systems. It takes your speech audio data, processes it, and outputs trained ASR models and their performance metrics like Word Error Rate (WER) or Character Error Rate (CER). It's designed for machine learning engineers and researchers who work with speech technology and want a more straightforward way to manage their ASR experiments.

No commits in the last 6 months.

Use this if you need to train, evaluate, and deploy ASR models and find the standard ESPNet workflow too complex or difficult to integrate into Python-based pipelines.

Not ideal if you're not working with speech processing or if you prefer to use only shell scripts for your deep learning workflows.

speech-recognition ASR machine-learning-engineering natural-language-processing deep-learning-deployment

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights