jindongwang/EasyEspnet
Making Espnet easier to use
This project simplifies building and deploying automatic speech recognition (ASR) systems. It takes your speech audio data, processes it, and outputs trained ASR models along with performance metrics such as Word Error Rate (WER) or Character Error Rate (CER). It is designed for machine learning engineers and researchers who work with speech technology and want a more straightforward way to manage their ASR experiments.
No commits in the last 6 months.
Use this if you need to train, evaluate, and deploy ASR models and find the standard ESPnet workflow too complex or difficult to integrate into Python-based pipelines.
Not ideal if you're not working with speech processing or if you prefer to use only shell scripts for your deep learning workflows.
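For readers unfamiliar with the metrics mentioned above, here is a minimal sketch of how WER and CER are commonly computed: the edit distance between reference and hypothesis, divided by the reference length. The function names `edit_distance`, `wer`, and `cer` are illustrative and are not taken from EasyEspnet or ESPnet; the sketch also assumes whitespace tokenization for words and counts spaces as characters, which is only one of several conventions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences (illustrative helper)."""
    prev = list(range(len(hyp) + 1))  # distance from empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from reference prefix to empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word Error Rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character Error Rate: same idea at character level (spaces included)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    print(wer("the cat sat on the mat", "the cat sat on mat"))
    print(cer("abc", "abd"))
```

Because insertions are counted as errors, both rates can exceed 1.0 when the hypothesis is much longer than the reference.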
Stars: 54
Forks: 4
Language: Python
License: Apache-2.0
Category:
Last pushed: Apr 09, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jindongwang/EasyEspnet"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
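The same request can be made from Python. The following is a rough sketch using only the standard library. It assumes the URL follows a category/owner/repo pattern, which is inferred from the single example above and is not documented here. The helper names `quality_url` and `fetch_quality` are hypothetical, and because the response format is not described on this page, the body is returned as raw text.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; the path layout beyond it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (hypothetical helper; pattern inferred from one example)."""
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category, owner, repo, timeout=10):
    """Perform the GET request and return the response body as text."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Requires network access; subject to the daily request limit noted above.
    print(fetch_quality("voice-ai", "jindongwang", "EasyEspnet"))
```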
Higher-rated alternatives
TensorSpeech/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project