gentaiscool/end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
This tool helps developers build custom speech-to-text systems. You feed it audio files and their corresponding text transcripts, and it trains a model that can then convert spoken language into written text. This is for software engineers or machine learning practitioners who need to integrate automatic speech recognition into their applications.
304 stars. No commits in the last 6 months.
Use this if you are a developer who wants to train or fine-tune an end-to-end speech recognition model on your own audio datasets for a custom application.
Not ideal if you are an end-user simply looking for a ready-to-use speech-to-text application without needing to build or train a model.
Stars: 304
Forks: 62
Language: Python
License: MIT
Category:
Last pushed: Jun 02, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gentaiscool/end2end-asr-pytorch"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
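The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names quality_url and fetch_quality are hypothetical, the category/owner/repo path layout is inferred from the single curl URL above, and the JSON response format is an assumption because the page does not document it. How an API key is supplied is also not stated here, so the sketch covers only keyless access.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body as JSON (assumed response format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the network request.
    print(quality_url("voice-ai", "gentaiscool", "end2end-asr-pytorch"))
```

Building the URL separately from the request makes it possible to check the path without spending any of the daily request quota.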
Higher-rated alternatives
TensorSpeech/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project