shahad-mahmud/incremental_learning_for_asr
Incremental learning for automatic speech recognition (ASR)
This project helps improve the accuracy of automatic speech recognition (ASR) systems over time by allowing them to learn from new audio data without forgetting what they've already learned. It takes audio files and their corresponding text transcripts as input and produces an updated ASR model that is better at converting speech to text. This is designed for ASR developers, researchers, or speech technologists who manage and train speech-to-text models.
No commits in the last 6 months.
Use this if you need to continually update your speech-to-text models with new audio data to improve performance, rather than retraining from scratch each time.
Not ideal if you are looking for a ready-to-use speech-to-text API or a tool for basic audio transcription without the need for custom model training.
Stars
8
Forks
1
Language
Python
License
—
Category
Last pushed
Apr 17, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shahad-mahmud/incremental_learning_for_asr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project