stevenhillis/awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
Getting accurate transcriptions of spoken words can be tricky, especially for unusual terms like proper nouns, jargon, or rare words. This list compiles academic papers that explore techniques to improve the accuracy of Automatic Speech Recognition (ASR) systems for these specific words by leveraging context. Speech scientists and engineers would use this resource to find research on how to make ASR outputs more reliable for critical terms.
No commits in the last 6 months.
Use this if you are developing or researching ASR systems and need to improve their accuracy when transcribing specialized vocabulary, names, or industry-specific terms.
Not ideal if you are an end-user simply looking for an off-the-shelf speech-to-text application for general use, rather than researching ASR system enhancements.
Stars
80
Forks
8
Language
—
License
Apache-2.0
Category
Last pushed
May 10, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/stevenhillis/awesome-asr-contextualization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project