rshahamiri/SpeechVision
Speech Vision (SV) is a dysarthric speech recognition system that takes a novel approach to dysarthric ASR: speech features are extracted visually, and SV learns to see the shape of the words pronounced by dysarthric individuals.
This system helps individuals with dysarthria have their speech recognized more accurately by computers. It works from a visual representation of how words are formed, rather than from audio features alone, and translates that representation into recognized text. It is aimed at speech therapists, assistive technology specialists, and researchers working with dysarthric individuals.
No commits in the last 6 months.
Use this if you need a speech recognition system specifically designed to interpret the unique patterns of dysarthric speech.
Not ideal if you are looking for a general-purpose speech recognition system for typical speech.
Stars: 9
Forks: 6
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Apr 08, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rshahamiri/SpeechVision"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
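The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It makes two assumptions that the listing does not confirm: that the endpoint returns JSON, and that the path follows a `<category>/<owner>/<repo>` layout (inferred from the single URL shown above). The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL.

    Assumption: the path layout is <category>/<owner>/<repo>, inferred
    from the one example URL; it is not documented in this listing.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response.

    Assumption: the body is JSON. If it is not, json.loads raises
    json.JSONDecodeError.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "rshahamiri", "SpeechVision")` issues the same GET request as the curl command. Without a key, each call counts against the 100 requests/day limit, so it may be worth caching the result locally.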
Higher-rated alternatives
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an...
Kalebu/image-to-sound-python-
A Python project for converting an image into audible sound using OCR and speech synthesis
itsanthonio/Vision-To-Speech
A vision to speech project
moritztng/cyberstudio
Face Animation from Text