koudounasalkis/AI4Voice
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
This project helps medical professionals analyze voice recordings to detect and classify voice disorders. You input raw voice audio, such as spoken sentences or sustained vowel sounds, and it outputs predictions about the presence and type of vocal pathologies. This tool is designed for phoniatricians, speech-language pathologists, and researchers in voice health.
No commits in the last 6 months.
Use this if you need an automated way to screen voice recordings for signs of dysphonia or other voice disorders across different types of vocal tasks.
Not ideal if you require real-time diagnosis in a clinical setting without further human expert review.
Stars
15
Forks
4
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 11, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/koudounasalkis/AI4Voice"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.