rorizzz/YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
This tool helps speech-language pathologists, researchers, or educators automatically identify and pinpoint disfluent speech segments in audio recordings. You provide an audio file, and it outputs labels indicating where stuttering or other dysfluencies occur. This assists in analyzing speech patterns and assessing dysfluency.
No commits in the last 6 months.
Use this if you need to accurately detect and locate speech dysfluencies in audio recordings for research, diagnostic, or educational purposes.
Not ideal if you need to synthesize disfluent speech or require a real-time, live speech analysis solution.
Stars
20
Forks
3
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 04, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/rorizzz/YOLO-Stutter"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.