wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
This is a curated collection of resources for Speaker Diarization. It provides a comprehensive list of research papers, software tools, and datasets related to identifying 'who spoke when' in audio recordings. If you're a speech researcher, AI developer specializing in audio, or a data scientist working with conversational data, this resource helps you find the building blocks for your projects.
1,851 stars. No commits in the last 6 months.
Use this if you need to research, implement, or evaluate systems that automatically label speech segments in a recording with the identity of the speaker.
Not ideal if you are a casual user looking for an out-of-the-box solution to diarize audio without needing to understand the underlying technologies.
Stars
1,851
Forks
238
Language
—
License
Apache-2.0
Category
Last pushed
Jul 22, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/wq2012/awesome-diarization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
hitachi-speech/EEND
End-to-End Neural Diarization