wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

49 / 100 (Emerging)

This collection gathers research papers, software tools, and datasets for speaker diarization, the task of identifying "who spoke when" in audio recordings. It helps speech researchers, AI developers specializing in audio, and data scientists working with conversational data find the building blocks for their projects.

1,851 stars. No commits in the last 6 months.

Use this if you need to research, implement, or evaluate systems that automatically label speech segments in a recording with the identity of the speaker.

Not ideal if you are a casual user looking for an out-of-the-box solution to diarize audio without needing to understand the underlying technologies.

speech-recognition audio-analysis conversational-AI signal-processing machine-learning-research
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25
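
The four category scores listed above add up to the overall 49 / 100. The short check below illustrates that arithmetic; it is only an observation from the numbers shown on this page, and the assumption that the overall score is a plain sum of four 25-point categories is not a documented formula.

```python
# Sub-scores copied from the listing above; each is out of 25.
subscores = {
    "Maintenance": 2,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 21,
}

# Assumption: the overall score is the plain sum of the four categories.
overall = sum(subscores.values())
maximum = 25 * len(subscores)

print(f"{overall} / {maximum}")
```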


Stars: 1,851
Forks: 238
Language: (not listed)
License: Apache-2.0
Last pushed: Jul 22, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/wq2012/awesome-diarization"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
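
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single curl example above, and the assumption that the endpoint returns JSON is unverified, so treat the parsing step as a guess.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper; path layout is inferred)."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data; assumes the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to make a network request.
    print(build_quality_url("ml-frameworks", "wq2012", "awesome-diarization"))
```

Because unauthenticated use is limited to 100 requests/day, a script that polls many repositories would likely want to cache responses locally.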