wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

/ 100

Emerging

This is a curated collection of resources for Speaker Diarization. It provides a comprehensive list of research papers, software tools, and datasets related to identifying 'who spoke when' in audio recordings. If you're a speech researcher, AI developer specializing in audio, or a data scientist working with conversational data, this resource helps you find the building blocks for your projects.

1,851 stars. No commits in the last 6 months.

Use this if you need to research, implement, or evaluate systems that automatically label speech segments in a recording with the identity of the speaker.

Not ideal if you are a casual user looking for an out-of-the-box solution to diarize audio without needing to understand the underlying technologies.

speech-recognition audio-analysis conversational-AI signal-processing machine-learning-research

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

1,851

Forks

238

Language

—

License

Apache-2.0

Higher-rated alternatives

felixbur/nkululeko

Machine learning speaker characteristics

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

juanmc2005/diart

A python package to build AI-powered real-time audio applications

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

hitachi-speech/EEND

End-to-End Neural Diarization

Explore ML Frameworks

All categories Trending ML Framework directory Insights