Speaker Diarization Embedding ML Frameworks
Tools and frameworks for speaker diarization, speaker embedding, and speaker recognition/verification in audio. Does NOT include general speech recognition, speech synthesis, or voice cloning systems.
There are 32 speaker diarization embedding frameworks tracked. 4 score above 50 (established tier). The highest-rated is felixbur/nkululeko at 61/100 with 43 stars.
Get all 32 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=speaker-diarization-embedding&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
felixbur/nkululeko
Machine learning speaker characteristics |
|
Established |
| 2 |
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems |
|
Established |
| 3 |
juanmc2005/diart
A python package to build AI-powered real-time audio applications |
|
Established |
| 4 |
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification |
|
Established |
| 5 |
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets,... |
|
Emerging |
| 6 |
hitachi-speech/EEND
End-to-End Neural Diarization |
|
Emerging |
| 7 |
itmo-mbss-lab/sr_labs_book
The project is related to the development of labs for the ITMO Speaker... |
|
Emerging |
| 8 |
georgygospodinov/speech_course
Deep Learning for Speech |
|
Emerging |
| 9 |
metacore-stack/modular-auto-specch-recog-toolkit
Building a modular, open-source toolkit that advances automatic speech... |
|
Emerging |
| 10 |
BiometricVox/DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge |
|
Emerging |
| 11 |
rorizzz/YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection |
|
Emerging |
| 12 |
zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview
This repo is to list the references papers of 《Speaker Recognition Based on... |
|
Emerging |
| 13 |
matlab-deep-learning/wav2vec-2.0
This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB. |
|
Emerging |
| 14 |
MingLunHan/CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with... |
|
Emerging |
| 15 |
rorizzz/Stutter-Solver
Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection |
|
Emerging |
| 16 |
zabir-nabil/awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects,... |
|
Emerging |
| 17 |
Paradeluxe/Praditor
Praditor: A DBSCAN-Based Automation for Speech Onset Detection |
|
Emerging |
| 18 |
tarun-bisht/wav2vec2-asr
wav2vec2 asr with transformers |
|
Experimental |
| 19 |
kaistmm/seed-pytorch
[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement... |
|
Experimental |
| 20 |
debanjan06/noise-robust-asr
🔊 Advanced Noise-Robust ASR System with Dynamic Adaptation Cutting-edge... |
|
Experimental |
| 21 |
j-schmied/RealTimeSpeechRecognition
Various approaches for speech recognition and speaker diarization. |
|
Experimental |
| 22 |
shashikg/X-Vector-Based-Speaker-Diarization
Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker... |
|
Experimental |
| 23 |
JeffT13/rd-diarization
Diarizing Legal Proceedings with d-vectors. |
|
Experimental |
| 24 |
yuriyvnv/WAVe
Word Aligned Verification of Synthetic Speech for Automatic Speech Recognition |
|
Experimental |
| 25 |
Karthick47v2/mock-buddy-audio-server
audio processing service for mock-buddy |
|
Experimental |
| 26 |
lottev1991/grimesai-svs-labs
HTK-style label files for GrimesAI dry stems, for training SVS AI models. |
|
Experimental |
| 27 |
SimoneCff/SAND-Challenge-Task-1-Parthenope
classify dysarthria severity in ALS patients. |
|
Experimental |
| 28 |
NefelibataJay/DeepLearningWithPytorch
Implement part of the ASR model using pytorch deep learning |
|
Experimental |
| 29 |
ArunR1408/Dysarthric-Speech-Recognition-MATLAB
A project to classify dysarthric and non-dysarthric speech using deep... |
|
Experimental |
| 30 |
mark-alfred-griffiths-tech/ML
ML Stuttering Classificiation. P.I. Prof. Pete Howell (UCL) |
|
Experimental |
| 31 |
LIZHICHAOUNICORN/Toolkits
Algorithms and implementations, also some awesome notes. |
|
Experimental |
| 32 |
HonglingLei/Unsupervised-Speech-Recognition
Unsupervised speech-to-text transformation using the wave2vec_U algorithm |
|
Experimental |