Speaker Diarization Embedding ML Frameworks

Tools and frameworks for speaker diarization, speaker embedding, and speaker recognition/verification in audio. Does NOT include general speech recognition, speech synthesis, or voice cloning systems.

There are 32 speaker diarization embedding frameworks tracked. 4 score above 50 (established tier). The highest-rated is felixbur/nkululeko at 61/100 with 43 stars.

Get all 32 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 felixbur/nkululeko

Machine learning speaker characteristics

61
Established
2 claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

58
Established
3 juanmc2005/diart

A python package to build AI-powered real-time audio applications

54
Established
4 astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

51
Established
5 wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets,...

49
Emerging
6 hitachi-speech/EEND

End-to-End Neural Diarization

47
Emerging
7 itmo-mbss-lab/sr_labs_book

The project is related to the development of labs for the ITMO Speaker...

42
Emerging
8 georgygospodinov/speech_course

Deep Learning for Speech

38
Emerging
9 metacore-stack/modular-auto-specch-recog-toolkit

Building a modular, open-source toolkit that advances automatic speech...

36
Emerging
10 BiometricVox/DAE_SpeakerID

Denoising autoencoders for speaker identification on MCE 2018 challenge

35
Emerging
11 rorizzz/YOLO-Stutter

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

34
Emerging
12 zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview

This repo is to list the references papers of 《Speaker Recognition Based on...

34
Emerging
13 matlab-deep-learning/wav2vec-2.0

This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.

34
Emerging
14 MingLunHan/CIF-ColDec

[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with...

33
Emerging
15 rorizzz/Stutter-Solver

Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection

33
Emerging
16 zabir-nabil/awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects,...

32
Emerging
17 Paradeluxe/Praditor

Praditor: A DBSCAN-Based Automation for Speech Onset Detection

30
Emerging
18 tarun-bisht/wav2vec2-asr

wav2vec2 asr with transformers

29
Experimental
19 kaistmm/seed-pytorch

[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement...

28
Experimental
20 debanjan06/noise-robust-asr

🔊 Advanced Noise-Robust ASR System with Dynamic Adaptation Cutting-edge...

26
Experimental
21 j-schmied/RealTimeSpeechRecognition

Various approaches for speech recognition and speaker diarization.

25
Experimental
22 shashikg/X-Vector-Based-Speaker-Diarization

Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker...

22
Experimental
23 JeffT13/rd-diarization

Diarizing Legal Proceedings with d-vectors.

20
Experimental
24 yuriyvnv/WAVe

Word Aligned Verification of Synthetic Speech for Automatic Speech Recognition

20
Experimental
25 Karthick47v2/mock-buddy-audio-server

audio processing service for mock-buddy

18
Experimental
26 lottev1991/grimesai-svs-labs

HTK-style label files for GrimesAI dry stems, for training SVS AI models.

17
Experimental
27 SimoneCff/SAND-Challenge-Task-1-Parthenope

classify dysarthria severity in ALS patients.

13
Experimental
28 NefelibataJay/DeepLearningWithPytorch

Implement part of the ASR model using pytorch deep learning

11
Experimental
29 ArunR1408/Dysarthric-Speech-Recognition-MATLAB

A project to classify dysarthric and non-dysarthric speech using deep...

11
Experimental
30 mark-alfred-griffiths-tech/ML

ML Stuttering Classificiation. P.I. Prof. Pete Howell (UCL)

11
Experimental
31 LIZHICHAOUNICORN/Toolkits

Algorithms and implementations, also some awesome notes.

11
Experimental
32 HonglingLei/Unsupervised-Speech-Recognition

Unsupervised speech-to-text transformation using the wave2vec_U algorithm

10
Experimental

Comparisons in this category