Speaker Diarization Embedding Voice AI Tools

There are 52 speaker diarization embedding tools tracked. 1 scores above 70 (verified tier). The highest-rated is espnet/espnet at 83/100 with 9,768 stars. 1 of the top 10 is actively maintained.

Get all 52 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
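
The same request can be made from a script. The following is a minimal Python sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and the response schema is not described on this page, so the decoded JSON is returned unchanged. The `limit` default of 20 mirrors the curl command above; whether a larger value returns all 52 projects is an assumption that has not been verified here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="speaker-diarization-embedding", limit=20):
    """Build the dataset URL with the same query parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Request the dataset and decode the JSON body.

    The structure of the response is not documented on this page,
    so the decoded value is returned as-is for the caller to inspect.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects()` performs one network request, which counts against the daily quota mentioned above.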

# Tool Score Tier
1 espnet/espnet

End-to-End Speech Processing Toolkit

83
Verified
2 yeyupiaoling/PPASR

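End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to real-world use, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Confor...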

63
Established
3 yeyupiaoling/PaddlePaddle-DeepSpeech

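Speech recognition implemented with PaddlePaddle, for Chinese speech recognition. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.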

57
Established
4 flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

55
Established
5 pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework,...

51
Established
6 google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural...

51
Established
7 noahchalifour/rnnt-speech-recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

51
Established
8 philipperemy/deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

51
Established
9 zzw922cn/Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow

50
Established
10 santi-pdp/pase

Problem Agnostic Speech Encoder

49
Emerging
11 filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

48
Emerging
12 haoheliu/voicefixer_main

General Speech Restoration

48
Emerging
13 bricewalker/Hey-Jetson

Deep Learning based Automatic Speech Recognition with attention for the...

47
Emerging
14 modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained...

47
Emerging
15 gfdb/wav2aug

A general purpose task-agnostic speech augmentation policy

45
Emerging
16 Picovoice/falcon

On-device speaker diarization powered by deep learning

45
Emerging
17 Berkeley-Speech-Group/sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

44
Emerging
18 chenmingxiang110/Chinese-automatic-speech-recognition

Chinese speech recognition

43
Emerging
19 mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid...

42
Emerging
20 wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

42
Emerging
21 mostafa-kermaninia/speech-processing-toolkit

A comprehensive machine learning pipeline for robust Speaker Identification...

41
Emerging
22 yxshee/speech-command-recognition

speech command recognition using CNNs, with preprocessing, model training,...

41
Emerging
23 kgnlp/allophant

A multilingual phoneme recognizer capable of generalizing zero-shot to...

41
Emerging
24 lucko515/speech-recognition-neural-network

This is the end-to-end Speech Recognition neural network, deployed in Keras....

41
Emerging
25 shahules786/mayavoz

Pytorch based speech enhancement toolkit.

40
Emerging
26 weimeng23/speech-recognition-learning-resources

✅ A list of speech recognition learning resources including...

40
Emerging
27 Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

39
Emerging
28 tuanio/noisy-student-training-asr

Pytorch implementation of Noisy Student Training for Automatic Speech...

35
Emerging
29 matlab-deep-learning/deepspeech

This repo provides the pretrained DeepSpeech model in MATLAB. The model is...

35
Emerging
30 speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on...

34
Emerging
31 EuleMitKeule/speaker-recognition

Speaker recognition service for Home Assistant using voice embeddings. Train...

34
Emerging
32 victor369basu/End2EndAutomaticSpeechRecognition

In this repository, I have developed an end to end Automatic speech...

33
Emerging
33 ASR-project/Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....

31
Emerging
34 hanifabd/voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with some example use case like...

31
Emerging
35 idiap/zff_vad

Unsupervised Voice Activity Detection by Modeling Source and System...

29
Experimental
36 soohyunme/foreigner_speech

Foreigner Korean speech voice recognition hackathon - CSLEE

29
Experimental
37 RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment

The automatic system that can extract PRAAT-like speech features from raw...

28
Experimental
38 AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children

Preschool evaluation is crucial because it gives teachers and parents...

27
Experimental
39 AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex

Voice Activity Detector based on MFCC features and DNN model

27
Experimental
40 PranavPutsa1006/Speaker-Diarization

Identifying individual speakers in an audio stream based on the unique...

26
Experimental
41 IIP-Sogang/olkavs-avspeech

The Introduction of the OLKAVS Dataset

25
Experimental
42 zsl24/Speech-Processing-Doc

A document summarizing speech algorithm techniques

23
Experimental
43 A5hG0/Lyrics-To-Song-Generator

Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +...

22
Experimental
44 Erenyegar2/modular-auto-specch-recog-toolkit

🎤 Build and deploy advanced automatic speech recognition systems with this...

22
Experimental
45 thuantn210823/SpeakerDiarization

This repo reimplemented several popular EEND models, covering everything...

21
Experimental
46 rorizzz/TbDD

Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection

20
Experimental
47 jackaduma/speaker_recognition_models.pytorch

speaker recognition / speaker verification models in pytorch implementation

19
Experimental
48 jmaczan/asr-dysarthria

Research on Automatic Speech Recognition for dysarthric speech

19
Experimental
49 madebyaris/dsw-voice

Real-time voice noise reduction app for macOS with virtual microphone support

13
Experimental
50 saharshmehrotra/Stutter-Detection-and-Classification

System for classifying stuttering in speech and identification of various...

13
Experimental
51 zashin-AI/project

Speech-Recognition STT Project

12
Experimental
52 Nourine-Nadir/Speech_Processing

This repository explores speech processing techniques like noise...

11
Experimental

Comparisons in this category