Speaker Diarization Embedding Voice AI Tools

There are 52 speaker diarization embedding tools tracked. 1 score above 70 (verified tier). The highest-rated is espnet/espnet at 83/100 with 9,768 stars. 1 of the top 10 are actively maintained.

Get all 52 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	espnet/espnet End-to-End Speech Processing Toolkit	83	Verified	9,768	Python
2	yeyupiaoling/PPASR 基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Confor...	63	Established	875	Python
3	yeyupiaoling/PaddlePaddle-DeepSpeech 基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。	57	Established	758	Python
4	flashlight/wav2letter Facebook AI Research's Automatic Speech Recognition Toolkit	55	Established	6,446	C++
5	pannous/tensorflow-speech-recognition 🎙Speech recognition using the tensorflow deep learning framework,...	51	Established	2,176	Python
6	google/uis-rnn This is the library for the Unbounded Interleaved-State Recurrent Neural...	51	Established	1,589	Python
7	noahchalifour/rnnt-speech-recognition End-to-end speech recognition using RNN Transducers in Tensorflow 2.0	51	Established	249	Python
8	philipperemy/deep-speaker Deep Speaker: an End-to-End Neural Speaker Embedding System.	51	Established	939	Python
9	zzw922cn/Automatic_Speech_Recognition End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow	50	Established	2,839	Python
10	santi-pdp/pase Problem Agnostic Speech Encoder	49	Emerging	447	Python
11	filippogiruzzi/voice_activity_detection Voice Activity Detection based on Deep Learning & TensorFlow	48	Emerging	371	Python
12	haoheliu/voicefixer_main General Speech Restoration	48	Emerging	284	Python
13	bricewalker/Hey-Jetson Deep Learning based Automatic Speech Recognition with attention for the...	47	Emerging	199	Jupyter Notebook
14	modelscope/ClearerVoice-Studio An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained...	47	Emerging	3,962	Python
15	gfdb/wav2aug A general purpose task-agnostic speech augmentation policy	45	Emerging	16	Python
16	Picovoice/falcon On-device speaker diarization powered by deep learning	45	Emerging	69	Python
17	Berkeley-Speech-Group/sylber Sylber: Syllabic Embedding Representation of Speech from Raw Audio	44	Emerging	74	Jupyter Notebook
18	chenmingxiang110/Chinese-automatic-speech-recognition Chinese speech recognition	43	Emerging	159	Jupyter Notebook
19	mravanelli/pytorch-kaldi pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid...	42	Emerging	2,396	Python
20	wq2012/SpeakerRecognitionFromScratch Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家	42	Emerging	47	Python
21	mostafa-kermaninia/speech-processing-toolkit A comprehensive machine learning pipeline for robust Speaker Identification...	41	Emerging	4	Jupyter Notebook
22	yxshee/speech-command-recognition speech command recognition using CNNs, with preprocessing, model training,...	41	Emerging	4	Jupyter Notebook
23	kgnlp/allophant A multilingual phoneme recognizer capable of generalizing zero-shot to...	41	Emerging	29	Python
24	lucko515/speech-recognition-neural-network This is the end-to-end Speech Recognition neural network, deployed in Keras....	41	Emerging	190	HTML
25	shahules786/mayavoz Pytorch based speech enhancement toolkit.	40	Emerging	336	Python
26	weimeng23/speech-recognition-learning-resources :white_check_mark: A list of speech recognition learning resources including...	40	Emerging	68	—
27	Speaker-Identification/You-Only-Speak-Once Deep Learning - one shot learning for speaker recognition using Filter Banks	39	Emerging	171	Jupyter Notebook
28	tuanio/noisy-student-training-asr Pytorch implementation of Noisy Student Training for Automatic Speech...	35	Emerging	99	Python
29	matlab-deep-learning/deepspeech This repo provides the pretrained DeepSpeech model in MATLAB. The model is...	35	Emerging	7	MATLAB
30	speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on...	34	Emerging	374	HTML
31	EuleMitKeule/speaker-recognition Speaker recognition service for Home Assistant using voice embeddings. Train...	34	Emerging	17	Python
32	victor369basu/End2EndAutomaticSpeechRecognition In this repository, I have developed an end to end Automatic speech...	33	Emerging	34	Python
33	ASR-project/Multilingual-PR Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....	31	Emerging	258	Python
34	hanifabd/voice-activity-detection-vad-realtime Real-time Voice Activity Detection (VAD) with some example use case like...	31	Emerging	106	Python
35	idiap/zff_vad Unsupervised Voice Activity Detection by Modeling Source and System...	29	Experimental	24	Python
36	soohyunme/foreigner_speech Foreigner Korean speech voice recognition hackathon - CSLEE	29	Experimental	1	Python
37	RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment The automatic system that can extract PRAAT-like speech features from raw...	28	Experimental	4	Python
38	AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children Preschool evaluation is crucial because it gives teachers and parents...	27	Experimental	20	Jupyter Notebook
39	AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex Voice Activity Detector based on MFCC features and DNN model	27	Experimental	29	VHDL
40	PranavPutsa1006/Speaker-Diarization Identifying individual speakers in an audio stream based on the unique...	26	Experimental	18	Jupyter Notebook
41	IIP-Sogang/olkavs-avspeech The Introduction of the OLKAVS Dataset	25	Experimental	37	Python
42	zsl24/Speech-Processing-Doc 一个关于语音算法技术汇总的文档	23	Experimental	4	—
43	A5hG0/Lyrics-To-Song-Generator Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +...	22	Experimental	—	Python
44	Erenyegar2/modular-auto-specch-recog-toolkit 🎤 Build and deploy advanced automatic speech recognition systems with this...	22	Experimental	—	Python
45	thuantn210823/SpeakerDiarization This repo reimplemented several popular EEND models, covering everything...	21	Experimental	7	Python
46	rorizzz/TbDD Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection	20	Experimental	5	Jupyter Notebook
47	jackaduma/speaker_recognition_models.pytorch speaker recognition / speaker verification models in pytorch implementation	19	Experimental	4	—
48	jmaczan/asr-dysarthria Research on Automatic Speech Recognition for dysarthric speech	19	Experimental	19	Jupyter Notebook
49	madebyaris/dsw-voice Real-time voice noise reduction app for macOS with virtual microphone support	13	Experimental	2	Swift
50	saharshmehrotra/Stutter-Detection-and-Classification System for classifying stuttering in speech and identification of various...	13	Experimental	9	Jupyter Notebook
51	zashin-AI/project Speech-Recognition STT Project	12	Experimental	7	Jupyter Notebook
52	Nourine-Nadir/Speech_Processing This repository explores speech processing techniques like noise...	11	Experimental	3	Jupyter Notebook

Comparisons in this category

PPASR and PaddlePaddle-DeepSpeech (63 vs 57)