Speaker Diarization Embedding Voice AI Tools
52 speaker diarization embedding tools are tracked. One scores above 70 (the Verified tier). The highest-rated is espnet/espnet at 83/100 with 9,768 stars. One of the top 10 is actively maintained.
Get all 52 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
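For scripted access, the same request can be sketched in Python using only the standard library. The endpoint and the `domain`, `subcategory`, and `limit` query parameters are taken from the curl command above. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the shape of the JSON response is not documented here, so the sketch returns the parsed payload unchanged rather than assuming any field names. The example request uses `limit=20`; whether larger values are accepted is not stated on this page.

```python
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="voice-ai",
                      subcategory="speaker-diarization-embedding",
                      limit=20):
    """Build the request URL with the three query parameters shown in the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10.0):
    """Fetch the URL and return the parsed JSON as-is (response schema is an unknown)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality(build_quality_url())` issues the same request as the curl command and counts against the 100 requests/day anonymous quota, so caching the result locally is a sensible default.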
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | espnet/espnet<br>End-to-End Speech Processing Toolkit | 83/100 | Verified |
| 2 | yeyupiaoling/PPASR<br>End-to-end Chinese speech recognition built on PaddlePaddle, from beginner to production, with very simple starter examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Confor... | | Established |
| 3 | yeyupiaoling/PaddlePaddle-DeepSpeech<br>PaddlePaddle-based speech recognition for Chinese. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards. | | Established |
| 4 | flashlight/wav2letter<br>Facebook AI Research's Automatic Speech Recognition Toolkit | | Established |
| 5 | pannous/tensorflow-speech-recognition<br>🎙 Speech recognition using the tensorflow deep learning framework,... | | Established |
| 6 | google/uis-rnn<br>This is the library for the Unbounded Interleaved-State Recurrent Neural... | | Established |
| 7 | noahchalifour/rnnt-speech-recognition<br>End-to-end speech recognition using RNN Transducers in Tensorflow 2.0 | | Established |
| 8 | philipperemy/deep-speaker<br>Deep Speaker: an End-to-End Neural Speaker Embedding System. | | Established |
| 9 | zzw922cn/Automatic_Speech_Recognition<br>End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow | | Established |
| 10 | santi-pdp/pase<br>Problem Agnostic Speech Encoder | | Emerging |
| 11 | filippogiruzzi/voice_activity_detection<br>Voice Activity Detection based on Deep Learning & TensorFlow | | Emerging |
| 12 | haoheliu/voicefixer_main<br>General Speech Restoration | | Emerging |
| 13 | bricewalker/Hey-Jetson<br>Deep Learning based Automatic Speech Recognition with attention for the... | | Emerging |
| 14 | modelscope/ClearerVoice-Studio<br>An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained... | | Emerging |
| 15 | gfdb/wav2aug<br>A general purpose task-agnostic speech augmentation policy | | Emerging |
| 16 | Picovoice/falcon<br>On-device speaker diarization powered by deep learning | | Emerging |
| 17 | Berkeley-Speech-Group/sylber<br>Sylber: Syllabic Embedding Representation of Speech from Raw Audio | | Emerging |
| 18 | chenmingxiang110/Chinese-automatic-speech-recognition<br>Chinese speech recognition | | Emerging |
| 19 | mravanelli/pytorch-kaldi<br>pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid... | | Emerging |
| 20 | wq2012/SpeakerRecognitionFromScratch<br>Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家 | | Emerging |
| 21 | mostafa-kermaninia/speech-processing-toolkit<br>A comprehensive machine learning pipeline for robust Speaker Identification... | | Emerging |
| 22 | yxshee/speech-command-recognition<br>speech command recognition using CNNs, with preprocessing, model training,... | | Emerging |
| 23 | kgnlp/allophant<br>A multilingual phoneme recognizer capable of generalizing zero-shot to... | | Emerging |
| 24 | lucko515/speech-recognition-neural-network<br>This is the end-to-end Speech Recognition neural network, deployed in Keras.... | | Emerging |
| 25 | shahules786/mayavoz<br>Pytorch based speech enhancement toolkit. | | Emerging |
| 26 | weimeng23/speech-recognition-learning-resources<br>✅ A list of speech recognition learning resources including... | | Emerging |
| 27 | Speaker-Identification/You-Only-Speak-Once<br>Deep Learning - one shot learning for speaker recognition using Filter Banks | | Emerging |
| 28 | tuanio/noisy-student-training-asr<br>Pytorch implementation of Noisy Student Training for Automatic Speech... | | Emerging |
| 29 | matlab-deep-learning/deepspeech<br>This repo provides the pretrained DeepSpeech model in MATLAB. The model is... | | Emerging |
| 30 | speechbrain/speechbrain.github.io<br>The SpeechBrain project aims to build a novel speech toolkit fully based on... | | Emerging |
| 31 | EuleMitKeule/speaker-recognition<br>Speaker recognition service for Home Assistant using voice embeddings. Train... | | Emerging |
| 32 | victor369basu/End2EndAutomaticSpeechRecognition<br>In this repository, I have developed an end to end Automatic speech... | | Emerging |
| 33 | ASR-project/Multilingual-PR<br>Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM.... | | Emerging |
| 34 | hanifabd/voice-activity-detection-vad-realtime<br>Real-time Voice Activity Detection (VAD) with some example use case like... | | Emerging |
| 35 | idiap/zff_vad<br>Unsupervised Voice Activity Detection by Modeling Source and System... | | Experimental |
| 36 | soohyunme/foreigner_speech<br>Foreigner Korean speech voice recognition hackathon - CSLEE | | Experimental |
| 37 | RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment<br>The automatic system that can extract PRAAT-like speech features from raw... | | Experimental |
| 38 | AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children<br>Preschool evaluation is crucial because it gives teachers and parents... | | Experimental |
| 39 | AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex<br>Voice Activity Detector based on MFCC features and DNN model | | Experimental |
| 40 | PranavPutsa1006/Speaker-Diarization<br>Identifying individual speakers in an audio stream based on the unique... | | Experimental |
| 41 | IIP-Sogang/olkavs-avspeech<br>The Introduction of the OLKAVS Dataset | | Experimental |
| 42 | zsl24/Speech-Processing-Doc<br>A document summarizing speech algorithm techniques | | Experimental |
| 43 | A5hG0/Lyrics-To-Song-Generator<br>Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +... | | Experimental |
| 44 | Erenyegar2/modular-auto-specch-recog-toolkit<br>🎤 Build and deploy advanced automatic speech recognition systems with this... | | Experimental |
| 45 | thuantn210823/SpeakerDiarization<br>This repo reimplemented several popular EEND models, covering everything... | | Experimental |
| 46 | rorizzz/TbDD<br>Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection | | Experimental |
| 47 | jackaduma/speaker_recognition_models.pytorch<br>speaker recognition / speaker verification models in pytorch implementation | | Experimental |
| 48 | jmaczan/asr-dysarthria<br>Research on Automatic Speech Recognition for dysarthric speech | | Experimental |
| 49 | madebyaris/dsw-voice<br>Real-time voice noise reduction app for macOS with virtual microphone support | | Experimental |
| 50 | saharshmehrotra/Stutter-Detection-and-Classification<br>System for classifying stuttering in speech and identification of various... | | Experimental |
| 51 | zashin-AI/project<br>Speech-Recognition STT Project | | Experimental |
| 52 | Nourine-Nadir/Speech_Processing<br>This repository explores speech processing techniques like noise... | | Experimental |