BiometricVox/DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
This project helps identify specific speakers from speech segments, particularly in call center conversations, even when there's variability in how people speak over time. You provide existing speaker profiles (ivectors) and new speech segments, and it tells you if a blacklisted speaker is present and, if so, which one. This is for professionals in fields like security, forensics, or customer service who need to automate speaker identification.
No commits in the last 6 months.
Use this if you need to reliably identify known speakers from short speech segments in a noisy environment, such as in call center recordings, to screen against a blacklist.
Not ideal if you need to identify speakers from raw audio files, as this project requires pre-processed ivector data.
Stars
12
Forks
3
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 08, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/BiometricVox/DAE_SpeakerID"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.