SkyDocs/speaker-identification
Speaker Identification using Neural Net.
This project helps you identify who is speaking in an audio recording. You provide samples of different people's voices, and it learns to recognize them. The output tells you which known speaker is present in new audio, or if an unknown speaker is detected. This is useful for anyone who needs to automatically distinguish between multiple voices in spoken content, like transcribers or content analysts.
No commits in the last 6 months.
Use this if you need to automatically determine the identity of speakers in audio recordings from a known set of individuals.
Not ideal if you need to transcribe speech or identify emotions, as this tool focuses solely on speaker identity.
Stars
20
Forks
5
Language
Python
License
GPL-3.0
Category
Last pushed
Jul 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SkyDocs/speaker-identification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models