zhihanyang2022/gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
This tool helps you automatically identify the likely gender of a speaker from an audio recording. You provide a short voice sample, and it tells you whether the voice sounds male or female. This is useful for researchers in linguistics, speech science, or anyone analyzing demographic patterns in spoken audio.
No commits in the last 6 months.
Use this if you need a quick, accurate way to classify speaker gender from individual audio clips, even if the model wasn't specifically trained on the exact words spoken.
Not ideal if you need to classify gender for multiple speakers within a single, continuous recording, or if you require fine-grained analysis beyond simple male/female categorization.
Stars
27
Forks
5
Language
Jupyter Notebook
License
GPL-3.0
Category
Last pushed
Nov 18, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/zhihanyang2022/gender-audio-classification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models