felixbur/nkululeko
Machine learning speaker characteristics
Nkululeko is a tool for speech processing researchers and practitioners to analyze audio and detect speaker characteristics like emotion, age, or gender. It takes raw audio data and configuration settings, then automatically extracts features, trains machine learning models, and evaluates results. This allows users to rapidly explore different models and understand speech patterns without needing to write extensive code.
Available on PyPI.
Use this if you need to quickly set up, run, and evaluate machine learning experiments on speech data to detect various speaker attributes.
Not ideal if you need deep, low-level control over the machine learning code or want to build a custom, highly specialized audio processing pipeline from scratch.
Stars
43
Forks
12
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Dependencies
35
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/felixbur/nkululeko"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
hitachi-speech/EEND
End-to-End Neural Diarization