stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".
This tool helps researchers and practitioners in speech technology objectively compare the quality of two speech audio inputs, like a reference recording and a synthesized voice. It calculates the Mel-Cepstral Distance (MCD), a metric indicating how different two sounds are, and can also provide a penalty for misaligned audio segments. Speech synthesis developers, voice conversion researchers, and anyone evaluating speech generation models would use this.
No commits in the last 6 months.
Use this if you need to quantify the perceptual difference between a generated speech audio and a reference speech audio, or compare different speech synthesis models.
Not ideal if you are looking for subjective human-perception based speech quality assessment or a tool for general audio comparison outside of speech.
Stars
65
Forks
12
Language
Python
License
MIT
Category
Last pushed
Aug 24, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/stefantaubert/mel-cepstral-distance"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models