tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
This tool helps researchers and analysts in fields like speech recognition or music analysis to extract detailed acoustic properties from WAV audio files. It processes an audio file and outputs precise measurements of the frequency, power, width, and dissonance for multiple formants within each short segment of the sound. Speech scientists, phoneticians, and musicologists can use this to quantify fundamental vocal or instrumental characteristics.
No commits in the last 6 months. Available on PyPI.
Use this if you need to precisely measure the acoustic resonance (formants) within speech or music audio to understand vocal tract shapes, speaker identity, emotional content, or musical timbre.
Not ideal if you only need general audio features like loudness or pitch, or if your analysis doesn't require the specific detailed measurements of formants.
Stars
28
Forks
5
Language
Python
License
MIT
Category
Last pushed
Jun 03, 2022
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tabahi/formantfeatures"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System