awsaf49/audio_classification_models
Tensorflow Audio Classification Models
This tool helps researchers and data scientists classify different types of audio by applying advanced machine learning models. You provide raw audio files, and it tells you what category they belong to. It's especially useful for tasks like identifying synthesized speech.
No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically categorize audio data, such as distinguishing real voices from fake ones, or identifying specific sound events.
Not ideal if you need a simple, out-of-the-box solution without any programming or machine learning expertise.
Stars
13
Forks
4
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jul 21, 2023
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/awsaf49/audio_classification_models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System