koushikvikram/speech-recognition-deep-learning
🧑🎤🖥️📜A demonstration of using deep learning to classify audio data. Accompanied by an explanation of relevant Signal Processing concepts.
This project helps developers and engineers create systems that can understand spoken language, like the interactive voice response (IVR) systems used by banks. It takes audio recordings of speech as input and processes them to identify spoken words, effectively converting speech into text. This is designed for professionals building voice-controlled applications, customer service automation, or similar speech recognition features.
No commits in the last 6 months.
Use this if you need to build a system that accurately interprets spoken commands or converts spoken phrases into text without requiring individual speaker training.
Not ideal if you're looking for a ready-to-use speech recognition product rather than the underlying development code and concepts.
Stars
8
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Nov 24, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/koushikvikram/speech-recognition-deep-learning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aqibsaeed/Urban-Sound-Classification
Urban sound classification using Deep Learning
spotify/realbook
Easier audio-based machine learning with TensorFlow.
ArmDeveloperEcosystem/ml-audio-classifier-example-for-pico
ML Audio Classifier Example for Pico 🔊🔥🔔
IliaZenkov/sklearn-audio-classification
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering,...
mimbres/neural-audio-fp
Official implementation of Neural Audio Fingerprint (ICASSP 2021)