koushikvikram/speech-recognition-deep-learning

🧑‍🎤🖥️📜A demonstration of using deep learning to classify audio data. Accompanied by an explanation of relevant Signal Processing concepts.

/ 100

Experimental

This project helps developers and engineers create systems that can understand spoken language, like the interactive voice response (IVR) systems used by banks. It takes audio recordings of speech as input and processes them to identify spoken words, effectively converting speech into text. This is designed for professionals building voice-controlled applications, customer service automation, or similar speech recognition features.

No commits in the last 6 months.

Use this if you need to build a system that accurately interprets spoken commands or converts spoken phrases into text without requiring individual speaker training.

Not ideal if you're looking for a ready-to-use speech recognition product rather than the underlying development code and concepts.

Voice AI development Customer service automation Speech-to-text IVR systems Audio classification

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

aqibsaeed/Urban-Sound-Classification

Urban sound classification using Deep Learning

spotify/realbook

Easier audio-based machine learning with TensorFlow.

ArmDeveloperEcosystem/ml-audio-classifier-example-for-pico

ML Audio Classifier Example for Pico 🔊🔥🔔

IliaZenkov/sklearn-audio-classification

An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering,...

mimbres/neural-audio-fp

Official implementation of Neural Audio Fingerprint (ICASSP 2021)

Explore ML Frameworks

All categories Trending ML Framework directory Insights