shitian-ni/speech-recognition-transfer-learning

Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow

/ 100

Experimental

This project helps you quickly build a system to recognize specific spoken commands, like "yes," "no," or "stop." It takes existing audio recordings of everyday sounds and adapts that knowledge to understand new short voice commands. Anyone working on voice-controlled devices, accessibility tools, or interactive voice response systems would find this useful.

No commits in the last 6 months.

Use this if you need to train a speech command recognition system using a relatively small dataset of voice commands, leveraging a pre-existing, larger dataset of general urban sounds.

Not ideal if you're building a system for transcribing full sentences or need to recognize speech in a language not represented in the available transfer learning datasets.

voice-control speech-recognition audio-processing embedded-systems human-computer-interaction

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights