sk-g/Speech-Recognition-Tensorflow-Challenge

Different CNN Models for keyword spotting in speech recognition

/ 100

Experimental

This project helps audio engineers and researchers automatically identify spoken keywords within audio recordings. It takes raw audio clips, converts them into visual spectrograms, and then processes these images to pinpoint specific words, even in noisy environments. The primary user would be someone involved in audio analysis or developing voice-controlled applications.

No commits in the last 6 months.

Use this if you need to build or evaluate a system for recognizing a limited set of spoken keywords from audio files, especially if you're working with spectrogram images.

Not ideal if you're looking for a general-purpose transcription service for arbitrary speech or if you need to process live audio streams.

speech-recognition keyword-spotting audio-analysis voice-interfaces signal-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

GPL-3.0

Higher-rated alternatives

Picovoice/porcupine

On-device wake word detection powered by deep learning

MycroftAI/mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

arcosoph/nanowakeword

A lightweight, open-source, and intelligent wake word detection engine. Train custom,...

mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run...

OAID/cortex-m-kws

Cortex M KWS example with Tengine Lite.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights