sk-g/Speech-Recognition-Tensorflow-Challenge

Different CNN Models for keyword spotting in speech recognition

28
/ 100
Experimental

This project helps audio engineers and researchers automatically identify spoken keywords within audio recordings. It takes raw audio clips, converts them into visual spectrograms, and then processes these images to pinpoint specific words, even in noisy environments. The primary user would be someone involved in audio analysis or developing voice-controlled applications.

No commits in the last 6 months.

Use this if you need to build or evaluate a system for recognizing a limited set of spoken keywords from audio files, especially if you're working with spectrogram images.

Not ideal if you're looking for a general-purpose transcription service for arbitrary speech or if you need to process live audio streams.

speech-recognition keyword-spotting audio-analysis voice-interfaces signal-processing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

10

Forks

1

Language

Python

License

GPL-3.0

Last pushed

Jul 11, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sk-g/Speech-Recognition-Tensorflow-Challenge"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.