tugstugi/pytorch-speech-commands

Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge

39
/ 100
Emerging

This project offers a pre-trained model for recognizing simple voice commands, like "yes," "no," or "stop." It takes short audio clips (typically 1 second) as input and outputs the specific command spoken, enabling applications controlled by voice. It's designed for developers building voice-controlled interfaces or analyzing speech commands.

201 stars. No commits in the last 6 months.

Use this if you are a developer looking for a robust, pre-trained model to implement basic speech command recognition in your applications or research.

Not ideal if you need to recognize continuous speech, complex sentences, or commands not present in the Google Speech Commands dataset.

voice-user-interface speech-recognition audio-analysis voice-control
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 21 / 25

How are scores calculated?

Stars

201

Forks

45

Language

Python

License

Last pushed

Jan 19, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tugstugi/pytorch-speech-commands"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.