rwightman/pytorch-commands
Some PyTorch code for the Kaggle Speech Recognition Challenge
This project helps machine learning engineers and researchers quickly train and evaluate deep learning models for speech recognition tasks, specifically for identifying spoken commands. It takes audio datasets as input and outputs trained PyTorch models capable of classifying spoken words or sounds. This is ideal for those working on voice control systems or audio event detection.
No commits in the last 6 months.
Use this if you need a pre-built PyTorch solution for classifying short audio commands or sounds from a dataset.
Not ideal if you need a general-purpose speech-to-text transcription system or real-time voice command processing.
Stars: 12
Forks: 7
Language: Python
License: not specified
Category:
Last pushed: Feb 07, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rwightman/pytorch-commands"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models