rwightman/pytorch-commands

Some PyTorch code for the Kaggle Speech Recognition Challenge

/ 100

Experimental

This project helps machine learning engineers and researchers quickly train and evaluate deep learning models for speech recognition tasks, specifically for identifying spoken commands. It takes audio datasets as input and outputs trained PyTorch models capable of classifying spoken words or sounds. This is ideal for those working on voice control systems or audio event detection.

No commits in the last 6 months.

Use this if you need a pre-built, high-performing PyTorch solution to classify short audio commands or sounds from a dataset.

Not ideal if you need a general-purpose speech-to-text transcription system or real-time voice command processing.

speech-recognition audio-classification voice-commands machine-learning-engineering deep-learning-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights