kamilc/speech-recognition
Companion repository for the blog article: https://www.endpointdev.com/blog/2019/01/speech-recognition-with-tensorflow/
This repository converts spoken audio into written text, making voice recordings easier to process and analyze. You provide an audio file, and it outputs a transcript of what was said. It is aimed at anyone who needs to transcribe audio, such as researchers analyzing interviews or content creators subtitling videos.
No commits in the last 6 months.
Use this if you need a pre-trained model to accurately transcribe speech from audio files into text.
Not ideal if you need a real-time transcription service or want to train a custom speech recognition model from scratch without using TensorFlow.
Stars: 22
Forks: 27
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Feb 09, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kamilc/speech-recognition"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
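The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions not confirmed by this page: that the path segments after `/quality/` are category, owner, and repository (inferred from the single URL above), and that the endpoint returns JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the request URL.

    Assumption: the path layout is <category>/<owner>/<repo>,
    inferred from the one example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON. The field names are not
    documented here, so the parsed object is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("voice-ai", "kamilc", "speech-recognition"))
```

Because the unauthenticated limit is 100 requests/day, it may be worth caching the result of `fetch_quality` rather than calling it repeatedly.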
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models