scionoftech/DeepAsr
Keras (TensorFlow) implementations of Automatic Speech Recognition
This tool helps machine learning engineers and researchers convert spoken audio into text. It takes audio files (WAV or FLAC) as input and outputs the transcribed text, allowing you to train or fine-tune models to accurately recognize speech for various applications. It's designed for those building and experimenting with custom speech recognition systems.
No commits in the last 6 months. Available on PyPI.
Use this if you are a machine learning engineer or researcher who needs to build, customize, or fine-tune an automatic speech recognition (ASR) engine using your own data or pre-trained models.
Not ideal if you are a casual user looking for a ready-to-use, off-the-shelf speech-to-text application without needing to understand or train machine learning models.
Stars: 24
Forks: 11
Language: Jupyter Notebook
License: AGPL-3.0
Category:
Last pushed: Jan 13, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/scionoftech/DeepAsr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
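For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the listing does not confirm. The URL shape is taken from the curl command above.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example; the "voice-ai" segment is kept as-is.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/voice-ai"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON. Its schema is not described
    in the listing, so the parsed value is returned without interpretation.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("scionoftech", "DeepAsr"))
```

Calling `fetch_quality("scionoftech", "DeepAsr")` performs the actual request and counts against the daily request limit mentioned above, so a script that polls many repositories may want to cache results.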
Higher-rated alternatives
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...
githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)