scionoftech/DeepAsr

Keras(Tensorflow) implementations of Automatic Speech Recognition

/ 100

Emerging

This tool helps machine learning engineers and researchers convert spoken audio into text. It takes audio files (WAV or FLAC) as input and outputs the transcribed text, allowing you to train or fine-tune models to accurately recognize speech for various applications. It's designed for those building and experimenting with custom speech recognition systems.

No commits in the last 6 months. Available on PyPI.

Use this if you are a machine learning engineer or researcher who needs to build, customize, or fine-tune an automatic speech recognition (ASR) engine using your own data or pre-trained models.

Not ideal if you are a casual user looking for a ready-to-use, off-the-shelf speech-to-text application without needing to understand or train machine learning models.

speech-to-text machine-learning-engineering natural-language-processing audio-transcription deep-learning-research

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 25 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

AGPL-3.0

Higher-rated alternatives

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights