zakuro-ai/asr
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
This project helps developers build and train speech-to-text models that convert spoken English or Japanese audio into written text. It provides a modular framework for creating custom models or using a pre-trained Japanese model. The end user is a machine learning engineer or data scientist looking to implement or improve automatic speech recognition (ASR) capabilities.
No commits in the last 6 months.
Use this if you are a machine learning engineer building speech recognition applications and need a flexible, modular framework for training and deploying English or Japanese ASR models.
Not ideal if you are an end-user simply looking for a ready-to-use application to transcribe audio without needing to train or develop models.
Stars
69
Forks
11
Language
Python
License
MIT
Category
Last pushed
Nov 03, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/zakuro-ai/asr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx
A small speech recognizer
tensorflow/lingvo
Lingvo
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition