zakuro-ai/asr

ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.

/ 100

Emerging

This project helps developers build and train speech-to-text models that convert spoken English or Japanese audio into written text. It provides a modular framework for creating custom models or using a pre-trained Japanese model. The end user is a machine learning engineer or data scientist looking to implement or improve automatic speech recognition (ASR) capabilities.

No commits in the last 6 months.

Use this if you are a machine learning engineer building speech recognition applications and need a flexible, modular framework for training and deploying English or Japanese ASR models.

Not ideal if you are an end-user simply looking for a ready-to-use application to transcribe audio without needing to train or develop models.

automatic-speech-recognition natural-language-processing machine-learning-engineering audio-processing model-training

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Featured in

Things AI Won't Tell You About Building a Voice App Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

cmusphinx/pocketsphinx

A small speech recognizer

tensorflow/lingvo

Lingvo

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...

PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition

Explore Voice AI Tools

All categories Trending Voice AI directory Insights