vectominist/MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

/ 100

Emerging

This tool helps researchers and developers quickly build and customize automatic speech recognition (ASR) systems. You provide audio files and their transcriptions, and it outputs a trained ASR model capable of converting spoken words into text. It's designed for machine learning engineers or research scientists who need to adapt ASR for specific languages, accents, or vocabularies with minimal effort.

No commits in the last 6 months.

Use this if you need to rapidly develop or fine-tune a speech-to-text model for a specialized audio dataset without extensive coding.

Not ideal if you're looking for an out-of-the-box, ready-to-use speech-to-text API without any model training or customization.

speech-to-text audio-transcription language-technology AI-model-training

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights