alphacep/vosk

VOSK Speech Recognition Toolkit

Score: 58 / 100 (Established)

Vosk is an offline speech recognition toolkit. You load a pretrained model, feed in audio, and get back text transcriptions, with support for streaming recognition and word-level timestamps. Models are compact enough to run on small devices as well as servers, and pretrained models are available for many languages. This is for developers, researchers, or linguists who need speech-to-text without sending audio to a cloud service.

493 stars. Used by 6 other packages. No commits in the last 6 months. Available on PyPI.

Use this if you need lightweight, offline speech-to-text with streaming results and ready-made models for many languages, especially on resource-constrained devices.

Not ideal if you need to train acoustic models from scratch on your own large datasets, or require top-tier cloud-service accuracy in noisy, unseen acoustic conditions.
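As a sketch of typical use, assuming the `vosk` package from PyPI and a locally downloaded model directory (the model path below is a placeholder, not something shipped with the package):

```python
import json
import wave

def transcript_text(result_json: str) -> str:
    """Extract the recognized text from a Vosk result JSON string."""
    return json.loads(result_json).get("text", "")

def transcribe(wav_path: str, model_dir: str) -> str:
    """Transcribe a mono WAV file with a local Vosk model.

    Requires `pip install vosk` and a model downloaded separately;
    imported lazily so the helper above works without the native library.
    """
    from vosk import Model, KaldiRecognizer

    wf = wave.open(wav_path, "rb")
    rec = KaldiRecognizer(Model(model_dir), wf.getframerate())
    pieces = []
    while True:
        data = wf.readframes(4000)
        if not data:
            break
        if rec.AcceptWaveform(data):
            pieces.append(transcript_text(rec.Result()))
    pieces.append(transcript_text(rec.FinalResult()))
    return " ".join(p for p in pieces if p)
```

Vosk returns recognition results as JSON strings, which is why the helper parses them; partial streaming hypotheses are also available via `rec.PartialResult()`.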

speech-recognition speech-to-text audio-processing offline linguistics
Stale for 6 months
Maintenance: 0 / 25
Adoption: 15 / 25
Maturity: 25 / 25
Community: 18 / 25

How are scores calculated?

Stars: 493
Forks: 56
Language: C
License: Apache-2.0
Last pushed: Jul 13, 2022
Commits (30d): 0
Dependencies: 5
Reverse dependents: 6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/alphacep/vosk"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
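The same endpoint can be queried programmatically. A minimal Python sketch, where the URL pattern is taken from the curl example above and the response schema is an assumption (the JSON field names are not documented here):

```python
import json
from urllib.request import urlopen

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-report URL for a repository,
    following the pattern shown in the curl example."""
    return f"{API_BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the JSON quality report.
    Assumes the endpoint returns a JSON object; no API key
    is needed within the free daily quota."""
    with urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)

# Example (makes a network request):
# report = fetch_quality("voice-ai", "alphacep", "vosk")
```

Inspect the returned dictionary's keys to see which score components the API exposes rather than relying on any particular field name.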