alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

71 / 100

Verified

Vosk converts spoken audio into written text without requiring an internet connection. It accepts audio in more than 20 languages and dialects and returns text transcriptions. It is aimed at developers building applications that need to understand speech, such as chatbots, virtual assistants, or transcription services.

14,377 stars. Used by 6 other packages. Available on PyPI.

Use this if you need to integrate reliable, offline speech-to-text capabilities into your applications for various devices, from smartphones to servers.

Not ideal if you're looking for a ready-to-use end-user application for transcription rather than a developer toolkit.
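As a rough illustration of the audio-in, text-out flow described above, here is a minimal sketch using the `vosk` Python package from PyPI. It assumes a Vosk model has already been downloaded and unpacked locally and that the input is a 16-bit mono PCM WAV file. The function names `collect_text` and `transcribe_wav` and the `model_dir` parameter are hypothetical, chosen for this example only.

```python
import json
import wave


def collect_text(result_jsons):
    """Join the "text" field of each JSON result string, skipping empty ones."""
    parts = (json.loads(r).get("text", "") for r in result_jsons)
    return " ".join(p for p in parts if p)


def transcribe_wav(wav_path, model_dir):
    """Transcribe a WAV file offline; model_dir is a hypothetical local model path."""
    # Imported lazily so collect_text works even without vosk installed.
    from vosk import KaldiRecognizer, Model

    model = Model(model_dir)
    results = []
    with wave.open(wav_path, "rb") as wf:
        # Assumption: input is 16-bit mono PCM; convert other formats beforehand.
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM WAV")
        rec = KaldiRecognizer(model, wf.getframerate())
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform is truthy when an utterance has been finalized.
            if rec.AcceptWaveform(data):
                results.append(rec.Result())
        # Flush whatever audio is still buffered in the recognizer.
        results.append(rec.FinalResult())
    return collect_text(results)
```

Feeding audio in small chunks, as sketched here, is what lets the same loop work for live streams as well as files; for a one-off file, calling `transcribe_wav("speech.wav", "model")` with your own paths would be the whole usage.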

speech-to-text voice-user-interface offline-transcription audio-processing natural-language-processing

Maintenance 10 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 21 / 25

Stars: 14,377

Forks: 1,687

Language: Jupyter Notebook

License: Apache-2.0

Featured in

Things AI Won't Tell You About Building a Voice App

Related tools

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Building blocks for voice-enabled applications in the browser

Picovoice/speech-to-text-benchmark

speech to text benchmark framework

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and...
