alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Vosk converts spoken audio into written text and runs entirely offline, with no internet connection required. It accepts audio in over 20 languages and dialects and returns text transcriptions. It is aimed at developers building applications that need to understand speech, such as chatbots, virtual assistants, or transcription services.
14,377 stars. Used by 6 other packages. Available on PyPI.
Use this if you need to integrate reliable, offline speech-to-text capabilities into your applications for various devices, from smartphones to servers.
Not ideal if you're looking for a ready-to-use end-user application for transcription rather than a developer toolkit.
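As a rough illustration of the audio-in, text-out flow described above, here is a minimal Python sketch. It assumes the `vosk` package from PyPI is installed and that a model has already been downloaded and unpacked locally. The function names `transcribe_wav` and `join_segments` are hypothetical helpers written for this example, not part of Vosk, and the 16-bit mono PCM check reflects what the stock example scripts expect rather than a documented hard limit.

```python
import json
import wave


def join_segments(json_results):
    """Join the "text" field of Vosk JSON result strings into one transcript."""
    texts = (json.loads(r).get("text", "") for r in json_results)
    return " ".join(t for t in texts if t)


def transcribe_wav(wav_path, model_dir):
    """Transcribe a WAV file offline (hypothetical helper, not a Vosk API)."""
    # Imported lazily so the pure helper above works without vosk installed.
    from vosk import Model, KaldiRecognizer

    results = []
    with wave.open(wav_path, "rb") as wf:
        # Assumption: input is 16-bit mono PCM, as in the stock examples.
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM WAV input")

        recognizer = KaldiRecognizer(Model(model_dir), wf.getframerate())
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True once an utterance is complete.
            if recognizer.AcceptWaveform(data):
                results.append(recognizer.Result())
        # FinalResult flushes whatever audio is still buffered.
        results.append(recognizer.FinalResult())

    return join_segments(results)
```

Calling `transcribe_wav("speech.wav", "model")` would return the transcript as a single string; both paths are placeholders for a local recording and an unpacked model directory.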
Stars: 14,377
Forks: 1,687
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Feb 22, 2026
Commits (30d): 0
Dependencies: 5
Reverse dependents: 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/alphacep/vosk-api"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
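The same request can be made from Python using only the standard library. This is a sketch under two assumptions not confirmed by the page: that the endpoint returns JSON, and that the path follows a category/owner/repository pattern inferred from the single URL shown above. `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the endpoint URL; the path pattern is inferred, not documented."""
    owner, name = repo.split("/", 1)
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(name)}"


def fetch_quality(category, repo, timeout=10):
    """Fetch the record for one repository (assumes a JSON response body)."""
    with urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

For example, `fetch_quality("voice-ai", "alphacep/vosk-api")` requests the same URL as the curl command; the keyless limit of 100 requests/day applies.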
Related tools
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Building blocks for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
Speech-to-text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models
Lex-au/Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and...