sskorol/vosk-api-gpu
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC
This project provides pre-configured packages for Vosk, an open-source speech-to-text engine, optimized for NVIDIA GPUs on various hardware like Jetson boards, PCs, and Google Cloud. It takes audio recordings (like WAV files) and converts them into text transcripts, making it easier for users to integrate voice recognition into their applications without complex setup. This is ideal for developers, engineers, and researchers working on applications that need efficient, local audio transcription.
No commits in the last 6 months.
Use this if you need to build or run a high-performance speech-to-text application on NVIDIA GPU-enabled hardware or Google Cloud, and want to simplify the setup of Vosk with Docker.
Not ideal if you don't have access to NVIDIA GPU hardware, prefer a cloud-based speech-to-text API, or only need basic, non-optimized transcription on a CPU.
Stars
45
Forks
7
Language
Shell
License
Apache-2.0
Category
Last pushed
May 16, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sskorol/vosk-api-gpu"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and...
ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
alphacep/vosk
VOSK Speech Recognition Toolkit
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
solyarisoftware/voskJs
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.