sskorol/vosk-api-gpu

Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC

/ 100

Emerging

This project provides pre-configured packages for Vosk, an open-source speech-to-text engine, optimized for NVIDIA GPUs on various hardware like Jetson boards, PCs, and Google Cloud. It takes audio recordings (like WAV files) and converts them into text transcripts, making it easier for users to integrate voice recognition into their applications without complex setup. This is ideal for developers, engineers, and researchers working on applications that need efficient, local audio transcription.

No commits in the last 6 months.

Use this if you need to build or run a high-performance speech-to-text application on NVIDIA GPU-enabled hardware or Google Cloud, and want to simplify the setup of Vosk with Docker.

Not ideal if you don't have access to NVIDIA GPU hardware, prefer a cloud-based speech-to-text API, or only need basic, non-optimized transcription on a CPU.

speech-to-text voice-recognition audio-transcription edge-ai cloud-ml

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Shell

License

Apache-2.0

Higher-rated alternatives

k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and...

ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

alphacep/vosk

VOSK Speech Recognition Toolkit

alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

solyarisoftware/voskJs

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights