Picovoice/cobra
On-device voice activity detection (VAD) powered by deep learning
This helps identify when someone is speaking in an audio stream, making it easier to process voice recordings efficiently. It takes live microphone input or audio files and outputs a probability score indicating the presence of speech. Anyone building voice-enabled applications, like call center analytics, voice assistants, or smart home devices, would use this to manage audio input.
248 stars.
Use this if you need to accurately detect voice activity in real-time or from recorded audio on various devices without relying on cloud services.
Not ideal if you need to transcribe speech into text, identify specific speakers, or understand the content of the conversation.
Stars
248
Forks
17
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Picovoice/cobra"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
FluidInference/FluidAudio
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...
phuc-nt/my-translator
Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Blaizzy/mlx-audio-swift
A modular Swift SDK for audio processing with MLX on Apple Silicon