Picovoice/cobra

On-device voice activity detection (VAD) powered by deep learning

/ 100

Emerging

This helps identify when someone is speaking in an audio stream, making it easier to process voice recordings efficiently. It takes live microphone input or audio files and outputs a probability score indicating the presence of speech. Anyone building voice-enabled applications, like call center analytics, voice assistants, or smart home devices, would use this to manage audio input.

248 stars.

Use this if you need to accurately detect voice activity in real-time or from recorded audio on various devices without relying on cloud services.

Not ideal if you need to transcribe speech into text, identify specific speakers, or understand the content of the conversation.

voice-enabled-apps audio-processing speech-detection on-device-AI real-time-audio

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

248

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...

phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

Explore Voice AI Tools

All categories Trending Voice AI directory Insights