baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

/ 100

Experimental

This tool helps mobile app developers create iOS applications that can detect human speech in real-time audio. It takes live audio streams from an iPhone or iPad and identifies segments where a person is speaking, separating speech from background noise. This allows for features like voice assistants, transcription apps, or smart recording tools.

No commits in the last 6 months.

Use this if you are building an iOS app and need to accurately identify when a user is speaking into the device's microphone, especially for features like voice commands or audio recording optimization.

Not ideal if your application primarily needs to identify a wide range of non-speech sounds (like music or animal noises) with high specificity, or if you are not developing for the iOS platform.

mobile-app-development voice-user-interface audio-processing speech-recognition real-time-audio

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Swift

License

—

Higher-rated alternatives

FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...

phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

Explore Voice AI Tools

All categories Trending Voice AI directory Insights