baochuquan/ios-vad
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
This tool helps mobile app developers create iOS applications that can detect human speech in real-time audio. It takes live audio streams from an iPhone or iPad and identifies segments where a person is speaking, separating speech from background noise. This allows for features like voice assistants, transcription apps, or smart recording tools.
No commits in the last 6 months.
Use this if you are building an iOS app and need to accurately identify when a user is speaking into the device's microphone, especially for features like voice commands or audio recording optimization.
Not ideal if your application primarily needs to identify a wide range of non-speech sounds (like music or animal noises) with high specificity, or if you are not developing for the iOS platform.
Stars
33
Forks
3
Language
Swift
License
—
Category
Last pushed
Nov 14, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/baochuquan/ios-vad"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
FluidInference/FluidAudio
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...
phuc-nt/my-translator
Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Blaizzy/mlx-audio-swift
A modular Swift SDK for audio processing with MLX on Apple Silicon