guozhonghao1994/Voice_Activity_Detection_V1

2018 Lenovo AI Lab Summer Intern

/ 100

Experimental

This tool helps analyze audio recordings by separating speech from silence and background noise. You provide an audio file, and it outputs segmented audio clips, with each clip containing detected speech. Call center analysts, speech researchers, or anyone working with large volumes of spoken audio would find this useful for pre-processing recordings.

No commits in the last 6 months.

Use this if you need to automatically identify and extract spoken segments from audio files, especially for applications like transcription or analyzing customer calls.

Not ideal if you need highly nuanced sound event detection beyond just human speech, or if you're working with very low-quality audio that's heavily distorted or obscured by loud, non-speech noise.

call-center-analytics speech-transcription audio-analysis telephony sound-event-detection

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

License

Apache-2.0

Higher-rated alternatives

FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...

phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

Explore Voice AI Tools

All categories Trending Voice AI directory Insights