idiap/zff_vad

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

29
/ 100
Experimental

This tool helps researchers and analysts automatically identify segments of speech within audio recordings, distinguishing human voice from background noise. You provide raw audio files (like WAV or MP3), and it outputs a CSV file listing the start and end times of all detected speech segments. This is ideal for anyone working with spoken language data, such as in linguistics, speech processing, or audio analytics.

No commits in the last 6 months.

Use this if you need an efficient and accurate way to preprocess large batches of audio, focusing only on the spoken parts for further analysis.

Not ideal if your primary goal is to transcribe speech to text, as this tool only marks where speech occurs, not what is being said.

audio-analysis speech-processing sound-segmentation linguistics voice-detection
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

24

Forks

2

Language

Python

License

GPL-3.0

Last pushed

Oct 19, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/idiap/zff_vad"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.