filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

48
/ 100
Emerging

This project helps you automatically detect when someone is speaking in an audio recording, distinguishing it from background noise. You provide raw audio files, and it tells you exactly where the speech segments are. It's ideal for anyone who needs to process large amounts of audio data and isolate spoken content, such as researchers, transcribers, or call center analysts.

371 stars. No commits in the last 6 months.

Use this if you need to precisely identify speech segments in audio recordings, especially for tasks like transcribing, indexing, or analyzing spoken content.

Not ideal if you're looking for a simple, out-of-the-box solution without any setup, or if you need to differentiate between multiple speakers.

audio-analysis speech-processing transcription call-center-analytics audio-indexing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

371

Forks

69

Language

Python

License

GPL-3.0

Last pushed

Mar 24, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/filippogiruzzi/voice_activity_detection"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.