wq2012/VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

28
/ 100
Experimental

This tool helps researchers and analysts automatically identify 'who spoke when' in audio recordings. It takes raw audio as input and outputs a timeline indicating speech segments attributed to different speakers. Anyone working with audio data, such as speech scientists, linguists, or media analysts, who needs to segment conversations by speaker would find this useful.

No commits in the last 6 months.

Use this if you need to automatically distinguish between multiple speakers in an audio recording, assigning each speech segment to the correct person.

Not ideal if you need to identify *who* the specific speakers are (e.g., 'John' vs 'Jane'), as it only differentiates between 'Speaker 1,' 'Speaker 2,' etc.

speech-analysis audio-transcription conversation-analysis linguistics sound-processing
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 14 / 25

How are scores calculated?

Stars

15

Forks

3

Language

Python

License

Last pushed

Jul 27, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/wq2012/VB_diarization"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.