google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

51
/ 100
Established

This tool helps identify who spoke when in an audio recording, even if multiple people talk over each other. You provide audio features (like speaker embeddings) for an utterance, and it outputs a sequence of labels indicating which speaker is speaking at each moment. It's designed for anyone working with audio that needs to automatically separate and label different speakers.

1,589 stars. No commits in the last 6 months.

Use this if you need to automatically determine and label individual speakers in an audio recording where speakers might overlap.

Not ideal if you need an out-of-the-box solution without providing your own audio features, or if you need to identify *who* the speakers are rather than just separating them.

speaker-diarization audio-analysis voice-transcription meeting-minutes call-center-analytics
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,589

Forks

319

Language

Python

License

Apache-2.0

Last pushed

Sep 25, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/google/uis-rnn"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.