GeorgeEfstathiadis/LLM-Diarize-ASR-Agnostic

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

/ 100

Experimental

This project helps machine learning engineers and researchers improve the accuracy of speaker diarization in audio transcripts. It takes raw audio transcripts, optionally from services like AWS Transcribe or Google Speech-to-Text, along with a reference transcript, and outputs corrected speaker labels. The primary users are individuals working on speech processing applications where precise speaker identification is crucial.

No commits in the last 6 months.

Use this if you need to fine-tune a Large Language Model (LLM) to correct speaker diarization errors in ASR transcripts and evaluate its performance.

Not ideal if you are a non-developer seeking an out-of-the-box solution for speaker diarization without needing to train or deploy machine learning models.

speaker-diarization speech-to-text LLM-fine-tuning machine-learning-research audio-transcription-correction

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

—

Higher-rated alternatives

lifeiteng/NotebookTTS

Text-To-Speech for NotebookLM

Explore Voice AI Tools

All categories Trending Voice AI directory Insights