MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

/ 100

Established

This tool helps you automatically transcribe audio recordings and identify who said what. You provide an audio file, and it delivers a text transcript where each sentence is attributed to a specific speaker. This is useful for anyone needing to analyze conversations, meetings, or interviews, such as researchers, journalists, or content creators.

5,437 stars.

Use this if you need a precise, speaker-attributed transcript of an audio recording for tasks like meeting minutes, interview analysis, or podcast show notes.

Not ideal if your audio contains multiple speakers talking over each other frequently, as the tool currently has limitations with overlapping speech.
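To make the speaker attribution and the overlapping-speech caveat concrete, here is a minimal sketch of one common way to attach speakers to transcribed sentences: give each sentence the speaker whose turn overlaps it the most in time. This is an illustration only, not the project's actual implementation; the names `Segment`, `Turn`, `overlap`, and `attribute` are hypothetical, and the timestamps are made up.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed sentence with start/end times in seconds (hypothetical type)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarization turn: one speaker talking from start to end (hypothetical type)."""
    start: float
    end: float
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def attribute(segments: list[Segment], turns: list[Turn]) -> list[tuple[str, str]]:
    """Label each segment with the single speaker whose turn overlaps it the most."""
    labeled = []
    for seg in segments:
        best_speaker = "UNKNOWN"
        best_overlap = 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((best_speaker, seg.text))
    return labeled


# Illustrative data: the two turns overlap between 2.0 s and 2.2 s.
segments = [Segment(0.0, 2.0, "Hello."), Segment(2.0, 5.0, "Hi there.")]
turns = [Turn(0.0, 2.2, "Speaker 0"), Turn(2.0, 5.0, "Speaker 1")]

for speaker, text in attribute(segments, turns):
    print(f"{speaker}: {text}")
```

Because each sentence receives exactly one label, any stretch where two people talk at once is credited to only one of them. Under this assumed approach, that is why frequent overlapping speech degrades the result.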

meeting-transcription interview-analysis audio-to-text speaker-identification podcast-transcription

No Package, No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars: 5,437

Forks: 500

Language: Jupyter Notebook

License: BSD-2-Clause

Compare

whisper-diarization and whisperX

whisper-diarization and whisper-run

whisper-diarization and whisper-v3-diarization

whisper-diarization and gpt-speaker-diarization

Related tools

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...

jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...

linto-ai/linto-stt

An automatic speech recognition API

linto-ai/linto-studio

Transcription and annotation interface for recorded audio or video files
