MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
This tool helps you automatically transcribe audio recordings and identify who said what. You provide an audio file, and it delivers a text transcript where each sentence is attributed to a specific speaker. This is useful for anyone needing to analyze conversations, meetings, or interviews, such as researchers, journalists, or content creators.
5,437 stars.
Use this if you need a precise, speaker-attributed transcript of an audio recording for tasks like meeting minutes, interview analysis, or podcast show notes.
Not ideal if speakers in your audio frequently talk over each other, because the tool currently has limitations with overlapping speech.
Stars: 5,437
Forks: 500
Language: Jupyter Notebook
License: BSD-2-Clause
Category:
Last pushed: Feb 23, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MahmoudAshraf97/whisper-diarization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
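The same request can be made from a script. The sketch below mirrors the curl command using only the Python standard library. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred from the single URL shown above rather than documented here. The helper names `build_quality_url` and `fetch_quality` are hypothetical and exist only for this illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the response body is JSON; its fields are not described
    on this page, so the decoded value is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    data = fetch_quality("voice-ai", "MahmoudAshraf97", "whisper-diarization")
    print(json.dumps(data, indent=2))
```

With no key the stated limit is 100 requests/day, so a script that polls several repositories may want to cache responses locally instead of re-fetching on every run.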
Related tools
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
linto-ai/linto-stt
An automatic speech recognition API
linto-ai/linto-studio
Transcription and annotation interface for recorded audio or video files