ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI Language Models (GPT-4) and OpenAI Whisper.
This tool helps organize who said what in a conversation. You provide an audio recording, and it gives you a written transcript where each sentence is clearly labeled with the speaker. This is perfect for anyone analyzing interviews, meetings, or customer service calls to understand individual contributions.
No commits in the last 6 months.
Use this if you need to quickly identify and separate speakers in recorded conversations to simplify analysis.
Not ideal if you require extremely precise, timestamped speaker turns for highly complex audio or academic research.
Stars: 14
Forks: 2
Language: Jupyter Notebook
License: —
Category:
Last pushed: Aug 13, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ElmiraGhorbani/gpt-speaker-diarization"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
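The same request can be made from Python using only the standard library. The sketch below assumes the endpoint returns JSON, which this page does not confirm. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the category/owner/repo path layout is inferred from the curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the path layout is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-data URL for one repository (hypothetical helper)."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record without an API key; assumes a JSON response body."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("voice-ai", "ElmiraGhorbani", "gpt-speaker-diarization")
    print(data)
```

Because keyless access is limited to 100 requests/day, it may be worth caching the result locally rather than calling `fetch_quality` on every run.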
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API