showlab/whisperVideo
Find out who said what in the video.
This tool helps content creators, educators, or researchers automatically identify who is speaking in a video and what they are saying. You provide a video file, and it generates a new video with on-screen speaker panels and subtitles, clearly linking each spoken word to the person who said it. This is ideal for anyone needing to quickly review conversations or generate accurate, speaker-attributed transcripts from long-form videos.
138 stars.
Use this if you need to accurately transcribe multi-speaker videos and visually attribute speech to individuals on screen, especially for long recordings like interviews or lectures.
Not ideal if you only need a basic audio-to-text transcript without speaker identification or visual attribution.
Stars: 138
Forks: 16
Language: Jupyter Notebook
License: —
Category:
Last pushed: Jan 22, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/showlab/whisperVideo"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
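For use from a script rather than the shell, the same request can be sketched in Python with only the standard library. This is a minimal sketch under stated assumptions, not a documented client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single curl example above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request

# Base URL copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown on this page.
    """
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumption: the response body is JSON; adjust the parsing if the
    service returns something else.
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "showlab/whisperVideo")` requests the same URL as the curl command. Since keyless access is limited to 100 requests/day, it is worth caching results locally when looping over many repositories.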
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API