showlab/whisperVideo

Find out who said what in the video.

/ 100

Emerging

This tool helps content creators, educators, or researchers automatically identify who is speaking in a video and what they are saying. You provide a video file, and it generates a new video with on-screen speaker panels and subtitles, clearly linking each spoken word to the person who said it. This is ideal for anyone needing to quickly review conversations or generate accurate, speaker-attributed transcripts from long-form videos.
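To illustrate what "linking each spoken word to the person who said it" can mean in practice, here is a minimal sketch that assigns timestamped words to speaker turns by time overlap. It is not whisperVideo's implementation: the `Word` and `Turn` types, the `attribute_words` function, and the `SPEAKER_00`-style labels are hypothetical names chosen for this example, and the sketch assumes word timestamps and speaker turns are already available from an upstream transcription and diarization step.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Turn:
    speaker: str
    start: float  # seconds
    end: float    # seconds


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def attribute_words(words, turns):
    """Group consecutive words under the speaker whose turn overlaps them most.

    Hypothetical helper for illustration only; words that overlap no turn
    are labeled "unknown".
    """
    lines = []  # list of [speaker, [word texts]]
    for w in words:
        best_speaker, best_overlap = "unknown", 0.0
        for t in turns:
            o = overlap(w.start, w.end, t.start, t.end)
            if o > best_overlap:
                best_speaker, best_overlap = t.speaker, o
        if lines and lines[-1][0] == best_speaker:
            lines[-1][1].append(w.text)
        else:
            lines.append([best_speaker, [w.text]])
    return [(speaker, " ".join(texts)) for speaker, texts in lines]


words = [Word("Hello", 0.0, 0.4), Word("there", 0.4, 0.8), Word("Hi", 1.0, 1.3)]
turns = [Turn("SPEAKER_00", 0.0, 0.9), Turn("SPEAKER_01", 0.9, 1.5)]
transcript = attribute_words(words, turns)
for speaker, text in transcript:
    print(f"{speaker}: {text}")
```

Each resulting line pairs a speaker label with the words attributed to that speaker, which is the kind of data a subtitle or speaker-panel overlay would be rendered from.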

138 stars.

Use this if you need to accurately transcribe multi-speaker videos and visually attribute speech to individuals on screen, especially for long recordings like interviews or lectures.

Not ideal if you only need a basic audio-to-text transcript without speaker identification or visual attribution.

video-transcription content-creation education meeting-analysis research-interviews

No license · No package · No dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 14 / 25


Stars: 138

Forks:

Language: Jupyter Notebook

License: —


Higher-rated alternatives

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...

jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

linto-ai/linto-stt

An automatic speech recognition API
