whisperX and whisper-v3-diarization
WhisperX is the underlying diarization enhancement library that whisper-v3-diarization wraps into a production-ready CLI/GUI application, making them complements designed to be used together.
About whisperX
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
This tool helps you accurately transcribe audio recordings, providing not just the words but also precise timestamps for each word. It can also identify who is speaking at any given time, separating conversations by speaker. Anyone who needs highly accurate transcripts for audio analysis, subtitling, or content review would find this useful, such as researchers, journalists, or content creators.
About whisper-v3-diarization
TharanaBope/whisper-v3-diarization
Production-ready audio transcription & speaker diarization CLI & GUI using OpenAI Whisper and WhisperX
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work