m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

/ 100

Verified

This tool transcribes audio recordings and provides a precise timestamp for each word, not just the text. It can also identify who is speaking at any given time and separate a conversation by speaker. It is useful to anyone who needs highly accurate transcripts for audio analysis, subtitling, or content review, such as researchers, journalists, and content creators.

20,758 stars. Used by 5 other packages. Actively maintained with 11 commits in the last 30 days. Available on PyPI.

Use this if you need to turn audio into text with exact word timings and speaker identification, especially for long recordings or multi-speaker conversations.
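To illustrate the subtitling use case, here is a minimal Python sketch of what can be done with word-level timestamps and speaker labels once they exist. It does not call WhisperX. The word records (`word`, `start`, `end`, `speaker` keys), the `SPEAKER_00` style labels, and the helper names `srt_time` and `words_to_srt` are assumptions made for illustration, so check the project's own documentation for its actual output format.

```python
from typing import Dict, List


def srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = int(round(seconds * 1000))
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def words_to_srt(words: List[Dict], max_words: int = 7) -> str:
    """Group timed words into SRT cues.

    A new cue starts when the speaker changes or the current cue
    already holds max_words words.
    """
    cues: List[List[Dict]] = []
    for w in words:
        same_speaker = bool(cues) and cues[-1][-1].get("speaker") == w.get("speaker")
        if cues and same_speaker and len(cues[-1]) < max_words:
            cues[-1].append(w)
        else:
            cues.append([w])

    blocks = []
    for i, cue in enumerate(cues, start=1):
        text = " ".join(w["word"] for w in cue)
        speaker = cue[0].get("speaker")
        if speaker:
            text = f"[{speaker}] {text}"
        start, end = srt_time(cue[0]["start"]), srt_time(cue[-1]["end"])
        blocks.append(f"{i}\n{start} --> {end}\n{text}")
    return "\n\n".join(blocks) + "\n"


# Hypothetical word-level records (assumed shape, not real tool output).
sample_words = [
    {"word": "Hello", "start": 0.50, "end": 0.82, "speaker": "SPEAKER_00"},
    {"word": "there.", "start": 0.90, "end": 1.25, "speaker": "SPEAKER_00"},
    {"word": "Hi!", "start": 1.60, "end": 1.95, "speaker": "SPEAKER_01"},
]
srt_text = words_to_srt(sample_words)
print(srt_text)
```

Breaking cues on speaker changes keeps each subtitle attributable to one person, which is where word-level timing and speaker separation pay off over a plain transcript.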

Not ideal if you only need a basic transcript without precise word-level timings or speaker separation, or if you prefer a service with a graphical user interface.

audio-transcription speech-to-text speaker-diarization subtitling qualitative-research

Maintenance 20 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 20 / 25

Stars: 20,758

Forks: 2,188

Language: Python

License: BSD-2-Clause

Featured in

Things AI Won't Tell You About Building a Voice App

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Recent Releases

v3.8.5 01 Apr 2026

v3.7.9 25 Mar 2026

v3.6.2 25 Mar 2026

v3.5.2 25 Mar 2026

v3.4.5 25 Mar 2026

Compare

whisperX and whisply

whisperX and docker-whisperX

whisperX and whisper-diarization

whisperX and CrisperWhisper

whisperX and whisperVideo

whisperX and whisper-run

whisperX and whisper-v3-diarization

whisperX and gpt-speaker-diarization

Related tools

tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...

jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

linto-ai/linto-stt

An automatic speech recognition API

linto-ai/linto-studio

Transcription and annotation interface for recorded audio or video files
