whisperX and whisply

WhisperX provides the underlying speech recognition and diarization engine with word-level timestamps, while Whisply is a higher-level application layer that wraps Whisper (and potentially WhisperX) to deliver batch processing and user interface functionality—making them complements rather than direct competitors.

whisperX
80
Verified
whisply
62
Established
Maintenance 20/25
Adoption 15/25
Maturity 25/25
Community 20/25
Maintenance 13/25
Adoption 9/25
Maturity 25/25
Community 15/25
Stars: 20,758
Forks: 2,188
Downloads:
Commits (30d): 11
Language: Python
License: BSD-2-Clause
Stars: 108
Forks: 16
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No risk flags
No risk flags

About whisperX

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

This tool helps you accurately transcribe audio recordings, providing not just the words but also precise timestamps for each word. It can also identify who is speaking at any given time, separating conversations by speaker. Anyone who needs highly accurate transcripts for audio analysis, subtitling, or content review would find this useful, such as researchers, journalists, or content creators.

audio-transcription speech-to-text speaker-diarization subtitling qualitative-research

About whisply

tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisper on CPU, Nvidia GPU and Apple MLX.

This tool helps you quickly convert audio and video files into text. You provide your media files, and it generates precise transcriptions, translations, speaker annotations, and even subtitles. It's designed for anyone who needs to process many recordings, such as researchers, podcasters, or content creators, to make their content more accessible and searchable.

transcription media-localization content-creation research-analysis accessibility

Scores updated daily from GitHub, PyPI, and npm data. How scores work