whisperX and CrisperWhisper
WhisperX provides the foundational diarization and word-level timestamping infrastructure that CrisperWhisper builds upon, making them complements rather than competitors—CrisperWhisper adds filler detection refinements to WhisperX's base output.
About whisperX
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
This tool helps you accurately transcribe audio recordings, providing not just the words but also precise timestamps for each word. It can also identify who is speaking at any given time, separating conversations by speaker. Anyone who needs highly accurate transcripts for audio analysis, subtitling, or content review would find this useful, such as researchers, journalists, or content creators.
About CrisperWhisper
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
CrisperWhisper helps you get extremely accurate, word-for-word transcriptions from audio, perfect for detailed analysis of spoken interactions. It takes audio recordings and produces a text transcript that includes every sound, like 'um' or 'uh,' along with precise timing for each word. Anyone who needs to analyze speech patterns, interview content, or conversational flow will find this tool valuable.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work