nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
CrisperWhisper helps you get extremely accurate, word-for-word transcriptions from audio, perfect for detailed analysis of spoken interactions. It takes audio recordings and produces a text transcript that includes every sound, like 'um' or 'uh,' along with precise timing for each word. Anyone who needs to analyze speech patterns, interview content, or conversational flow will find this tool valuable.
927 stars. No commits in the last 6 months.
Use this if you need to know exactly what was said and when, including all natural speech quirks like pauses, fillers, and stutters, for in-depth linguistic or behavioral analysis.
Not ideal if you only need a clean, polished transcription that removes common speech disfluencies and focuses on the speaker's intended message.
Stars
927
Forks
48
Language
Python
License
—
Category
Last pushed
Jun 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nyrahealth/CrisperWhisper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API