orianemartin/WhispGrid
A Whisper to TextGrid script that I use to automatize Corpus Annotation on Praat, with speaker diarization.
This tool helps linguistics researchers and phoneticians automate the initial steps of annotating audio corpora. You provide audio files, and it automatically generates a TextGrid file ready for use in Praat, complete with speech segments and individual words marked. It also identifies different speakers within the audio, making your manual review much faster.
No commits in the last 6 months.
Use this if you need to quickly get a first pass at transcribing and segmenting spoken audio for linguistic analysis in Praat, especially with multiple speakers.
Not ideal if your audio files are larger than 25MB each, as you will encounter a size limit.
Stars
13
Forks
3
Language
Python
License
MIT
Category
Last pushed
Nov 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/orianemartin/WhispGrid"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API