orianemartin/WhispGrid

A Whisper to TextGrid script that I use to automatize Corpus Annotation on Praat, with speaker diarization.

35
/ 100
Emerging

This tool helps linguistics researchers and phoneticians automate the initial steps of annotating audio corpora. You provide audio files, and it automatically generates a TextGrid file ready for use in Praat, complete with speech segments and individual words marked. It also identifies different speakers within the audio, making your manual review much faster.

No commits in the last 6 months.

Use this if you need to quickly get a first pass at transcribing and segmenting spoken audio for linguistic analysis in Praat, especially with multiple speakers.

Not ideal if your audio files are larger than 25MB each, as you will encounter a size limit.

linguistics phonetics corpus-annotation speech-analysis speaker-diarization
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

13

Forks

3

Language

Python

License

MIT

Last pushed

Nov 14, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/orianemartin/WhispGrid"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.