JJWRoeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!
This tool helps researchers and linguists automatically generate precise, time-aligned transcriptions from audio files. You input raw audio (like interviews or spoken recordings), and it outputs a Praat TextGrid file. The TextGrid shows exact timings for spoken words and segments, making it easier to analyze speech.
Available on PyPI.
Use this if you need to accurately transcribe and time-align spoken audio for linguistic analysis, phonetics research, or any task requiring detailed word and segment timing.
Not ideal if you only need a simple, untimed text transcription and don't require the detailed, time-aligned structure of a TextGrid file.
Stars
18
Forks
3
Language
Python
License
AGPL-3.0
Category
Last pushed
Dec 16, 2025
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/JJWRoeloffs/transcribe_align_textgrid"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video....
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
antor44/livestream_video
playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and...
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper