samuelbradshaw/text-to-timestamps
Python and command-line utility for aligning audio to a transcript.
This tool helps content creators, educators, and media professionals synchronize audio with written transcripts. You provide an audio file (such as an MP3) and, optionally, an existing text transcript. The tool then outputs a file of timestamps showing when each word, phrase, or block of text is spoken in the audio.
No commits in the last 6 months.
Use this if you need to quickly synchronize text with audio, for example to create captions, subtitles, or interactive transcripts for videos and podcasts.
Not ideal if you need a fully integrated, visual subtitle editing suite with advanced styling and preview features.
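To make the captioning use case concrete, here is a minimal sketch of how word-level timestamps could be turned into SubRip (SRT) cues. It does not reproduce text-to-timestamps' actual output format or API, which this page does not describe: the `TimedWord` record, the `words_to_srt` helper, and the `max_words` grouping rule are all hypothetical names and choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class TimedWord:
    """Hypothetical record: one word with start/end times in seconds."""
    text: str
    start: float
    end: float


def srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[TimedWord], max_words: int = 7) -> str:
    """Group consecutive words into numbered SRT cues of at most max_words."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        index = i // max_words + 1  # SRT cue numbers start at 1
        line = " ".join(w.text for w in chunk)
        # Each cue spans from its first word's start to its last word's end.
        span = f"{srt_time(chunk[0].start)} --> {srt_time(chunk[-1].end)}"
        cues.append(f"{index}\n{span}\n{line}\n")
    return "\n".join(cues)
```

Grouping by a fixed word count keeps the sketch short; a real captioning workflow would more likely break cues at phrase boundaries or pauses.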
Stars: 15
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Aug 15, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/samuelbradshaw/text-to-timestamps"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
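The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the `category/owner/repo` URL pattern is inferred from the single curl example above, the response is assumed to be JSON (its fields are not documented here), and `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the rest of the pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL; `repo` is given as "owner/name"."""
    owner, name = repo.split("/", 1)
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(name)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body as JSON (assumed response format)."""
    url = quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "samuelbradshaw/text-to-timestamps")` issues the same GET as the curl command. Because the response schema is unknown, inspect the returned object before relying on any particular field.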
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning