samuelbradshaw/text-to-timestamps

Python and command-line utility for aligning audio to a transcript.

/ 100

Emerging

This tool helps content creators, educators, or media professionals synchronize audio with written transcripts. You provide an audio file (like an MP3) and optionally an existing text transcript. It then outputs a file with precise timestamps, showing exactly when each word, phrase, or block of text is spoken in the audio.

No commits in the last 6 months.

Use this if you need to quickly generate synchronized text and audio, for example, to create captions, subtitles, or interactive transcripts for videos and podcasts.

Not ideal if you need a fully integrated, visual subtitle editing suite with advanced styling and preview features.

transcription subtitling captioning podcast-production e-learning-content

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...

oseiskar/autosubsync

Automatically synchronize subtitles with audio using machine learning

Explore Voice AI Tools

All categories Trending Voice AI directory Insights