lukereichold/SpeechTimestamper

Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.

Experimental

This tool helps educators, language learners, and music enthusiasts synchronize audio with its exact text. You provide an audio file and its correct written transcript, and it produces a version of the transcript in which each word carries a precise timestamp. It suits anyone who needs to link specific spoken words to their exact moment in an audio recording.

No commits in the last 6 months.

Use this if you have an audio recording and already know its correct word-for-word transcript, and you need to find out the precise start time of each word within that audio.

Not ideal if you only have an audio file and need a basic transcript without pre-supplying the text, or if you don't need highly accurate, word-level timestamps.

language-education speech-analysis audio-synchronization lyric-alignment transcript-timing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 4 / 25

Language: Swift

License: MIT

Higher-rated alternatives

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...

i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

512z/podlens

Free Podwise: AI Podcast & YouTube Transcription & Understanding Agent | Podcast + YouTube transcription, learning, and visualization AI tool

Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...
