lukereichold/SpeechTimestamper
Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.
This tool helps educators, language learners, or music enthusiasts synchronize audio with its exact text. You provide an audio file and its correct written transcript, and it generates a version where each word in the transcript is precisely timestamped. This is ideal for anyone needing to link specific spoken words to their exact moment in an audio recording.
No commits in the last 6 months.
Use this if you have an audio recording and already know its correct word-for-word transcript, and you need to find out the precise start time of each word within that audio.
Not ideal if you only have an audio file and need a basic transcript without pre-supplying the text, or if you don't need highly accurate, word-level timestamps.
Stars
21
Forks
1
Language
Swift
License
MIT
Category
Last pushed
Aug 16, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lukereichold/SpeechTimestamper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AbdullahHendy/live-translation
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...
i4Ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
512z/podlens
Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent | 播客+youtube转文字/学习/可视化AI工具
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer
aws-solutions/content-localization-on-aws
Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...