feldberlin/timething

Timething is a library for aligning text transcripts with their audio recordings.

48 / 100 (Emerging)

This project helps media producers, educators, and researchers precisely synchronize audio recordings with their written transcripts. You provide an audio file and its corresponding text, and it outputs time codes for each word and character, indicating exactly when each was spoken. This is ideal for anyone who needs to link spoken content tightly to its text.

130 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to accurately timestamp every word and character in an audio recording against its full transcript, whether for long-form content like podcasts or smaller audio snippets.

Not ideal if you're looking for a tool to transcribe audio from scratch, as it requires a pre-existing transcript to perform the alignment.

media-production audiobook-creation speech-research podcast-editing educational-content
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 13 / 25


Stars: 130
Forks: 14
Language: Jupyter Notebook
License: MIT
Last pushed: Dec 03, 2024
Commits (30d): 0
Dependencies: 8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/feldberlin/timething"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
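
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It rests on two assumptions not confirmed by this page: that the URL path follows a category/owner/repository pattern (inferred from the single curl example above), and that the response body is JSON. The function names quality_url and fetch_quality are hypothetical helpers for illustration, not part of any published client.

```python
import json
import urllib.parse
import urllib.request

# Base URL copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the single example URL; other layouts may exist.
    """
    parts = (urllib.parse.quote(p, safe="") for p in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body.

    Assumption: the endpoint returns JSON. If it does not,
    json.loads raises json.JSONDecodeError.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling fetch_quality("voice-ai", "feldberlin", "timething") issues the same request as the curl command. At 100 keyless requests per day, a script that polls many repositories would need to cache results or space out its calls.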