readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

55
/ 100
Established

Automatically generates a precise synchronization map between an audio file and its corresponding text, marking the exact start and end times for each text fragment within the audio. This helps create synchronized content for various applications, from research to digital publishing. It's used by anyone needing to align spoken words with written text, such as content creators, educators, or accessibility specialists.

2,811 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically match specific parts of a written transcript to their exact spoken moments in an audio recording.

Not ideal if you need to transcribe audio from scratch or if the audio quality is very poor, as it relies on a clear correspondence between existing text and audio.

audio-text-synchronization digital-publishing closed-captioning e-learning-content multimedia-accessibility
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

2,811

Forks

270

Language

Python

License

AGPL-3.0

Last pushed

Jun 22, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/readbeyond/aeneas"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.