readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

/ 100

Established

aeneas automatically generates a synchronization map between an audio file and its corresponding text, marking the start and end time of each text fragment within the audio. The map can be used to build synchronized content for applications ranging from research to digital publishing. Typical users are content creators, educators, and accessibility specialists who need to align spoken words with written text.
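To make the idea of a synchronization map concrete, here is a minimal sketch that turns one into SRT captions. It does not call aeneas itself. It assumes the map has already been exported as JSON with a top-level `fragments` list whose entries carry `begin`, `end` (seconds, as strings) and `lines`. That shape, the sample data, and the helper names `to_srt_timestamp` and `sync_map_to_srt` are illustrative assumptions, so check them against your own output before relying on them.

```python
import json

# Hypothetical sample shaped like a JSON sync map: one entry per text fragment,
# with begin/end times in seconds stored as strings (assumed layout).
SAMPLE_SYNC_MAP = """
{"fragments": [
  {"id": "f000001", "begin": "0.000", "end": "2.680", "lines": ["First sentence."]},
  {"id": "f000002", "begin": "2.680", "end": "5.125", "lines": ["Second sentence."]}
]}
"""


def to_srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(float(seconds) * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def sync_map_to_srt(sync_map):
    """Build SRT caption text from a parsed sync map dictionary."""
    blocks = []
    for index, fragment in enumerate(sync_map["fragments"], start=1):
        start = to_srt_timestamp(fragment["begin"])
        end = to_srt_timestamp(fragment["end"])
        text = "\n".join(fragment["lines"])
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    # SRT separates caption blocks with a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(sync_map_to_srt(json.loads(SAMPLE_SYNC_MAP)))
```

Each caption block reuses the fragment's own start and end times, so the captions are only as accurate as the alignment that produced the map.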

2,811 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically match specific parts of a written transcript to their exact spoken moments in an audio recording.

Not ideal if you need to transcribe audio from scratch or if the audio quality is very poor, as it relies on a clear correspondence between existing text and audio.

audio-text-synchronization digital-publishing closed-captioning e-learning-content multimedia-accessibility

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 20 / 25


Stars: 2,811

Forks: 270

Language: Python

License: AGPL-3.0

Featured in

Things AI Won't Tell You About Building a Voice App

Compare

aeneas and ForcedAlignment

Related tools

kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

analyticsinmotion/werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for...

fgnt/meeteval

MeetEval - A meeting transcription evaluation toolkit

tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speech audio.

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).
