readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Automatically generates a precise synchronization map between an audio file and its corresponding text, marking the exact start and end times for each text fragment within the audio. This helps create synchronized content for various applications, from research to digital publishing. It's used by anyone needing to align spoken words with written text, such as content creators, educators, or accessibility specialists.
2,811 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically match specific parts of a written transcript to their exact spoken moments in an audio recording.
Not ideal if you need to transcribe audio from scratch or if the audio quality is very poor, as it relies on a clear correspondence between existing text and audio.
Stars
2,811
Forks
270
Language
Python
License
AGPL-3.0
Category
Last pushed
Jun 22, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/readbeyond/aeneas"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
analyticsinmotion/werpy
🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for...
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
tabahi/bournemouth-forced-aligner
Extract phoneme-level timestamps from speeh audio.
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).