tabahi/bournemouth-forced-aligner
Extract phoneme-level timestamps from speeh audio.
Automatically labels the exact start and end times of individual speech sounds (phonemes) within an audio recording, given its text transcript. It takes an audio file and a matching transcript, then outputs millisecond-level timestamps for each phoneme. This tool is ideal for phoneticians, speech researchers, and language educators who need precise timing information for spoken language.
121 stars. Used by 1 other package. Available on PyPI.
Use this if you need to precisely measure when each sound occurs in an audio recording, for detailed phonetic analysis or speech synthesis.
Not ideal if you only need word-level timings or are working with audio that doesn't have a perfectly matching text transcript.
Stars
121
Forks
12
Language
Python
License
GPL-3.0
Category
Last pushed
Feb 28, 2026
Commits (30d)
0
Dependencies
9
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tabahi/bournemouth-forced-aligner"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
analyticsinmotion/werpy
🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for...
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka...