tiefenauer/ip9

Code for my master thesis at FHNW

/ 100

Experimental

This project helps linguists, speech researchers, or educators accurately match specific words or phrases in a written text to their corresponding locations within an audio recording. You input an audio file and its full transcription, and it outputs precise timing information for each word, indicating when it's spoken. This is ideal for anyone needing to analyze speech patterns or create synchronized captions.

No commits in the last 6 months.

Use this if you need to determine the exact start and end times of words or phrases in an audio recording, given its complete text transcript.

Not ideal if you need a solution for operating systems other than Linux or require a pre-built, easy-to-install application without technical setup.

linguistics speech analysis audio synchronization phonetics captioning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

Picovoice/rhino

On-device Speech-to-Intent engine powered by deep learning

yandexdataschool/speech_course

YSDA course in Speech Processing.

MycroftAI/adapt

Adapt Intent Parser

Picovoice/speech-to-intent-benchmark

benchmark for Speech-to-Intent engines

IBM/BigLittleNet

Official repository for Big-Little Net

Explore Voice AI Tools

All categories Trending Voice AI directory Insights