NatGr/annotate_audio

Helper scripts to split a large audio file into smaller chunks and annotate these chunks

/ 100

Experimental

This tool helps anyone working with long audio recordings to prepare them for speech-to-text (STT) or text-to-speech (TTS) training. It takes a large audio file, automatically splits it into smaller, manageable clips, and generates initial text transcripts for each. The output is a collection of short audio files with corresponding text annotations, ready for use by data scientists or linguists training speech models.

No commits in the last 6 months.

Use this if you have lengthy audio recordings and need to quickly segment them and create initial transcriptions for STT or TTS model development.

Not ideal if you need extremely precise, human-level transcription without any automated assistance or if you are working with very short, pre-segmented audio clips.

speech-data-preparation audio-segmentation transcription-assistance AI-model-training linguistics-data

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

alphacep/awesome-russian-speech

Russian speech technology links

ftyers/commonvoice-utils

Linguistic processing for Common Voice

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Explore Voice AI Tools

All categories Trending Voice AI directory Insights