NatGr/annotate_audio

Helper scripts to split a large audio file into smaller chunks and annotate these chunks

20 / 100 Experimental

This tool helps anyone working with long audio recordings to prepare them for speech-to-text (STT) or text-to-speech (TTS) training. It takes a large audio file, automatically splits it into smaller, manageable clips, and generates initial text transcripts for each. The output is a collection of short audio files with corresponding text annotations, ready for use by data scientists or linguists training speech models.
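The splitting step described above can be illustrated with a minimal sketch. It is not the repository's actual algorithm, which this page does not describe. The function name `chunk_boundaries` and its parameters are hypothetical, and the strategy (cut at the lowest-amplitude sample near the end of each window) is a deliberately crude stand-in for real silence detection. Transcription is left out because it depends on whichever STT engine is used.

```python
from typing import List, Sequence, Tuple


def chunk_boundaries(
    samples: Sequence[float],
    sample_rate: int,
    max_chunk_s: float = 10.0,
    search_s: float = 2.0,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample-index pairs that cover the whole signal.

    Hypothetical helper: each chunk is at most max_chunk_s seconds long, and
    each cut is placed at the quietest sample found in the last search_s
    seconds of the current window.
    """
    max_len = max(1, int(max_chunk_s * sample_rate))
    search = max(1, int(search_s * sample_rate))
    n = len(samples)
    bounds: List[Tuple[int, int]] = []
    start = 0
    while n - start > max_len:
        window_end = start + max_len
        # Only look for a cut point near the end of the window,
        # and always move forward by at least one sample.
        lo = max(start + 1, window_end - search)
        cut = min(range(lo, window_end + 1), key=lambda i: abs(samples[i]))
        bounds.append((start, cut))
        start = cut
    if start < n:
        bounds.append((start, n))  # remainder shorter than max_len
    return bounds
```

Each returned pair can then be used to slice the sample array and write one short clip per chunk. A production pipeline would more likely measure energy over short frames rather than single samples.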

No commits in the last 6 months.

Use this if you have lengthy audio recordings and need to quickly segment them and create initial transcriptions for STT or TTS model development.

Not ideal if you need fully manual, human-quality transcription with no automated first pass, or if your audio is already split into very short clips.

speech-data-preparation audio-segmentation transcription-assistance AI-model-training linguistics-data
No License | Stale 6m | No Package | No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 8 / 25

Stars: 8
Forks: 1
Language: Python
License: none
Last pushed: Oct 19, 2021
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/NatGr/annotate_audio"

Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000 requests/day.
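The same request can be made from Python. The sketch below uses only the standard library. It assumes that the endpoint returns JSON and that other repositories follow the same `category/owner/repo` path pattern as the curl URL above; neither is confirmed on this page. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; everything after it is an assumed pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "NatGr", "annotate_audio")` issues the same GET request as the curl command and counts against the same daily limit.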