gokhaneraslan/tts-dataset-generator

With this tool you can create custom TTS dataset from video or audio.

/ 100

Emerging

This tool helps you turn long audio or video recordings into neatly organized datasets for training custom text-to-speech (TTS) voices. You input raw audio or video files, and it automatically breaks them into speech segments, transcribes them using AI, and outputs properly formatted audio clips and a text file of aligned transcripts. It's perfect for voice actors, linguists, or educators who need to create custom voice models from their own recordings.

No commits in the last 6 months.

Use this if you need to create a high-quality, segmented, and transcribed dataset from audio or video files to train a custom text-to-speech voice or for large-scale transcription.

Not ideal if you only need a quick transcription of a short audio file without the need for segmentation or dataset formatting for voice model training.

voice-synthesis speech-recognition audio-transcription voice-cloning e-learning-content

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

hetpandya/youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

IS2AI/Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...

taresh18/TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

Hecate2/sukasuka-vocal-dataset-builder

すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...

youmebangbang/TTS-dataset-tools

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights