danklabs/tts_dataset_maker

A gui to help make a text to speech dataset.

/ 100

Emerging

This tool helps content creators and voice artists prepare high-quality audio recordings for training custom text-to-speech models. You input raw audio clips and their corresponding scripts, then use the interface to segment and organize them. The output is a structured dataset, similar to industry-standard formats, ready for use in voice cloning or other speech synthesis projects.

No commits in the last 6 months.

Use this if you need to create a meticulously organized dataset of audio and text pairs to train a unique voice for text-to-speech applications.

Not ideal if you're looking for an automated voice cloning solution or a simple audio editor; this tool requires significant manual effort in data preparation.

voice-cloning audio-dataset-preparation speech-synthesis content-creation voice-acting

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

JavaScript

License

MIT

Higher-rated alternatives

hetpandya/youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

IS2AI/Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...

taresh18/TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

Hecate2/sukasuka-vocal-dataset-builder

すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...

youmebangbang/TTS-dataset-tools

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights