danklabs/tts_dataset_maker
A gui to help make a text to speech dataset.
This tool helps content creators and voice artists prepare high-quality audio recordings for training custom text-to-speech models. You input raw audio clips and their corresponding scripts, then use the interface to segment and organize them. The output is a structured dataset, similar to industry-standard formats, ready for use in voice cloning or other speech synthesis projects.
No commits in the last 6 months.
Use this if you need to create a meticulously organized dataset of audio and text pairs to train a unique voice for text-to-speech applications.
Not ideal if you're looking for an automated voice cloning solution or a simple audio editor; this tool requires significant manual effort in data preparation.
Stars
18
Forks
2
Language
JavaScript
License
MIT
Category
Last pushed
Dec 10, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/danklabs/tts_dataset_maker"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...
taresh18/TTSizer
ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨
Hecate2/sukasuka-vocal-dataset-builder
γγγγγ’γγ‘γγ«γγγΌγΏγ»γγγ1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...
youmebangbang/TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...