danklabs/tts_dataset_maker

A gui to help make a text to speech dataset.

31
/ 100
Emerging

This tool helps content creators and voice artists prepare high-quality audio recordings for training custom text-to-speech models. You input raw audio clips and their corresponding scripts, then use the interface to segment and organize them. The output is a structured dataset, similar to industry-standard formats, ready for use in voice cloning or other speech synthesis projects.

No commits in the last 6 months.

Use this if you need to create a meticulously organized dataset of audio and text pairs to train a unique voice for text-to-speech applications.

Not ideal if you're looking for an automated voice cloning solution or a simple audio editor; this tool requires significant manual effort in data preparation.

voice-cloning audio-dataset-preparation speech-synthesis content-creation voice-acting
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

18

Forks

2

Language

JavaScript

License

MIT

Last pushed

Dec 10, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/danklabs/tts_dataset_maker"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.