Hecate2/sukasuka-vocal-dataset-builder

SukaSuka anime vocal dataset, the first anime vocal dataset. Extracts all speech (vocal) clips from video based on .ass subtitle files; the clips are then manually labeled by character. Intended for training PITS/VITS/Diffusion text-to-speech and SVC models.


Emerging

This tool helps animators, voice actors, and AI voice developers create custom voice datasets from anime videos. It takes video files and accompanying subtitle files as input. It then extracts individual vocal segments, allowing you to manually label each segment with the character who is speaking. The output is a structured dataset of character-specific vocal audio files, ready for training text-to-speech or voice conversion models.
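The subtitle-driven extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `Segment`, `parse_dialogue_lines`, and `ffmpeg_command` are hypothetical, and the sketch assumes the standard ASS v4+ `Dialogue:` field order (Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text) and that `ffmpeg` is available on the system.

```python
import re
from typing import List, NamedTuple


class Segment(NamedTuple):
    """One subtitle event: start/end in seconds plus the spoken text."""
    start: float
    end: float
    text: str


def ass_time_to_seconds(stamp: str) -> float:
    """Convert an ASS timestamp such as '0:01:02.50' to seconds."""
    hours, minutes, seconds = stamp.strip().split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def parse_dialogue_lines(ass_text: str) -> List[Segment]:
    """Collect timed segments from the 'Dialogue:' lines of an .ass file.

    Assumes the standard ASS v4+ field order; the Text field is last and
    may itself contain commas, so the line is split at most 9 times.
    """
    segments = []
    for line in ass_text.splitlines():
        if not line.startswith("Dialogue:"):
            continue
        fields = line[len("Dialogue:"):].split(",", 9)
        if len(fields) < 10:
            continue  # malformed line, skip it
        # Strip override tags like {\i1} and turn hard line breaks into spaces.
        text = re.sub(r"\{[^}]*\}", "", fields[9]).replace("\\N", " ").strip()
        segments.append(
            Segment(ass_time_to_seconds(fields[1]),
                    ass_time_to_seconds(fields[2]),
                    text)
        )
    return segments


def ffmpeg_command(video_path: str, seg: Segment, out_path: str) -> List[str]:
    """Build an ffmpeg argument list that cuts one audio-only clip."""
    duration = seg.end - seg.start
    return [
        "ffmpeg",
        "-ss", f"{seg.start:.2f}",   # seek to the subtitle start
        "-i", video_path,
        "-t", f"{duration:.2f}",     # keep only the subtitle's duration
        "-vn",                       # drop the video stream
        out_path,
    ]
```

Each returned argument list could be passed to `subprocess.run` to write one clip per subtitle line. The resulting files would then be sorted by hand into per-character folders, which is the manual labeling step.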

Use this if you need to build a specialized dataset of character voices from anime or drama CDs for training AI voice models.

Not ideal if you're not familiar with the anime characters or lack the patience for manual audio labeling and data organization.

voice-synthesis anime-production audio-dataset-creation character-voice-training media-localization

No Package, No Dependents

Maintenance 10 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 9 / 25


Language: Python

License: MIT

Higher-rated alternatives

hetpandya/youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

IS2AI/Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...

taresh18/TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

youmebangbang/TTS-dataset-tools

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...

souvikg544/TTS_Data_Maker

Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio...
