Hecate2/sukasuka-vocal-dataset-builder

SukaSuka anime vocal dataset. 1st anime vocal dataset. Extracts vocal audio clips from video based on .ass subtitle files; the clips are then manually labeled by character. Intended for PITS/VITS/diffusion text-to-speech and SVC.

43 / 100 (Emerging)

This tool helps animators, voice actors, and AI voice developers create custom voice datasets from anime videos. It takes video files and accompanying subtitle files as input. It then extracts individual vocal segments, allowing you to manually label each segment with the character who is speaking. The output is a structured dataset of character-specific vocal audio files, ready for training text-to-speech or voice conversion models.
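The subtitle-driven extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `Cue`, `parse_ass_dialogue`, and `ffmpeg_command` are hypothetical, the parser assumes the default ASS `Dialogue:` field order, and the sketch only builds `ffmpeg` argument lists rather than running them.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Cue:
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str     # subtitle text with override tags removed


def ass_time_to_seconds(stamp: str) -> float:
    """Convert an ASS timestamp such as '0:01:02.50' to seconds."""
    hours, minutes, seconds = stamp.strip().split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def parse_ass_dialogue(ass_text: str) -> List[Cue]:
    """Collect Dialogue events (assumption: default ASS field order)."""
    cues = []
    for line in ass_text.splitlines():
        if not line.startswith("Dialogue:"):
            continue
        # Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text
        # Split at most 9 times so commas inside the text are preserved.
        fields = line[len("Dialogue:"):].split(",", 9)
        if len(fields) < 10:
            continue
        text = re.sub(r"\{[^}]*\}", "", fields[9]).strip()  # drop {\...} override tags
        cues.append(Cue(ass_time_to_seconds(fields[1]),
                        ass_time_to_seconds(fields[2]),
                        text))
    return cues


def ffmpeg_command(video: str, cue: Cue, out_path: str) -> List[str]:
    """Build (not run) an ffmpeg call that cuts one cue to 16-bit PCM WAV."""
    duration = cue.end - cue.start
    return [
        "ffmpeg", "-y",
        "-i", video,
        "-ss", f"{cue.start:.2f}",   # seek to the cue start
        "-t", f"{duration:.2f}",     # keep only the cue's duration
        "-vn",                       # discard the video stream
        "-acodec", "pcm_s16le",
        out_path,
    ]
```

Each returned argument list could be passed to `subprocess.run`; the resulting clips would still need the manual per-character labeling the description mentions.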

Use this if you need to build a specialized dataset of character voices from anime or drama CDs for training AI voice models.

Not ideal if you are unfamiliar with the anime's characters or lack the patience for manual audio labeling and data organization.

voice-synthesis anime-production audio-dataset-creation character-voice-training media-localization
No Package, No Dependents
Maintenance 10 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 9 / 25

Stars: 49
Forks: 4
Language: Python
License: MIT
Last pushed: Feb 25, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Hecate2/sukasuka-vocal-dataset-builder"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
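The same request could be made from Python using only the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the URL pattern is inferred from the single curl example above, and the response is assumed (not confirmed by this page) to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the category/repo layout is inferred from it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'."""
    # quote() leaves '/' unescaped by default, so 'owner/name' stays intact.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the body is JSON."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

For example, `fetch_quality("voice-ai", "Hecate2/sukasuka-vocal-dataset-builder")` would request the URL shown in the curl command. Keyless use is limited to 100 requests/day, so repeated lookups are worth caching.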