lottev1991/Project-AIdol-Public-English-Dataset
Public female English corpus used for Project AI❤dol
This dataset provides English singing voice data from a female vocalist, suitable for training AI models that generate singing. It offers nearly two hours of audio, complete with phonetic transcriptions using ARPABET, designed for parallel training of speech synthesis systems. Voice artists, music producers, or researchers developing synthetic singing voices would find this valuable.
Use this if you need a publicly available dataset of female English singing to train a model for synthesizing songs.
Not ideal if you require a native English speaker's voice, professional vocal quality, or intend to use it with voice changers.
Stars
14
Forks
2
Language
—
License
CC-BY-SA-4.0
Category
Last pushed
Dec 28, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lottev1991/Project-AIdol-Public-English-Dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
double22a/speech_dataset
The dataset of Speech Recognition
Jakobovski/free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
Ijwi-ry-Ikirundi-AI/Kirundi_Dataset
🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...
Jahangirbd23/WenetSpeech-Yue
📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...
Nexdata-AI/338-Hours-Spanish-Speech-Data-by-Mobile-Phone
Spanish Speech Dataset