Yuan-ManX/ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

Score: 47 / 100 (Emerging)

This project helps anyone building or researching AI models that work with audio. It curates a wide range of audio datasets, including speech, music, and sound effects, that can serve as training data for generative AI, speech recognition systems, or emotional voice conversion. It is aimed at data scientists, AI researchers, and audio engineers working on intelligent audio tools.

914 stars. No commits in the last 6 months.

Use this if you need to find specialized audio data to train, evaluate, or improve AI models for speech processing, music generation, or sound event recognition.

Not ideal if you are looking for ready-to-use AI models or applications, as this project provides the raw data for building them.

Tags: AI model training, speech recognition, text-to-speech, audio generation, sound analysis
Status: Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25
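
The four category scores add up to the overall figure shown on this page (2 + 10 + 16 + 19 = 47). The sketch below assumes the overall score is a plain sum of four categories capped at 25 each. The page does not state the formula, so treat this as an illustration of the arithmetic rather than the site's actual method. The names `CATEGORY_MAX` and `overall_score` are hypothetical.

```python
# Assumption: each category is scored from 0 to 25 and the overall score
# is their unweighted sum. The page itself does not document the formula.
CATEGORY_MAX = 25

scores = {
    "Maintenance": 2,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 19,
}


def overall_score(category_scores):
    """Sum the category scores after checking each is within 0..CATEGORY_MAX."""
    for name, value in category_scores.items():
        if not 0 <= value <= CATEGORY_MAX:
            raise ValueError(f"{name} score {value} is outside 0..{CATEGORY_MAX}")
    return sum(category_scores.values())


total = overall_score(scores)
print(f"{total} / {CATEGORY_MAX * len(scores)}")  # prints "47 / 100"
```

Read this way, the low Maintenance score (2 / 25) accounts for most of the gap to a higher overall rating, which is consistent with the "Stale 6m" status.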


Stars: 914
Forks: 90
Language: (not listed)
License: MIT
Last pushed: Jul 08, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Yuan-ManX/ai-audio-datasets"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
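
The same request can be made from a script. The following is a minimal sketch using only the Python standard library. It assumes the URL path follows a category/owner/repository pattern, which is inferred from the single URL above, and that the endpoint returns JSON, which this page does not confirm. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it
# (category / owner / repo) is an assumption inferred from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the quality endpoint URL for one repository."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record. Assumes the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("generative-ai", "Yuan-ManX", "ai-audio-datasets")
# print(json.dumps(data, indent=2))
```

Because unauthenticated access is limited to 100 requests per day, a script that polls many repositories would likely want to cache responses locally instead of refetching them.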