Yuan-ManX/ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

/ 100

Emerging

This project helps anyone building or researching AI models that interact with audio. It curates a wide range of audio datasets—including speech, music, and sound effects—that serve as inputs for training generative AI, speech recognition systems, or emotional voice conversion. The output is a highly capable AI model or application, ideal for data scientists, AI researchers, or audio engineers working on intelligent audio tools.

914 stars. No commits in the last 6 months.

Use this if you need to find specialized audio data to train, evaluate, or improve AI models for speech processing, music generation, or sound event recognition.

Not ideal if you are looking for ready-to-use AI models or applications, as this project provides the raw data for building them.

AI model training speech recognition text-to-speech audio generation sound analysis

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

914

Forks

Language

—

License

MIT

Related tools

geeknik/ai-audio-fingerprint-remover

A comprehensive Python tool to remove AI-generated fingerprints, watermarks, and metadata from...

Explore Generative AI Tools

All categories Trending Generative AI directory Insights