jim-schwoebel/download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
This project helps researchers and data scientists working with audio by providing an easy way to download the raw sound files from AudioSet, a large dataset of environmental sounds, music, and speech. It takes the original YouTube video links and extracts the specific 10-second audio clips, organizing them into folders by sound class. This is ideal for anyone training models or analyzing diverse audio events.
105 stars. No commits in the last 6 months.
Use this if you need a local collection of raw, categorized audio clips from AudioSet for tasks like sound event detection, audio classification, or acoustic analysis.
Not ideal if you prefer to work with pre-extracted audio features or embeddings rather than raw audio files, or if you only need a small subset of the AudioSet data.
Stars
105
Forks
22
Language
Python
License
—
Category
Last pushed
Aug 01, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jim-schwoebel/download_audioset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ynop/audiomate
Python library for handling audio datasets.
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos