Sreyan88/Synthio
Code for ICLR 2025 Paper: Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Synthio helps researchers and practitioners classify audio effectively even when they have very little real-world audio data. It takes your small audio dataset and generates additional synthetic audio examples, which are then used to train a more robust audio classification model. This is designed for anyone working with audio classification tasks who struggles with limited access to diverse or large-scale audio recordings.
No commits in the last 6 months.
Use this if you need to build or improve an audio classification system but are hampered by the scarcity of training data.
Not ideal if you already have a very large and diverse audio dataset, as the benefits of synthetic data generation will be less pronounced.
Stars
12
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 31, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Sreyan88/Synthio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
sdv-dev/SDV
Synthetic data generation for tabular data
sdv-dev/SDGym
Benchmarking synthetic data generation methods.
NVIDIA-NeMo/DataDesigner
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch...
AlexanderVNikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations,...
mostly-ai/mostlyai
Synthetic Data SDK ✨