Sreyan88/Synthio

Code for ICLR 2025 Paper: Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

/ 100

Experimental

Synthio helps researchers and practitioners classify audio effectively even when they have very little real-world audio data. It takes your small audio dataset and generates additional synthetic audio examples, which are then used to train a more robust audio classification model. This is designed for anyone working with audio classification tasks who struggles with limited access to diverse or large-scale audio recordings.

No commits in the last 6 months.

Use this if you need to build or improve an audio classification system but are hampered by the scarcity of training data.

Not ideal if you already have a very large and diverse audio dataset, as the benefits of synthetic data generation will be less pronounced.

audio-classification sound-recognition machine-listening dataset-augmentation low-resource-ai

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

sdv-dev/SDV

Synthetic data generation for tabular data

sdv-dev/SDGym

Benchmarking synthetic data generation methods.

NVIDIA-NeMo/DataDesigner

🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch...

AlexanderVNikitin/tsgm

Generation and evaluation of synthetic time series datasets (also, augmentations,...

mostly-ai/mostlyai

Synthetic Data SDK ✨

Explore Generative AI Tools

All categories Trending Generative AI directory Insights