statice/awesome-synthetic-data

A curated list of awesome synthetic data tools (open source and commercial).

Score: 43 / 100 (Emerging)

This is a curated collection of tools and resources for generating synthetic data. It helps data practitioners and researchers find suitable solutions for creating artificial datasets that mimic the statistical properties of real data. You can find open-source libraries, commercial platforms, and community resources to support various synthetic data generation needs.

244 stars. No commits in the last 6 months.

Use this if you need to find tools to generate artificial datasets for testing, development, or analysis while preserving privacy or overcoming data scarcity.

Not ideal if you want a single, ready-to-use synthetic data generator and do not want to compare options or learn the underlying technologies.

data-privacy data-augmentation machine-learning-development data-sharing data-testing
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25

Stars: 244
Forks: 32
Language: (none listed)
License: Apache-2.0
Last pushed: Jan 11, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/statice/awesome-synthetic-data"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
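
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the response format is not documented on this page, so the sketch assumes the endpoint returns JSON and decodes it without relying on any particular field names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-score URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response body.

    Assumption: the endpoint returns JSON; the schema is not documented here.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "statice", "awesome-synthetic-data")` issues the same request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than calling it in a loop.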