tdspora/syngen

Open-source version of the TDspora synthetic data generation algorithm.

58
/ 100
Established

This tool helps you create realistic fake datasets from your existing tabular data, such as CSVs or Excel files, without revealing sensitive information. It takes your original dataset as input and generates a new, synthetic dataset that mimics the statistical properties of your real data. Data scientists, analysts, and anyone needing test data for development, training, or sharing would find this useful.

Available on PyPI.

Use this if you need to generate privacy-preserving test data from an existing tabular dataset for development, testing, or sharing with others.

Not ideal if you need to generate synthetic data without any original data as a template, or if your data is not in a tabular format (e.g., images, audio).

data-privacy data-masking test-data-generation data-anonymization data-simulation
Maintenance 10 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 17 / 25

How are scores calculated?

Stars

18

Forks

11

Language

Jupyter Notebook

License

GPL-3.0

Last pushed

Mar 13, 2026

Commits (30d)

0

Dependencies

38

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/tdspora/syngen"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.