SDV and synthetic-data-generator

The two tools appear to be **competitors**, as both are designed to generate high-quality structured tabular synthetic data, with SDV having significantly higher adoption based on stars and monthly downloads.

SDV
81
Verified
synthetic-data-generator
59
Established
Maintenance 20/25
Adoption 15/25
Maturity 25/25
Community 21/25
Maintenance 10/25
Adoption 10/25
Maturity 16/25
Community 23/25
Stars: 3,439
Forks: 417
Downloads:
Commits (30d): 38
Language: Python
License:
Stars: 2,409
Forks: 385
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No risk flags
No Package No Dependents

About SDV

sdv-dev/SDV

Synthetic data generation for tabular data

This project helps data professionals create artificial datasets that statistically resemble their real-world tabular data, like customer records or transaction logs. You input your original sensitive data and it outputs a new, entirely fake dataset that maintains the essential patterns and relationships without exposing any private information. This is ideal for data scientists, analysts, and researchers who need to share or develop with data while adhering to privacy regulations.

data-anonymization privacy-compliance data-generation data-analysis machine-learning-development

About synthetic-data-generator

hitsz-ids/synthetic-data-generator

SDG is a specialized framework designed to generate high-quality structured tabular data.

This tool helps data professionals create artificial datasets that mimic the real characteristics of their original structured data, like customer records or transaction logs, without containing any sensitive information. You provide your existing tabular data, and it generates a new, privacy-compliant dataset that can be used for various purposes. It's designed for data scientists, analysts, and developers who need to work with data while adhering to privacy regulations.

data privacy data sharing model training data testing tabular data analysis

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work