SDV and synthetic-data-generator
The two tools appear to be **competitors**, as both are designed to generate high-quality structured tabular synthetic data, with SDV having significantly higher adoption based on stars and monthly downloads.
About SDV
sdv-dev/SDV
Synthetic data generation for tabular data
This project helps data professionals create artificial datasets that statistically resemble their real-world tabular data, like customer records or transaction logs. You input your original sensitive data and it outputs a new, entirely fake dataset that maintains the essential patterns and relationships without exposing any private information. This is ideal for data scientists, analysts, and researchers who need to share or develop with data while adhering to privacy regulations.
About synthetic-data-generator
hitsz-ids/synthetic-data-generator
SDG is a specialized framework designed to generate high-quality structured tabular data.
This tool helps data professionals create artificial datasets that mimic the real characteristics of their original structured data, like customer records or transaction logs, without containing any sensitive information. You provide your existing tabular data, and it generates a new, privacy-compliant dataset that can be used for various purposes. It's designed for data scientists, analysts, and developers who need to work with data while adhering to privacy regulations.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work