aia39/Synthetic-Tabular-Data-Generation-using-CTGAN-and-classify-with-XGboost

This is the repository to generate synthetic tabular data when the tabular data has imbalance in some feature.

21
/ 100
Experimental

This tool helps data analysts and researchers create artificial data entries that mimic the patterns of your existing spreadsheet-like data. It takes your raw tabular data, especially when certain categories are underrepresented, and generates new, synthetic rows. This allows you to work with a larger, more balanced dataset for tasks like predictive modeling or statistical analysis.

No commits in the last 6 months.

Use this if you have an imbalanced dataset where one outcome or category has significantly fewer examples than others, and you need more data for robust analysis or model training.

Not ideal if your primary concern is data privacy and you need to generate synthetic data without any direct reference to existing sensitive information.

data-balancing predictive-modeling dataset-augmentation statistical-analysis classification-problems
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 9 / 25

How are scores calculated?

Stars

7

Forks

1

Language

Jupyter Notebook

License

Last pushed

Jun 11, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/aia39/Synthetic-Tabular-Data-Generation-using-CTGAN-and-classify-with-XGboost"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.