SilenceX12138/TabStruct
🗼 [ICLR 2026 Oral] Official implementation of “TabStruct: Measuring Structural Fidelity of Tabular Data”
This tool helps researchers and practitioners evaluate how well synthetic tabular data generators preserve the structure and characteristics of real-world datasets. You provide real tabular data and one or more synthetic data generation models, and it reports evaluation metrics covering structural fidelity, privacy preservation, and the performance of machine learning models trained on the synthetic data. It is aimed at data scientists, machine learning researchers, and anyone working with synthetic data for privacy or augmentation.
Use this if you need to rigorously compare and benchmark different synthetic tabular data generators or predictive models, especially when structural integrity and data utility are critical concerns.
Not ideal if you want a simple, one-click way to generate synthetic data and do not need to analyze or compare the fidelity of different generation methods.
Stars: 11
Forks: 1
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Mar 04, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/SilenceX12138/TabStruct"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
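For scripted use, the same request can be made from Python. The sketch below is a minimal, standard-library-only equivalent of the curl command above. It assumes that the URL path follows a category/owner/repository pattern (inferred from the single example URL) and that the endpoint returns JSON; neither is confirmed by this page, and the helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown above.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response body is JSON. The response schema is not
    documented on this page, so the decoded object is returned unchanged.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("generative-ai", "SilenceX12138", "TabStruct"))
```

Without a key the stated limit is 100 requests per day, so a script that polls many repositories would likely want to cache responses locally rather than refetch them.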
Higher-rated alternatives
sdv-dev/SDV
Synthetic data generation for tabular data
sdv-dev/SDGym
Benchmarking synthetic data generation methods.
NVIDIA-NeMo/DataDesigner
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch...
AlexanderVNikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations,...
mostly-ai/mostlyai
Synthetic Data SDK ✨