Data-Centric-AI-Community/nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!
This project helps data practitioners and privacy engineers learn how to create and evaluate synthetic data for public release. You'll work with sensitive private datasets and generate de-identified, synthetic versions, learning to assess their quality and privacy. This is for anyone who needs to share data while protecting individual privacy, such as researchers or data stewards.
No commits in the last 6 months.
Use this if you need to understand, generate, and evaluate synthetic data for safely sharing sensitive information while adhering to privacy standards.
Not ideal if you're looking for a quick, automated solution without needing to understand the underlying methods and evaluation principles.
Stars: 27
Forks: 2
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jul 13, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Data-Centric-AI-Community/nist-crc-2023"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
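For scripted use, the same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, treating the `ml-frameworks` path segment as a category is an inference from the URL above, and the response is assumed (not confirmed by this page) to be JSON, with a fallback to raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, mirroring the path layout of the curl example.

    The parameter names are hypothetical labels for the three path segments.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (keyless access, per the page)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    # Assumption: the endpoint returns JSON. If it does not, return raw text.
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("ml-frameworks", "Data-Centric-AI-Community", "nist-crc-2023")` issues the same request as the curl command. Each call counts against the daily request limit, so cache results locally when polling many repositories.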
Higher-rated alternatives
Diyago/Tabular-data-generation
GANs are well known for their success in realistic image generation. However, they can be applied in...
meta-llama/synthetic-data-kit
Tool for generating high-quality synthetic datasets
Data-Centric-AI-Community/ydata-synthetic
Synthetic data generators for tabular and time-series data
tdspora/syngen
Open-source version of the TDspora synthetic data generation algorithm.
vanderschaarlab/synthcity
A library for generating and evaluating synthetic tabular data for privacy, fairness and data...