Clearbox-AI/clearbox-synthetic-kit
Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.
This tool helps data professionals create artificial versions of sensitive tabular or time-series datasets. You input your original data, and it generates new, synthetic data that mimics the original's patterns without revealing real individual information. This is ideal for data scientists, analysts, and researchers working with confidential information.
No commits in the last 6 months. Available on PyPI.
Use this if you need to share or analyze data without compromising privacy, such as for machine learning model development, testing, or public research.
Not ideal if you need to work with your exact original data or require perfectly lossless data reproduction.
Stars: 44
Forks: 1
Language: Python
License: Apache-2.0
Category
Last pushed: Sep 24, 2025
Commits (30d): 0
Dependencies: 25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Clearbox-AI/clearbox-synthetic-kit"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
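The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the three trailing path segments of the URL are a category, a repository owner, and a repository name, and that the endpoint returns JSON; neither is confirmed by this page, so the code falls back to raw text if JSON parsing fails. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (assumed layout: category/owner/repo)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for one repository without an API key."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    # Assumption: the response body is JSON. Return the raw text otherwise.
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("generative-ai", "Clearbox-AI", "clearbox-synthetic-kit")
```

Since unauthenticated use is limited to 100 requests/day, it is worth caching the result locally rather than calling `fetch_quality` repeatedly for the same repository.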
Higher-rated alternatives
sdv-dev/SDV
Synthetic data generation for tabular data
sdv-dev/SDGym
Benchmarking synthetic data generation methods.
NVIDIA-NeMo/DataDesigner
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch...
AlexanderVNikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations,...
mostly-ai/mostlyai
Synthetic Data SDK ✨