All Synthetic Data Tools

13 tools ranked by quality score

# Tool Score Tier
1 benkeen/generatedata

A powerful, feature-rich, random test data generator.

71
Verified
2 sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

67
Established
3 databrickslabs/dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks...

63
Established
4 DexForce/EmbodiChain

An end-to-end, GPU-accelerated, and modular platform for building...

63
Established
5 synthesized-io/tdk-demo

This is a collection of TDK demo projects that use different databases and options

50
Established
6 Stranger6667/hypothesis-graphql

Generate arbitrary queries matching your GraphQL schema, and use them to...

44
Emerging
7 Buddhi19/SyntheticGen

[IGARSS 2026] Generates synthetic image–mask pairs by denoising joint...

43
Emerging
8 Mukhopadhyay/pyfake

A Flexible and Extensible fake data generator based on Pydantic models.

31
Emerging
9 lasgroup/ActiveUltraFeedback

Code for the paper "ActiveUltraFeedback: Efficient Preference Data...

29
Experimental
10 jaehyeon-kim/dynamic-des

Real-time SimPy control plane to dynamically update parameters and stream...

27
Experimental
11 aborruso/fauxdata

CLI to generate and validate realistic fake datasets from YAML schemas —...

27
Experimental
12 PDBeurope/mmcif-gen

This application is designed to create mmcif files from facilities data.

25
Experimental
13 doachyz/IIoT-simulator

An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0...

24
Experimental