Synthetic Data Generation RAG Tools

Three synthetic data generation tools are tracked. One scores above 50 (the Established tier). The highest-rated is nicolas-hbt/pygraft at 51/100 with 699 stars.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=rag&subcategory=synthetic-data-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
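The same request can be made from a script. The following is a minimal sketch using only the Python standard library; the endpoint and query parameters are taken from the curl command above, while the function names `build_url` and `fetch_projects` are hypothetical helpers introduced here for illustration. The shape of the JSON response is not documented on this page, so the sketch returns the parsed body as-is rather than assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="rag", subcategory="synthetic-data-generation", limit=20):
    """Build the query URL with the same parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and return the parsed JSON body.

    The response schema is an unknown here, so no fields are assumed.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_projects(build_url())
```

Building the URL with `urlencode` rather than by string concatenation keeps the parameters correctly escaped if other domain or subcategory values are substituted.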

#  Tool                    Score  Tier
1  nicolas-hbt/pygraft     51     Established
   Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
2  SciPhi-AI/synthesizer   42     Emerging
   A multi-purpose LLM framework for RAG and data creation.
3  findalexli/SciGraphQA   29     Experimental
   SciGraphQA: Large-Scale Synthetic Multi-Turn Question-Answering Dataset for...