always-further/deepfabric
Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline
DeepFabric helps AI developers and researchers create specialized, high-quality synthetic datasets to train language models and evaluate agent behaviors. It takes a high-level topic prompt and system instructions, then generates structured training samples that teach models to reason, plan, and use tools correctly. This tool is ideal for anyone developing or testing intelligent agents and language models that need to perform complex tasks reliably.
843 stars.
Use this if you need to generate diverse, domain-specific training data to teach your language models or AI agents how to think, call tools, and follow schemas without overfitting.
Not ideal if you are looking for a general-purpose text generation tool or if your models do not require complex reasoning, tool use, or strict adherence to structured outputs.
Stars
843
Forks
76
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/always-further/deepfabric"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
meta-llama/synthetic-data-kit
Tool for generating high quality Synthetic datasets
Diyago/Tabular-data-generation
We well know GANs for success in the realistic image generation. However, they can be applied in...
Data-Centric-AI-Community/ydata-synthetic
Synthetic data generators for tabular and time-series data
tdspora/syngen
Open-source version of the TDspora synthetic data generation algorithm.
vanderschaarlab/synthcity
A library for generating and evaluating synthetic tabular data for privacy, fairness and data...