tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
This tool helps you generate entirely new, realistic tabular datasets based on an existing one, or fill in missing values within your data. You provide a spreadsheet-like dataset, and it produces a similar-looking, new dataset that preserves the patterns and relationships of your original. Data scientists, researchers, and analysts can use this for various purposes like expanding small datasets, creating test data, or addressing data privacy concerns.
350 stars. Available on PyPI.
Use this if you need to create synthetic tabular datasets for testing, augmenting small datasets, or sharing data while protecting sensitive information.
Not ideal if you require synthetic data with absolute mathematical guarantees of privacy or if your primary goal is simple data anonymization rather than full generation.
Stars
350
Forks
58
Language
Python
License
MIT
Category
Last pushed
Feb 09, 2026
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/tabularis-ai/be_great"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...
ai-forever/ru-gpts
Russian GPT3 models.
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...
zemlyansky/gpt-tfjs
GPT in TensorFlow.js