tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

65
/ 100
Established

This tool helps you generate entirely new, realistic tabular datasets based on an existing one, or fill in missing values within your data. You provide a spreadsheet-like dataset, and it produces a similar-looking, new dataset that preserves the patterns and relationships of your original. Data scientists, researchers, and analysts can use this for various purposes like expanding small datasets, creating test data, or addressing data privacy concerns.

350 stars. Available on PyPI.

Use this if you need to create synthetic tabular datasets for testing, augmenting small datasets, or sharing data while protecting sensitive information.

Not ideal if you require synthetic data with absolute mathematical guarantees of privacy or if your primary goal is simple data anonymization rather than full generation.

data-generation data-augmentation data-privacy synthetic-data dataset-expansion
Maintenance 10 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

350

Forks

58

Language

Python

License

MIT

Last pushed

Feb 09, 2026

Commits (30d)

0

Dependencies

9

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/tabularis-ai/be_great"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.