tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models


Established

This tool generates entirely new, realistic tabular datasets based on an existing one, or fills in missing values within your data. You provide a spreadsheet-like dataset, and it produces a new, similar-looking dataset that preserves the patterns and relationships of the original. Data scientists, researchers, and analysts can use it to expand small datasets, create test data, or address data privacy concerns.
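To give a rough idea of how a language model can work with a table at all: the GReaT paper behind this package describes turning each row into a short sentence of "column is value" clauses in shuffled order before fine-tuning. The sketch below illustrates only that idea. The function names `row_to_text` and `text_to_row` are hypothetical, the parsing is deliberately naive, and none of this is the package's own code.

```python
import random


def row_to_text(row, rng):
    """Serialise one record as 'column is value' clauses in shuffled order."""
    items = list(row.items())
    rng.shuffle(items)  # shuffling avoids baking a fixed column order into the text
    return ", ".join(f"{col} is {val}" for col, val in items)


def text_to_row(text):
    """Parse the clauses back into a dict; every value comes back as a string.

    Naive on purpose: values containing ', ' or ' is ' would break it.
    """
    row = {}
    for clause in text.split(", "):
        col, sep, val = clause.partition(" is ")
        if sep:  # skip clauses that do not match the expected pattern
            row[col] = val
    return row


rng = random.Random(0)  # seeded so repeated runs give the same clause order
record = {"age": 39, "education": "Bachelors", "income": "<=50K"}
sentence = row_to_text(record, rng)
print(sentence)
print(text_to_row(sentence))
```

Generated sentences can be parsed back into rows the same way. A real pipeline would also need to restore column types and discard malformed generations.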

350 stars. Available on PyPI.

Use this if you need to create synthetic tabular datasets for testing, augmenting small datasets, or sharing data while protecting sensitive information.

Not ideal if you require synthetic data with absolute mathematical guarantees of privacy or if your primary goal is simple data anonymization rather than full generation.

data-generation data-augmentation data-privacy synthetic-data dataset-expansion

Maintenance 10 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 20 / 25


Stars

350

Forks

Language

Python

License

MIT

Related models

EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...

shibing624/textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...

ai-forever/ru-gpts

Russian GPT3 models.

AdityaNG/kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...

zemlyansky/gpt-tfjs

GPT in TensorFlow.js
