donwany/gpt3datagen

GPT3DataGen is a Python package that generates fake data for fine-tuning OpenAI models.

43 / 100 (Emerging)

Customizing an OpenAI model for a specific task takes many training examples. This tool quickly generates large numbers of text examples, already formatted for fine-tuning: you describe the kind of data you need, and it outputs ready-to-use data files (such as CSV or JSONL). It is for anyone who wants to fine-tune an OpenAI model for tasks like classification or text completion but doesn't have enough real-world examples.
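To make the "ready-to-use JSONL" output concrete, here is a minimal sketch of the prompt/completion JSONL layout used by OpenAI's legacy fine-tuning format. It does not use GPT3DataGen's own interface, which this page does not show; the `to_jsonl` helper and the sample sentiment examples are hypothetical and only illustrate the file shape.

```python
import json


def to_jsonl(examples):
    """Serialize (prompt, completion) pairs as one JSON object per line."""
    lines = []
    for prompt, completion in examples:
        lines.append(json.dumps({"prompt": prompt, "completion": completion}))
    return "\n".join(lines) + "\n"


# Hypothetical classification examples, invented for illustration only.
examples = [
    ("The battery died after a day ->", " negative"),
    ("Setup took two minutes ->", " positive"),
]

jsonl_text = to_jsonl(examples)
print(jsonl_text, end="")
```

Each line is an independent JSON object, so the file can be appended to or streamed without parsing it as a whole.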

No commits in the last 6 months. Available on PyPI.

Use this if you need a large volume of formatted text data to train your custom OpenAI model, but don't have enough real-world examples readily available.

Not ideal if you already have a comprehensive dataset of real-world examples for your specific fine-tuning task.

AI model training, natural language processing, text generation, data synthesis, machine learning
Stale 6m
Maintenance 0 / 25
Adoption 5 / 25
Maturity 25 / 25
Community 13 / 25

Stars: 9
Forks: 2
Language: Python
License: MIT
Last pushed: Mar 15, 2023
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/donwany/gpt3datagen"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
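The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions not confirmed by this page: that the endpoint returns JSON, and that the URL path follows a category/owner/repo pattern for other projects. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path pattern assumed from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call; counts against the daily request limit.
    print(fetch_quality("prompt-engineering", "donwany", "gpt3datagen"))
```

No API key is sent here, so calls fall under the keyless daily limit.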