donwany/gpt3datagen
GPT3DataGen is a python package that generates fake data for fine-tuning openai models
When you need to customize an OpenAI model for a specific task, you need a lot of examples to teach it. This tool helps you quickly create many examples of text data, formatted correctly for fine-tuning. It takes your instructions for the kind of data you need and outputs ready-to-use data files (like CSVs or JSONL) for your custom AI model. This is for anyone who wants to fine-tune an OpenAI model for tasks like classification or text completion, but doesn't have enough real-world examples.
No commits in the last 6 months. Available on PyPI.
Use this if you need a large volume of formatted text data to train your custom OpenAI model, but don't have enough real-world examples readily available.
Not ideal if you already have a comprehensive dataset of real-world examples for your specific fine-tuning task.
Stars
9
Forks
2
Language
Python
License
MIT
Category
Last pushed
Mar 15, 2023
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/donwany/gpt3datagen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
vorniches/snap2txt
Convert your project into a text prompt.
promptslab/openai-detector
AI classifier for indicating AI-written text
awekrx/ChatGPT-MidJourney-prompt
This is a ChatGPT based prompt generation model for MidJorney. The purpose of this model is to...
flatypus/flowchat
A Python library for building clean and efficient multi-step prompt chains.
whwu95/GPT4Vis
GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?