Itachi-Uchiha581/Auto-Data

Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).

36
/ 100
Emerging

This tool helps developers and AI practitioners create custom datasets to train or fine-tune Large Language Models (LLMs). You input a topic, desired output format (JSON, Parquet), and an optional system prompt, and it generates realistic conversation data. This helps overcome the common challenge of scarce or imbalanced data when building specialized AI assistants or agents.

106 stars. No commits in the last 6 months.

Use this if you need to quickly generate high-quality, topic-specific conversation data for fine-tuning Large Language Models.

Not ideal if you are looking for a tool to process or analyze existing datasets, rather than generate new ones.

AI development Large Language Models data generation model fine-tuning custom AI agents
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 11 / 25

How are scores calculated?

Stars

106

Forks

9

Language

Python

License

GPL-3.0

Last pushed

Oct 31, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Itachi-Uchiha581/Auto-Data"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.