argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

56
/ 100
Established

Argilla is a collaboration tool that helps AI engineers and domain experts work together to build high-quality datasets. It takes raw data, like text or images, and allows human experts to label, evaluate, and refine it, producing accurately tagged datasets ready for training and improving AI models. This is for anyone involved in developing AI applications, from machine learning engineers to subject matter experts, who needs to ensure their models are built on reliable, human-validated data.

4,895 stars.

Use this if you need to create, refine, and continuously improve datasets with human feedback for your AI models, especially for tasks like text classification, named entity recognition, or evaluating large language models.

Not ideal if you are looking for a tool that automates data labeling entirely without human input, or if your primary need is for a general-purpose data management system not focused on AI dataset curation.

AI data labeling dataset curation human-in-the-loop ML NLP data preparation LLM fine-tuning data
No Package No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

4,895

Forks

475

Language

Python

License

Apache-2.0

Last pushed

Mar 09, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/argilla-io/argilla"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.