HazyResearch/data-centric-ai

Resources for Data Centric AI

45
/ 100
Emerging

This resource provides a curated collection of techniques and insights for improving artificial intelligence systems by focusing on the quality and structure of data, rather than just tweaking models. It offers guidance on how to programmatically label data, augment datasets, and clean information to get better results from your AI. This is for AI practitioners, machine learning engineers, and data scientists who are building or deploying AI models and need practical strategies to enhance their performance.

1,134 stars. No commits in the last 6 months.

Use this if you are struggling with poor AI model performance and suspect the issue lies with your training data rather than the model architecture itself.

Not ideal if you are primarily interested in developing new AI model architectures or optimizing existing ones without touching the underlying data.

AI development machine learning operations data quality dataset preparation model deployment
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

1,134

Forks

118

Language

TeX

License

Apache-2.0

Last pushed

Dec 13, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/HazyResearch/data-centric-ai"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.