daochenzha/data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

34
/ 100
Emerging

This is a curated collection of resources for improving AI system performance by focusing on data quality and quantity. It provides links to research papers, tutorials, blogs, and codebases covering techniques for developing training and inference data, and maintaining data quality. Data scientists, machine learning engineers, and AI researchers interested in practical data-centric AI methodologies would find this valuable.

1,138 stars. No commits in the last 6 months.

Use this if you want to explore techniques for engineering your datasets to enhance AI model performance, rather than just tweaking model architectures.

Not ideal if you are looking for a pre-built tool or software to directly implement data-centric AI solutions.

AI development machine learning engineering data quality AI research model training
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 16 / 25

How are scores calculated?

Stars

1,138

Forks

79

Language

License

Last pushed

Jun 26, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/daochenzha/data-centric-AI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.