daochenzha/data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
This is a curated collection of resources for improving AI system performance by focusing on data quality and quantity. It links to research papers, tutorials, blogs, and codebases covering techniques for developing training data, developing inference data, and maintaining data quality. Data scientists, machine learning engineers, and AI researchers interested in practical data-centric AI methodologies will find it useful.
1,138 stars. No commits in the last 6 months.
Use this if you want to explore techniques for engineering your datasets to enhance AI model performance, rather than just tweaking model architectures.
Not ideal if you are looking for a pre-built tool or software to directly implement data-centric AI solutions.
Stars: 1,138
Forks: 79
Language: not specified
License: not specified
Category:
Last pushed: Jun 26, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/daochenzha/data-centric-AI"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
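The same request as the curl command above can be made from a script. The following is a minimal sketch in Python using only the standard library. It assumes that the response body is JSON and that the path follows the category/owner/repository pattern seen in the curl example; this page confirms neither. The helper names `build_quality_url` and `fetch_quality` are hypothetical and exist only for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL from a category slug and an 'owner/name' repo."""
    owner, name = repo.split("/", 1)  # raises ValueError if there is no '/'
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(name)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and parse the body.

    Assumption: the endpoint returns JSON. If it does not,
    json.loads raises an exception (json.JSONDecodeError).
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("ml-frameworks", "daochenzha/data-centric-AI")
```

Because keyless access is limited to 100 requests/day, it is worth caching the result locally rather than calling `fetch_quality` repeatedly.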
Higher-rated alternatives
voxel51/fiftyone
Refine high-quality datasets and visual AI models
academic/awesome-datascience
An awesome Data Science repository to learn and apply for real world problems.
sacridini/Awesome-Geospatial
Long list of geospatial tools and resources
r0f1/datascience
Curated list of Python resources for data science.
nhivp/Awesome-Embedded
A curated list of awesome embedded programming.