cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

59
/ 100
Established

Building a computer vision model often starts with gathering many images. CleanVision helps you automatically check a folder of images for common flaws like blurriness, over/under-exposure, or duplicates. It takes your raw image files and produces a report identifying these issues so you can address them before training your model. This is for anyone preparing image datasets for machine learning applications, from AI researchers to data scientists.

1,158 stars. Used by 2 other packages. Available on PyPI.

Use this if you need to quickly identify and fix quality problems within a large collection of raw images intended for a computer vision project.

Not ideal if you need to find issues with the labels associated with your images, rather than the images themselves.

computer-vision image-processing machine-learning-engineering data-quality ai-development
Maintenance 6 / 25
Adoption 12 / 25
Maturity 25 / 25
Community 16 / 25

How are scores calculated?

Stars

1,158

Forks

75

Language

Python

License

Apache-2.0

Last pushed

Jan 08, 2026

Commits (30d)

0

Dependencies

8

Reverse dependents

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/cleanlab/cleanvision"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.