autonlab/aqua
AQuA: A Benchmarking Tool for Label Quality Assessment, NeurIPS'23 D&B
AQuA helps machine learning engineers and researchers assess label quality in their datasets. Given a dataset, it benchmarks different label error detection methods and reports how well each one identifies mislabeled data, so you can choose the most effective cleaning strategy before training your models.
No commits in the last 6 months.
Use this if you need an objective comparison of methods for identifying and correcting label errors in machine learning datasets across various data types.
Not ideal if you're looking for a simple, automated 'fix-all' solution for label errors without wanting to compare different detection methods.
Stars: 23
Forks: 1
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Oct 17, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/autonlab/aqua"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
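The endpoint above follows an `owner/repo` path pattern. As a minimal sketch, the same request could be made from Python; note that the response schema is not documented here, so the code only assumes the endpoint returns JSON:

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks"

def repo_quality_url(owner: str, repo: str) -> str:
    """Build the API URL for a given GitHub owner/repo pair."""
    return f"{API_BASE}/{owner}/{repo}"

def fetch_repo_quality(owner: str, repo: str) -> dict:
    """Fetch quality data for a repo.

    Assumes the endpoint returns a JSON body; the actual schema is
    not documented on this page, so inspect the result before use.
    """
    with urllib.request.urlopen(repo_quality_url(owner, repo)) as resp:
        return json.load(resp)

# URL for the repo on this page:
print(repo_quality_url("autonlab", "aqua"))
# https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/autonlab/aqua
```

Within the free tier (100 requests/day without a key), `fetch_repo_quality("autonlab", "aqua")` would retrieve the same data the curl command returns.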
Higher-rated alternatives
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.