seedatnabeel/Data-SUITE

Data-SUITE: Data-centric identification of in-distribution incongruous examples (ICML 2022)

35
/ 100
Emerging

Data-SUITE helps data scientists or machine learning engineers understand the limitations of their training data and identify unreliable predictions. It takes your existing dataset and a trained model, then it tells you which new data points might not be reliably handled by your model, and it maps out the areas where your training data is sparse or unusual. This helps you know when to trust your model's outputs and where to focus efforts on collecting more relevant data.

No commits in the last 6 months.

Use this if you need to determine which new data instances your trained model can reliably predict and to understand where your existing training data might have gaps or inconsistencies.

Not ideal if you are looking for a tool to automatically improve your model's performance without needing to understand the underlying data limitations.

data-quality machine-learning-operations model-governance data-acquisition predictive-analytics
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

9

Forks

3

Language

Jupyter Notebook

License

MIT

Last pushed

Mar 08, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/seedatnabeel/Data-SUITE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.