Renumics/sliceguard
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
This tool helps data analysts and machine learning engineers identify problematic segments within their datasets, whether they contain numbers, categories, text, images, or audio. You input your raw data, and it generates an interactive report highlighting suspicious groups of data points that might skew your analysis or model performance. It's designed for anyone who needs to ensure their data quality is high before making decisions or deploying models.
Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly find and visualize hidden issues or biased segments in your complex datasets to improve data quality and model reliability.
Not ideal if you are looking for a comprehensive data labeling or manual data cleaning solution, as it focuses on automated anomaly detection.
Stars
64
Forks
3
Language
Python
License
MIT
Category
Last pushed
Jan 10, 2024
Commits (30d)
0
Dependencies
10
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Renumics/sliceguard"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
skrub-data/skrub
Machine learning with dataframes
biolab/orange3
🍊 :bar_chart: :bulb: Orange: Interactive data analysis
root-project/root
The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
cleanlab/cleanlab
Cleanlab's open-source library is the standard data-centric AI package for data quality and...
drivendataorg/deon
A command line tool to easily add an ethics checklist to your data science projects.