HazyResearch/meerkat
Explore and understand your training and validation data.
Meerkat helps machine learning practitioners, data scientists, and researchers explore, visualize, and annotate complex datasets, especially those with unstructured data like images, video, audio, or free text. It allows you to bring in your raw data and machine learning model outputs, creating interactive views to understand model behavior and data quality. This helps you efficiently identify patterns, spot errors, and prepare validation data for your AI projects.
852 stars. No commits in the last 6 months.
Use this if you need to interactively explore and understand unstructured data types, such as images, text, or video, often in conjunction with machine learning model predictions.
Not ideal if you are exclusively working with structured numerical/categorical data, creating simple single-input/output model demos, or need a robust platform for large-scale, team-based manual data labeling.
Stars: 852
Forks: 44
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 24, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/HazyResearch/meerkat"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
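The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the endpoint returns JSON and that the URL follows the pattern category/owner/repo seen in the curl example above; neither is confirmed by this page. The helper names `quality_url` and `fetch_quality` are hypothetical, chosen for illustration.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL from a category slug and an owner/name repo path.

    Assumption: other repositories follow the same category/owner/repo pattern.
    """
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON; this page does not document the format.
    """
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# record = fetch_quality("ml-frameworks", "HazyResearch/meerkat")
```

Without a key the limit is 100 requests/day, so a script that polls many repositories may want to cache responses locally.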
Higher-rated alternatives
skrub-data/skrub
Machine learning with dataframes
biolab/orange3
🍊 📊 💡 Orange: Interactive data analysis
root-project/root
The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
cleanlab/cleanlab
Cleanlab's open-source library is the standard data-centric AI package for data quality and...
drivendataorg/deon
A command line tool to easily add an ethics checklist to your data science projects.