BatsResearch/alfred
A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.
Alfred helps non-technical users quickly label and classify large amounts of text or images for machine learning projects. It takes raw data like documents or pictures and, by using large language models, suggests classifications or labels. This tool is designed for data scientists, researchers, or anyone who needs to prepare high-quality labeled datasets for training AI models, but lacks the time or resources for extensive manual annotation.
No commits in the last 6 months.
Use this if you need to rapidly annotate large datasets with consistent labels for machine learning, especially when dealing with text or image classification tasks, and want to leverage AI models to accelerate the process.
Not ideal if your annotation tasks require highly subjective human judgment that even advanced AI struggles with, or if you prefer purely manual annotation for maximum control.
Stars
58
Forks
9
Language
Python
License
BSD-3-Clause
Category
Last pushed
Apr 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/BatsResearch/alfred"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVIDIA-NeMo/Curator
Scalable data pre processing and curation toolkit for LLMs
MigoXLab/dingo
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
TheDataStation/pneuma
LLM-Powered Data Discovery System for Tabular Data
cleanlab/cleanlab-studio
Client interface to Cleanlab Studio