xlang-ai/icl-selective-annotation

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"

32
/ 100
Emerging

This project helps natural language processing (NLP) practitioners create high-quality datasets for new tasks more efficiently. It takes unlabeled text data and, using a novel selective annotation method, identifies the most impactful examples for human annotators to label. The result is a smaller, more effective dataset that teaches large language models new tasks with significantly less annotation effort.

109 stars. No commits in the last 6 months.

Use this if you need to quickly adapt large language models to new text-based classification, reasoning, or generation tasks and want to drastically reduce the cost and time spent on data annotation.

Not ideal if you already have a fully labeled, extensive dataset for your task or if your task doesn't involve natural language processing.

natural-language-processing data-annotation machine-learning-engineering text-classification data-efficiency
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

109

Forks

16

Language

Python

License

Last pushed

Jul 15, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/xlang-ai/icl-selective-annotation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.