xlang-ai/icl-selective-annotation

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"

/ 100

Emerging

This project helps natural language processing (NLP) practitioners create high-quality datasets for new tasks more efficiently. It takes unlabeled text data and, using a novel selective annotation method, identifies the most impactful examples for human annotators to label. The result is a smaller, more effective dataset that teaches large language models new tasks with significantly less annotation effort.

109 stars. No commits in the last 6 months.

Use this if you need to quickly adapt large language models to new text-based classification, reasoning, or generation tasks and want to drastically reduce the cost and time spent on data annotation.

Not ideal if you already have a fully labeled, extensive dataset for your task or if your task doesn't involve natural language processing.

natural-language-processing data-annotation machine-learning-engineering text-classification data-efficiency

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

109

Forks

Language

Python

License

—

Higher-rated alternatives

n-waves/multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...

princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

yxuansu/SimCTG

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of...

Explore NLP Tools

All categories Trending NLP directory Insights