yumeng5/FewGen

[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning

29
/ 100
Experimental

This project helps developers and researchers working with Natural Language Understanding (NLU) tasks to improve their models when only a small amount of labeled training data is available. It takes a few existing examples for a classification task and generates a larger, synthetic dataset. The output is an enhanced training set that can be used to fine-tune NLU classifiers for better performance.

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher dealing with NLU classification problems and have limited labeled data for training your models.

Not ideal if you have abundant labeled data, are not working on NLU tasks, or do not have access to substantial GPU resources for model tuning.

Natural Language Processing few-shot learning data augmentation text classification language model tuning
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 5 / 25

How are scores calculated?

Stars

44

Forks

2

Language

Python

License

Apache-2.0

Last pushed

May 10, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yumeng5/FewGen"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.