zhaominyiz/EPiDA

Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022

/ 100

Experimental

This framework helps machine learning practitioners improve the accuracy of text classification models, especially when working with limited labeled data. It takes your existing text datasets and generates additional, similar text examples, resulting in a more robust and accurate classification model. Data scientists and AI researchers focused on natural language processing would find this useful for enhancing their text-based machine learning projects.

No commits in the last 6 months.

Use this if you need to train a high-performing text classifier but have a small or imbalanced dataset, and you want to artificially expand your training data to achieve better results.

Not ideal if your primary goal is to interpret the linguistic features of your text or if you have an abundance of labeled data, making augmentation unnecessary.

natural-language-processing text-analytics machine-learning-engineering dataset-augmentation AI-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

dsfsi/textaugment

TextAugment: Text Augmentation Library

425776024/nlpcda

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

google-research/uda

Unsupervised Data Augmentation (UDA)

searchableai/KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

SanghunYun/UDA_pytorch

UDA(Unsupervised Data Augmentation) implemented by pytorch

Explore NLP Tools

All categories Trending NLP directory Insights