SanghunYun/UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch
This project helps improve the accuracy of text classification tasks, like sentiment analysis, even when you have very few labeled examples. It takes your existing text data, both labeled and unlabeled, and processes it to train a more robust text classification model. This is for data scientists, machine learning engineers, or researchers working with natural language processing who need to build high-performing text models with limited annotated data.
278 stars. No commits in the last 6 months.
Use this if you need to train accurate text classification models but have a small number of labeled examples and a larger pool of unlabeled text.
Not ideal if you have a large, fully labeled dataset for your text classification problem, or if you're not working with text data.
Stars
278
Forks
59
Language
Python
License
Apache-2.0
Category
Last pushed
Dec 13, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SanghunYun/UDA_pytorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
dsfsi/textaugment
TextAugment: Text Augmentation Library
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
google-research/uda
Unsupervised Data Augmentation (UDA)
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.