google-research/uda
Unsupervised Data Augmentation (UDA)
This project helps machine learning practitioners significantly reduce the amount of labeled data needed to train high-performing models for tasks like classifying images or text. You feed it a small set of labeled examples alongside a larger pool of unlabeled data, and it outputs a highly accurate classification model. Data scientists, machine learning engineers, and researchers can use this to build effective models even when manual data labeling is expensive or time-consuming.
2,202 stars. No commits in the last 6 months.
Use this if you have a classification problem (image or text) but only a limited number of labeled examples, along with a larger quantity of unlabeled data.
Not ideal if your task is not classification, you lack any unlabeled data, or you have ample labeled data readily available.
Stars
2,202
Forks
312
Language
Python
License
Apache-2.0
Category
Last pushed
Aug 28, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/google-research/uda"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
dsfsi/textaugment
TextAugment: Text Augmentation Library
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
SanghunYun/UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.