KennethEnevoldsen/augmenty
Augmenty is a text augmentation library built on spaCy.
When training text classification or entity recognition models, you often need more diverse examples to improve accuracy. This tool takes your existing text data with its assigned labels and generates new, subtly varied versions. It helps machine learning engineers and data scientists expand their text datasets for more robust model training.
157 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to create more training data from your existing labeled text to improve the performance of your natural language processing models.
Not ideal if you're looking for a general-purpose text generation tool for creative writing or content creation.
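The idea of generating varied copies of labeled text while keeping the labels valid can be illustrated with a minimal standard-library sketch. This is not Augmenty's API: the function name `swap_case_augment`, its `level` and `seed` parameters, and the `(start, end, label)` entity format are all hypothetical choices made for this example. The sketch flips the case of random characters, which never changes the text length, so character-offset entity labels stay aligned with the augmented text.

```python
import random

# Hypothetical helper for illustration only; this is NOT Augmenty's API.
def swap_case_augment(text, entities, level=0.1, seed=0):
    """Randomly flip the case of characters without changing text length.

    `entities` is a list of (start, end, label) character spans. Because
    the length of the text is preserved, the same spans remain valid for
    the augmented text.
    """
    rng = random.Random(seed)
    out = []
    for ch in text:
        swapped = ch.swapcase()
        # Skip characters whose case-swapped form has a different length
        # (for example, the German sharp s), so offsets never shift.
        if len(swapped) == 1 and rng.random() < level:
            out.append(swapped)
        else:
            out.append(ch)
    return "".join(out), list(entities)


text = "Kenneth lives in Aarhus."
entities = [(0, 7, "PERSON"), (17, 23, "GPE")]

augmented, aug_entities = swap_case_augment(text, entities, level=0.3, seed=1)

# Each labeled span still covers the same words in the augmented text.
for start, end, label in aug_entities:
    print(label, repr(augmented[start:end]))
```

A length-preserving transformation is the simplest case. Augmentations that insert or delete tokens need to recompute the span offsets, which is the kind of bookkeeping a spaCy-based library can take over.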
Stars: 157
Forks: 10
Language: Python
License: MIT
Category:
Last pushed: May 24, 2024
Commits (30d): 0
Dependencies: 5
Reverse dependents: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/KennethEnevoldsen/augmenty"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
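The same request can be sketched in Python with only the standard library. Several things here are assumptions rather than documented behavior: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category, owner, and repository is inferred from the single URL above, and the response is assumed to be JSON.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL.

    Assumption: the path segments are category/owner/repo, inferred
    from the one example URL shown on this page.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the quality record without an API key.

    Assumption: the endpoint returns a JSON body.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request; counts against the daily limit.
    print(fetch_quality("nlp", "KennethEnevoldsen", "augmenty"))
```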
Higher-rated alternatives
dsfsi/textaugment
TextAugment: Text Augmentation Library
425776024/nlpcda
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
SanghunYun/UDA_pytorch
UDA (Unsupervised Data Augmentation) implemented in PyTorch
google-research/uda
Unsupervised Data Augmentation (UDA)