KennethEnevoldsen/augmenty

Augmenty is an augmentation library based on spaCy for augmenting texts.

/ 100

Emerging

When training text classification or entity recognition models, you often need more diverse examples to improve accuracy. This tool takes your existing text data with its assigned labels and generates new, subtly varied versions. It helps machine learning engineers and data scientists expand their text datasets for more robust model training.
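As a rough illustration of the idea described above, here is a minimal, standard-library-only sketch of label-preserving augmentation. It is not augmenty's API: the function names `swap_adjacent_chars` and `augment_dataset`, and the `level`, `copies`, and `seed` parameters, are hypothetical and chosen only for this example. It assumes each example is a `(text, label)` pair and uses a simple adjacent-letter swap as the "subtle variation".

```python
import random


def swap_adjacent_chars(text, level, rng):
    """Swap neighbouring letters with probability `level` (hypothetical helper).

    Only pairs of alphabetic characters are swapped, so spaces and punctuation
    stay in place and the text length never changes.
    """
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        if chars[i].isalpha() and chars[i + 1].isalpha() and rng.random() < level:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip past the swapped pair so a letter moves at most once
        else:
            i += 1
    return "".join(chars)


def augment_dataset(examples, level=0.1, copies=2, seed=0):
    """Return the original (text, label) pairs plus `copies` noisy variants each."""
    rng = random.Random(seed)  # seeded for reproducible output
    augmented = []
    for text, label in examples:
        augmented.append((text, label))  # keep the original example
        for _ in range(copies):
            # The label is carried over unchanged to every variant.
            augmented.append((swap_adjacent_chars(text, level, rng), label))
    return augmented


examples = [
    ("the service was excellent", "positive"),
    ("the food arrived cold", "negative"),
]
augmented = augment_dataset(examples)
for text, label in augmented:
    print(label, "|", text)
```

Because this particular perturbation keeps the text length and word boundaries intact, character offsets for entity annotations would remain valid; augmentations that insert or delete text would need to realign the spans as well.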

157 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to create more training data from your existing labeled text to improve the performance of your natural language processing models.

Not ideal if you're looking for a general-purpose text generation tool for creative writing or content creation.

NLP data-augmentation machine-learning-training text-classification entity-recognition

Stale 6m

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 10 / 25

Stars

157

Forks

Language

Python

License

MIT

Higher-rated alternatives

dsfsi/textaugment

TextAugment: Text Augmentation Library

425776024/nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

searchableai/KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

SanghunYun/UDA_pytorch

UDA (Unsupervised Data Augmentation) implemented in PyTorch

google-research/uda

Unsupervised Data Augmentation (UDA)
