dsfsi/textaugment

TextAugment: Text Augmentation Library

/ 100

Established

When you have a limited amount of text data for training machine learning models, this tool helps you automatically generate more diverse sentences without manual effort. You input your existing text data, and it provides new, synthetically varied sentences. This is useful for data scientists, machine learning engineers, and researchers working on natural language processing tasks who need to improve their model's performance by expanding their dataset.

433 stars. Available on PyPI.

Use this if you need to quickly and easily create more training examples from your existing text data to boost the accuracy of your text classification or other NLP models.

Not ideal if you require entirely new, contextually unique text data that isn't derived from existing examples, or if you need to augment non-textual data.

natural-language-processing machine-learning-data text-classification data-augmentation model-training

Maintenance 10 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 19 / 25

How are scores calculated?

Stars

433

Forks

Language

Python

License

MIT

Compare

textaugment and augmenty

Related tools

425776024/nlpcda

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

searchableai/KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

SanghunYun/UDA_pytorch

UDA(Unsupervised Data Augmentation) implemented by pytorch

google-research/uda

Unsupervised Data Augmentation (UDA)

KennethEnevoldsen/augmenty

Augmenty is an augmentation library based on spaCy for augmenting texts.

Explore NLP Tools

All categories Trending NLP directory Insights