dsfsi/textaugment
TextAugment: Text Augmentation Library
When you have a limited amount of text data for training machine learning models, this tool helps you automatically generate more diverse sentences without manual effort. You input your existing text data, and it provides new, synthetically varied sentences. This is useful for data scientists, machine learning engineers, and researchers working on natural language processing tasks who need to improve their model's performance by expanding their dataset.
433 stars. Available on PyPI.
Use this if you need to quickly and easily create more training examples from your existing text data to boost the accuracy of your text classification or other NLP models.
Not ideal if you require entirely new, contextually unique text data that isn't derived from existing examples, or if you need to augment non-textual data.
Stars
433
Forks
60
Language
Python
License
MIT
Category
Last pushed
Mar 04, 2026
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dsfsi/textaugment"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
SanghunYun/UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch
google-research/uda
Unsupervised Data Augmentation (UDA)
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.