dsfsi/textaugment

TextAugment: Text Augmentation Library

64
/ 100
Established

When you have a limited amount of text data for training machine learning models, this tool helps you automatically generate more diverse sentences without manual effort. You input your existing text data, and it provides new, synthetically varied sentences. This is useful for data scientists, machine learning engineers, and researchers working on natural language processing tasks who need to improve their model's performance by expanding their dataset.

433 stars. Available on PyPI.

Use this if you need to quickly and easily create more training examples from your existing text data to boost the accuracy of your text classification or other NLP models.

Not ideal if you require entirely new, contextually unique text data that isn't derived from existing examples, or if you need to augment non-textual data.

natural-language-processing machine-learning-data text-classification data-augmentation model-training
Maintenance 10 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

433

Forks

60

Language

Python

License

MIT

Last pushed

Mar 04, 2026

Commits (30d)

0

Dependencies

6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dsfsi/textaugment"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.