425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
This tool helps people who work with Chinese text data to expand their datasets. You provide existing Chinese text, and it generates multiple variations of that text, carefully designed to retain the original meaning. This is useful for anyone training natural language processing (NLP) models, such as AI engineers or data scientists, who need more diverse training examples to improve model performance.
1,878 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to create more diverse training data from your existing Chinese text corpus to make your NLP models more robust and performant.
Not ideal if your primary goal is to achieve marginal accuracy increases in competitive leaderboards, as this tool primarily focuses on enhancing model generalization rather than raw scoring.
Stars
1,878
Forks
172
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 18, 2025
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/425776024/nlpcda"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
dsfsi/textaugment
TextAugment: Text Augmentation Library
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
SanghunYun/UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch
google-research/uda
Unsupervised Data Augmentation (UDA)
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.