nlpcda and nlp-data-augmentation
These are competitors: both provide Chinese NLP data augmentation functionality with overlapping techniques (EDA, BERT-based augmentation), but nlpcda is significantly more mature and widely adopted (6x more stars, active downloads vs. abandoned project).
About nlpcda
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
This tool helps people who work with Chinese text data to expand their datasets. You provide existing Chinese text, and it generates multiple variations of that text, carefully designed to retain the original meaning. This is useful for anyone training natural language processing (NLP) models, such as AI engineers or data scientists, who need more diverse training examples to improve model performance.
About nlp-data-augmentation
quincyliang/nlp-data-augmentation
Data Augmentation for NLP. NLP数据增强
When working with text data for AI models, you often don't have enough examples to train effectively. This project helps you create more varied text samples from your existing data using techniques like synonym replacement, word shuffling, and translation. It's for data scientists, machine learning engineers, and NLP practitioners who need to expand their datasets to build more robust language models.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work