nlpcda and nlp-data-augmentation

These are competitors: both provide Chinese NLP data augmentation functionality with overlapping techniques (EDA, BERT-based augmentation), but nlpcda is significantly more mature and widely adopted (6x more stars, active downloads vs. abandoned project).

nlpcda
56
Established
nlp-data-augmentation
36
Emerging
Maintenance 0/25
Adoption 11/25
Maturity 25/25
Community 20/25
Maintenance 0/25
Adoption 10/25
Maturity 8/25
Community 18/25
Stars: 1,878
Forks: 172
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
Stars: 294
Forks: 41
Downloads:
Commits (30d): 0
Language:
License:
Stale 6m
No License Stale 6m No Package No Dependents

About nlpcda

425776024/nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

This tool helps people who work with Chinese text data to expand their datasets. You provide existing Chinese text, and it generates multiple variations of that text, carefully designed to retain the original meaning. This is useful for anyone training natural language processing (NLP) models, such as AI engineers or data scientists, who need more diverse training examples to improve model performance.

NLP-model-training Chinese-text-processing AI-data-preparation text-analytics machine-learning-engineering

About nlp-data-augmentation

quincyliang/nlp-data-augmentation

Data Augmentation for NLP. NLP数据增强

When working with text data for AI models, you often don't have enough examples to train effectively. This project helps you create more varied text samples from your existing data using techniques like synonym replacement, word shuffling, and translation. It's for data scientists, machine learning engineers, and NLP practitioners who need to expand their datasets to build more robust language models.

text analytics machine learning datasets natural language processing AI model training data enrichment

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work