nlpcda and EDA_NLP_for_Chinese

These are ecosystem siblings—B is an implementation of the EDA (Easy Data Augmentation) paper for Chinese that inspired A's more polished, production-ready package which incorporates EDA as one of several augmentation techniques.

nlpcda
56
Established
EDA_NLP_for_Chinese
42
Emerging
Maintenance 0/25
Adoption 11/25
Maturity 25/25
Community 20/25
Maintenance 0/25
Adoption 10/25
Maturity 8/25
Community 24/25
Stars: 1,878
Forks: 172
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
Stars: 1,385
Forks: 236
Downloads:
Commits (30d): 0
Language: Python
License:
Stale 6m
No License Stale 6m No Package No Dependents

About nlpcda

425776024/nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

This tool helps people who work with Chinese text data to expand their datasets. You provide existing Chinese text, and it generates multiple variations of that text, carefully designed to retain the original meaning. This is useful for anyone training natural language processing (NLP) models, such as AI engineers or data scientists, who need more diverse training examples to improve model performance.

NLP-model-training Chinese-text-processing AI-data-preparation text-analytics machine-learning-engineering

About EDA_NLP_for_Chinese

zhanlaoban/EDA_NLP_for_Chinese

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

This tool helps data scientists, NLP engineers, and machine learning practitioners improve the performance of their text classification models, especially with Chinese text. It takes a file of Chinese sentences, each with an associated label, and outputs a larger file of augmented sentences. This expanded dataset can then be used to train more robust and accurate classification models.

Chinese-NLP text-classification data-augmentation machine-learning-engineering natural-language-processing

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work