nlpcda and EDA_NLP_for_Chinese
These are ecosystem siblings—B is an implementation of the EDA (Easy Data Augmentation) paper for Chinese that inspired A's more polished, production-ready package which incorporates EDA as one of several augmentation techniques.
About nlpcda
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
This tool helps people who work with Chinese text data to expand their datasets. You provide existing Chinese text, and it generates multiple variations of that text, carefully designed to retain the original meaning. This is useful for anyone training natural language processing (NLP) models, such as AI engineers or data scientists, who need more diverse training examples to improve model performance.
About EDA_NLP_for_Chinese
zhanlaoban/EDA_NLP_for_Chinese
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
This tool helps data scientists, NLP engineers, and machine learning practitioners improve the performance of their text classification models, especially with Chinese text. It takes a file of Chinese sentences, each with an associated label, and outputs a larger file of augmented sentences. This expanded dataset can then be used to train more robust and accurate classification models.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work