toriving/KoEDA

Korean Easy Data Augmentation


Emerging

This tool helps data scientists and machine learning engineers working with Korean text improve their models by generating more training data. You input existing Korean sentences, and it outputs multiple varied versions of those sentences. This is especially useful for tasks like sentiment analysis, chatbots, or any application relying on natural language understanding in Korean.
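The "one sentence in, several varied sentences out" idea can be sketched in a few lines. This is an illustrative sketch only, not KoEDA's implementation or API. EDA-style augmentation generally combines synonym replacement, random insertion, random swap, and random deletion; the sketch covers only the last two, since they need no lexical resources. The function names (`random_swap`, `random_deletion`, `augment`), the whitespace tokenization, and the 0.2 deletion probability are all assumptions made for illustration; a real Korean pipeline would more likely operate on morphemes.

```python
import random


def random_swap(tokens, n_swaps, rng):
    """Swap two randomly chosen positions, n_swaps times."""
    tokens = list(tokens)
    if len(tokens) < 2:
        return tokens
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens


def random_deletion(tokens, prob, rng):
    """Drop each token with probability prob, always keeping at least one."""
    if len(tokens) <= 1:
        return list(tokens)
    kept = [t for t in tokens if rng.random() >= prob]
    return kept if kept else [rng.choice(tokens)]


def augment(sentence, n_variants=4, seed=0):
    """Return n_variants perturbed copies of sentence (hypothetical helper).

    Tokens are split on whitespace, which is a simplification for Korean.
    Even-indexed variants use a single swap, odd-indexed ones use deletion.
    """
    rng = random.Random(seed)
    tokens = sentence.split()
    variants = []
    for k in range(n_variants):
        if k % 2 == 0:
            new_tokens = random_swap(tokens, 1, rng)
        else:
            new_tokens = random_deletion(tokens, 0.2, rng)
        variants.append(" ".join(new_tokens))
    return variants


if __name__ == "__main__":
    for variant in augment("나는 오늘 학교에 갔다"):
        print(variant)
```

Passing a fixed `seed` keeps the generated variants reproducible, which helps when an augmented dataset needs to be regenerated for a later training run.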

No commits in the last 6 months. Available on PyPI.

Use this if you need to expand your Korean text dataset to train more robust natural language processing models.

Not ideal if you are working with languages other than Korean or if your task doesn't require text augmentation.

Korean-NLP text-augmentation machine-learning-data-prep data-science natural-language-processing


Maintenance 0 / 25

Adoption 9 / 25

Maturity 25 / 25

Community 9 / 25


Language: Python

License: MIT

Higher-rated alternatives

dsfsi/textaugment

TextAugment: Text Augmentation Library

425776024/nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

google-research/uda

Unsupervised Data Augmentation (UDA)

searchableai/KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

SanghunYun/UDA_pytorch

UDA (Unsupervised Data Augmentation) implemented in PyTorch
