toriving/KoEDA
Korean Easy Data Augmentation
This tool helps data scientists and machine learning engineers working with Korean text improve their models by generating more training data. You input existing Korean sentences, and it outputs multiple varied versions of those sentences. This is especially useful for tasks like sentiment analysis, chatbots, or any application relying on natural language understanding in Korean.
No commits in the last 6 months. Available on PyPI.
Use this if you need to expand your Korean text dataset to train more robust natural language processing models.
Not ideal if you are working with languages other than Korean or if your task doesn't require text augmentation.
Stars
91
Forks
6
Language
Python
License
MIT
Category
Last pushed
Sep 30, 2021
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/toriving/KoEDA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dsfsi/textaugment
TextAugment: Text Augmentation Library
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
google-research/uda
Unsupervised Data Augmentation (UDA)
searchableai/KitanaQA
KitanaQA: Adversarial training and data augmentation for neural question-answering models
SanghunYun/UDA_pytorch
UDA(Unsupervised Data Augmentation) implemented by pytorch