styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

36
/ 100
Emerging

This is a curated collection of research papers and resources focused on improving natural language processing (NLP) models. It helps researchers and practitioners find effective strategies to boost model performance when training data is limited. The collection provides specific techniques, categorized by NLP tasks like text classification or machine translation, to generate more diverse training examples from existing datasets.

831 stars. No commits in the last 6 months.

Use this if you are an NLP researcher or practitioner looking for proven data augmentation techniques to improve the performance of your text classification, translation, summarization, or other NLP models, especially when you have small datasets.

Not ideal if you are looking for ready-to-use software libraries or code implementations without wanting to explore the underlying research papers.

Natural Language Processing Machine Learning Research Data Science Text Analytics Model Training
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 18 / 25

How are scores calculated?

Stars

831

Forks

76

Language

License

Last pushed

Aug 12, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/styfeng/DataAug4NLP"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.