styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

/ 100

Emerging

This is a curated collection of research papers and resources focused on improving natural language processing (NLP) models. It helps researchers and practitioners find effective strategies to boost model performance when training data is limited. The collection provides specific techniques, categorized by NLP tasks like text classification or machine translation, to generate more diverse training examples from existing datasets.

831 stars. No commits in the last 6 months.

Use this if you are an NLP researcher or practitioner looking for proven data augmentation techniques to improve the performance of your text classification, translation, summarization, or other NLP models, especially when you have small datasets.

Not ideal if you are looking for ready-to-use software libraries or code implementations without wanting to explore the underlying research papers.

Natural Language Processing Machine Learning Research Data Science Text Analytics Model Training

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

831

Forks

Language

—

License

—

Higher-rated alternatives

varunkumar-dev/TransformersDataAugmentation

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Akshint0407/Automated-Answer-Checker

AI-powered grading system for educators 🔹 Streamlit web app that automates answer sheet...

Anjum48/commonlitreadabilityprize

4th Place solution for the Kaggle CommonLit Readability Prize

yuchen0515/2022-Competition-CUDAOutOfMemory

Our team placed 6th out of 119 teams in E.SUN AI Open Competition Summer 2022 - ASR...

kozodoi/Text_Readability_Prediction

Predicting text reading complexity with transformers (top-9% Kaggle solution & interactive web app)

Explore Transformer Models

All categories Trending Transformer directory Insights