MLGroup-JLU/LLM-data-aug-survey

The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"

/ 100

Experimental

This repository is a curated collection of research papers and materials on using large language models (LLMs) to augment datasets for machine learning tasks. It covers data augmentation techniques for generating synthetic images, text, and paired data, and is aimed at machine learning researchers, data scientists, and AI practitioners who want to improve model performance by expanding their training data.

132 stars. No commits in the last 6 months.

Use this if you are a researcher or practitioner in AI/ML looking for a comprehensive overview and specific techniques for leveraging large models to augment your datasets for tasks in areas like natural language processing, computer vision, or audio processing.

Not ideal if you are looking for ready-to-use software or code implementations that directly perform data augmentation without requiring deep dives into research papers.

data-science machine-learning-research natural-language-processing computer-vision AI-model-training

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 9 / 25

Stars

132

Forks

Language

—

License

—

Higher-rated alternatives

rafska/awesome-local-llm

A curated list of awesome platforms, tools, practices and resources that help run LLMs locally

KalyanKS-NLP/llm-engineer-toolkit

A curated list of 120+ LLM libraries, organized by category.

yzhao062/anomaly-detection-resources

Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM...

llm-jp/awesome-japanese-llm

Overview of Japanese LLMs (original title: 日本語LLMまとめ)

InftyAI/Awesome-LLMOps

🎉 An awesome & curated list of best LLMOps tools.