MLGroup-JLU/LLM-data-aug-survey
The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
This resource provides a curated collection of research papers and materials focused on using large language models (LLMs) to enhance datasets for various machine learning tasks. It helps researchers and practitioners explore different data augmentation techniques, offering insights into methods for generating synthetic images, text, and paired data. The collection is ideal for machine learning researchers, data scientists, and AI practitioners looking to improve model performance by expanding their training data.
132 stars. No commits in the last 6 months.
Use this if you are a researcher or practitioner in AI/ML looking for a comprehensive overview and specific techniques for leveraging large models to augment your datasets for tasks in areas like natural language processing, computer vision, or audio processing.
Not ideal if you are looking for ready-to-use software or code implementations that directly perform data augmentation without requiring deep dives into research papers.
Stars
132
Forks
7
Language
—
License
—
Category
Last pushed
Jul 10, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/MLGroup-JLU/LLM-data-aug-survey"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rafska/awesome-local-llm
A curated list of awesome platforms, tools, practices and resources that helps run LLMs locally
KalyanKS-NLP/llm-engineer-toolkit
A curated list of 120+ LLM libraries category wise.
yzhao062/anomaly-detection-resources
Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM...
llm-jp/awesome-japanese-llm
日本語LLMまとめ - Overview of Japanese LLMs
InftyAI/Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.