weAIDB/awesome-data-llm

Official Repository of "LLM × DATA" Survey Paper

52
/ 100
Established

This resource provides a comprehensive overview of how Large Language Models (LLMs) interact with data across various stages, from initial preparation to analysis and system optimization. It consolidates research papers and projects into a structured collection, offering insights into data characteristics, processing, storage, and serving for LLMs. Data scientists, machine learning engineers, and researchers working with LLMs will find this a valuable guide.

740 stars. Actively maintained with 8 commits in the last 30 days.

Use this if you are developing or working with Large Language Models and need to understand best practices and emerging trends in data handling, quality, and preparation for optimal model performance.

Not ideal if you are looking for a practical tool or library for direct use, as this is primarily a survey and collection of research papers.

Large Language Models Data Science Research Machine Learning Engineering Data Management AI Development
No License No Package No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 17 / 25

How are scores calculated?

Stars

740

Forks

66

Language

License

Last pushed

Mar 05, 2026

Commits (30d)

8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/weAIDB/awesome-data-llm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.