Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
This resource helps individuals and small organizations develop custom AI chatbots by providing a comprehensive, curated list of high-quality datasets. It takes a problem you want your chatbot to solve and helps you find suitable training data, outputting links, sizes, languages, and descriptions of various datasets. This is ideal for AI researchers, data scientists, or anyone looking to train or fine-tune their own large language models.
3,369 stars. No commits in the last 6 months.
Use this if you are developing or fine-tuning a large language model and need to quickly find and compare suitable training datasets, especially for chatbot instruction.
Not ideal if you are looking for ready-to-use large language models or need a tool for model deployment and inference.
Stars
3,369
Forks
233
Language
—
License
MIT
Category
Last pushed
Nov 28, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Zjh-819/LLMDataHub"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AI-Planning/l2p
Library for LLM-driven action model acquisition via natural language
datawhalechina/self-llm
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
theaniketgiri/create-llm
The fastest way to build and start training your own LLM. CLI tool that scaffolds...
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)