basicv8vc/chinese-instruction-datasets-for-llms
Chinese instruction datasets for fine-tuning LLMs
This project helps AI developers and researchers fine-tune large language models (LLMs) to follow instructions written in Chinese. It provides a collection of ready-to-use Chinese instruction datasets that can serve as training data for specialized Chinese LLMs. It is useful for anyone who needs to train models for Chinese-language applications.
No commits in the last 6 months.
Use this if you need high-quality, pre-compiled Chinese instruction datasets to fine-tune your own large language models for better performance on specific tasks.
Not ideal if you are looking for English instruction datasets or a tool to generate new instruction data from scratch.
Stars: 29
Forks: 1
Language: -
License: Apache-2.0
Category:
Last pushed: Apr 12, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/basicv8vc/chinese-instruction-datasets-for-llms"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
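The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not state; the helper names `quality_url` and `fetch_quality` are hypothetical and introduced only for illustration, and the URL pattern is inferred from the single curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the owner/repo layout is inferred from it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the raw body should be inspected instead.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("basicv8vc", "chinese-instruction-datasets-for-llms")` issues the same GET request as the curl command. Keyless access is limited to 100 requests/day, so a script that polls many repositories may want to cache the results.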
Higher-rated alternatives
MantisAI/sieves
Plug-and-play document AI with zero-shot models.
xiaoya-li/Instruction-Tuning-Survey
Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`
rafaelpierre/bullet
bullet: A Zero-Shot / Few-Shot Learning, LLM Based, text classification framework
TencentARC-QQ/TagGPT
TagGPT: Large Language Models are Zero-shot Multimodal Taggers
amazon-science/adaptive-in-context-learning
AdaICL: Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection