OceanPresentChao/llm-corpus

从零搭建大模型知识库(Build LLM RAG Corpus from scratch)

29
/ 100
Experimental

This project helps you build a custom knowledge base for large language models (LLMs) from scratch. It takes your Chinese text documents, processes them, converts them into a format that LLMs can understand, and stores them in a searchable database. The output is a functional chatbot that can answer questions using the information in your specific documents. This is for developers, researchers, or data scientists looking to create tailored LLM applications for specific domains.

No commits in the last 6 months.

Use this if you need to create a specialized chatbot or question-answering system that uses your own collection of documents, rather than general internet knowledge.

Not ideal if you are a non-technical user looking for a ready-to-use application without any coding or model setup.

knowledge-base-creation LLM-customization chatbot-development information-retrieval NLP-engineering
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 12 / 25

How are scores calculated?

Stars

86

Forks

9

Language

Python

License

Category

local-rag-stacks

Last pushed

Oct 23, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/OceanPresentChao/llm-corpus"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.