huangjia2019/llm-gpt

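From classic NLP to modern LLMs: building language models step by step. 异步图书 book: 《GPT图解大模型是怎样构建的》 (roughly, "GPT Illustrated: How Large Models Are Built"). This code is a simple, effective collection of classic NLP algorithms that I built entirely by hand before AI coders appeared. Since the rise of AI coders driven by large language models, there have been few chances to write code this "handcrafted" again, and I am not sure whether that is something to be glad about or to regret.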

/ 100

Emerging

This project helps you understand and build language models from the ground up, moving from traditional natural language processing (NLP) to modern large language models (LLMs). You'll start with fundamental NLP concepts and progress to constructing sophisticated language models. This is ideal for students, researchers, or anyone in the AI field looking to deeply grasp the underlying mechanics of how these models work.

194 stars. No commits in the last 6 months.

Use this if you want to learn the step-by-step process of building language models and understand the core principles behind natural language processing and large language models.

Not ideal if you are looking for a ready-to-use, high-performance language model for immediate application or if you prefer to use existing AI coding tools.

natural-language-processing large-language-models machine-learning-education AI-research computational-linguistics

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 22 / 25


Stars: 194
Forks:
Language: Jupyter Notebook
License: none


Higher-rated alternatives

Lightning-AI/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

liangyuwang/Tiny-DeepSpeed

Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library

catherinesyeh/attention-viz

Visualizing query-key interactions in language + vision transformers (VIS 2023)

microsoft/Text2Grad

🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...

FareedKhan-dev/Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its...
