ai-art-dev99/llm-from-scratch
Build a Large Language Model From Scratch
This project offers a guided, hands-on path to building a large language model (LLM) from scratch. It takes raw text data and, through a series of steps, helps you create your own functional language model capable of tasks like text generation, classification, or instruction following. This is for anyone, like AI researchers or advanced students, who wants to deeply understand the mechanics behind modern AI language models.
No commits in the last 6 months.
Use this if you want to gain a comprehensive, step-by-step understanding of the internal workings of large language models by building one yourself.
Not ideal if you're looking for a pre-built tool or library to simply use an LLM without delving into its construction.
Stars
22
Forks
7
Language
Jupyter Notebook
License
—
Category
Last pushed
Oct 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ai-art-dev99/llm-from-scratch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
FareedKhan-dev/train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
datawhalechina/llms-from-scratch-cn
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理