rickiepark/llm-from-scratch
Code repository for *Learning LLMs by Building Them from Scratch* (Gilbut, 2025)
This project provides the official code repository for the book "LLM From Scratch" (Gilbut, 2025). It helps you understand how large language models (LLMs) like GPT work internally by guiding you through coding them step by step from the ground up. Working from the book's explanations, diagrams, and examples, you build a functional, albeit small, LLM that you can pre-train and fine-tune. It is aimed at anyone, particularly data scientists, machine learning engineers, and researchers, who wants a deep, hands-on understanding of LLM architecture and development.
Use this if you want to build and understand large language models from their fundamental components without relying on high-level libraries.
Not ideal if you need to use existing, large-scale LLMs for practical applications or if you are not comfortable with Python programming.
Stars
97
Forks
108
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Dec 16, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/rickiepark/llm-from-scratch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
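The curl command above can also be called from Python. Here is a minimal sketch using only the standard library; note that the JSON response schema is not documented on this page, so the returned field names are unknown and the code simply passes the parsed object through:

```python
import json
from urllib.request import urlopen

# Endpoint taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for a given GitHub repository."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch quality data for a repository and parse it as JSON.

    The structure of the response is an assumption here: the API is
    only shown returning JSON-like data, so we return the parsed
    object as-is rather than guessing at field names.
    """
    with urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Subject to the 100 requests/day limit for keyless access.
    print(fetch_quality("rickiepark", "llm-from-scratch"))
```

With a free API key (1,000 requests/day), you would presumably pass it as a header or query parameter; since the page does not document the mechanism, that part is omitted here.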
Related models
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
mosaicml/llm-foundry
LLM training code for Databricks foundation models
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in Transformers via a...