JohnMachado11/Build-a-Large-Language-Model-from-Scratch

Building a GPT-like LLM from scratch with PyTorch.

/ 100

Emerging

This project helps machine learning engineers and researchers understand the core mechanics of large language models (LLMs) by guiding them through building one from the ground up. You'll input raw text data and configuration parameters, and the output will be a functional, albeit small, language model. This is for professionals who want to deeply grasp how models like ChatGPT are constructed.

337 stars. No commits in the last 6 months.

Use this if you are a machine learning practitioner who wants to learn the internal workings of large language models by actively building one.

Not ideal if you are looking for a pre-built, production-ready LLM to use out-of-the-box for applications, or if you don't have a background in machine learning and programming.

deep-learning natural-language-processing model-architecture machine-learning-education neural-networks

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 23 / 25

How are scores calculated?

Stars

337

Forks

Language

Python

License

—

Higher-rated alternatives

AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

rasbt/reasoning-from-scratch

Implement a reasoning LLM in PyTorch from scratch, step by step

mindspore-lab/mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...

mosaicml/llm-foundry

LLM training code for Databricks foundation models

rickiepark/llm-from-scratch

<밑바닥부터 만들면서 공부하는 LLM>(길벗, 2025)의 코드 저장소

Explore Transformer Models

All categories Trending Transformer directory Insights