JohnMachado11/Build-a-Large-Language-Model-from-Scratch

Building a GPT-like LLM from scratch with PyTorch.

41
/ 100
Emerging

This project helps machine learning engineers and researchers understand the core mechanics of large language models (LLMs) by guiding them through building one from the ground up. You'll input raw text data and configuration parameters, and the output will be a functional, albeit small, language model. This is for professionals who want to deeply grasp how models like ChatGPT are constructed.

337 stars. No commits in the last 6 months.

Use this if you are a machine learning practitioner who wants to learn the internal workings of large language models by actively building one.

Not ideal if you are looking for a pre-built, production-ready LLM to use out-of-the-box for applications, or if you don't have a background in machine learning and programming.

deep-learning natural-language-processing model-architecture machine-learning-education neural-networks
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 23 / 25

How are scores calculated?

Stars

337

Forks

83

Language

Python

License

Last pushed

Dec 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JohnMachado11/Build-a-Large-Language-Model-from-Scratch"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.