hesamsheikh/llm-mechanics

Coding an LLM and its building blocks from scratch.

/ 100

Emerging

This project is a personal learning environment for those who want to understand the foundational mechanics of large language models (LLMs). It helps you learn how LLMs and their core components are built from the ground up using PyTorch. This is for machine learning engineers, researchers, or students who are curious about the inner workings of AI language models.

116 stars. No commits in the last 6 months.

Use this if you are a machine learning practitioner who wants to deeply understand the theoretical and practical implementation details of LLMs by building them from first principles.

Not ideal if you are looking to apply pre-built LLMs for specific tasks like content generation, data summarization, or chatbot development.

Machine Learning Education Deep Learning Fundamentals AI Model Development Natural Language Processing Research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

116

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

rasbt/reasoning-from-scratch

Implement a reasoning LLM in PyTorch from scratch, step by step

mindspore-lab/mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...

mosaicml/llm-foundry

LLM training code for Databricks foundation models

rickiepark/llm-from-scratch

<밑바닥부터 만들면서 공부하는 LLM>(길벗, 2025)의 코드 저장소

Explore Transformer Models

All categories Trending Transformer directory Insights