Llm Implementation Tutorials Transformer Models
There are 111 llm implementation tutorials models tracked. 1 score above 70 (verified tier). The highest-rated is AI-Hypercomputer/maxtext at 82/100 with 2,169 stars. 3 of the top 10 are actively maintained.
Get all 111 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-implementation-tutorials&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM! |
|
Verified |
| 2 |
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step |
|
Established |
| 3 |
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore... |
|
Established |
| 4 |
mosaicml/llm-foundry
LLM training code for Databricks foundation models |
|
Established |
| 5 |
rickiepark/llm-from-scratch
<밑바닥부터 만들면서 공부하는 LLM>(길벗, 2025)의 코드 저장소 |
|
Established |
| 6 |
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in... |
|
Established |
| 7 |
rllm-team/rllm
Pytorch Library for Relational Table Learning with LLMs. |
|
Established |
| 8 |
ridgerchu/matmulfreellm
Implementation for MatMul-free LM. |
|
Established |
| 9 |
FareedKhan-dev/train-llama4
Building LLaMA 4 MoE from Scratch |
|
Emerging |
| 10 |
joyehuang/minimind-notes
🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A... |
|
Emerging |
| 11 |
donaldafeith/Pytorch_Merge
Merge LLM that are split in to parts |
|
Emerging |
| 12 |
xinzhanguo/hellollm
pre train a new llm |
|
Emerging |
| 13 |
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中) |
|
Emerging |
| 14 |
AviSoori1x/seemore
From scratch implementation of a vision language model in pure PyTorch |
|
Emerging |
| 15 |
hitz-zentroa/whisper-lm-transformers
Add n-gram and LLM language model support to HF Transformers Whisper models. |
|
Emerging |
| 16 |
shivendrra/SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from... |
|
Emerging |
| 17 |
JohnMachado11/Build-a-Large-Language-Model-from-Scratch
Building a GPT-like LLM from scratch with PyTorch. |
|
Emerging |
| 18 |
fangpin/llm-from-scratch
Build LLM from scratch |
|
Emerging |
| 19 |
ChaitanyaK77/Building-a-Small-Language-Model-SLM-
This Repository provides a Jupyter Notebook for building a small language... |
|
Emerging |
| 20 |
ronniross/attention-heatmap-visualizer
A set of scripts to generate full attention-head heatmaps for transformer-based LLMs |
|
Emerging |
| 21 |
NVIDIA/logits-processor-zoo
A collection of LogitsProcessors to customize and enhance LLM behavior for... |
|
Emerging |
| 22 |
zeyadusf/LLMs-from-Scratch
Build a Large Language Model (From Scratch) book and Finetuned Models |
|
Emerging |
| 23 |
SPUTNIKAI/LeechTransformer
Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech... |
|
Emerging |
| 24 |
rasbt/pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage... |
|
Emerging |
| 25 |
ai-glimpse/toyllm
ToyLLM: Learning LLM from Scratch |
|
Emerging |
| 26 |
OpenNLPLab/TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM |
|
Emerging |
| 27 |
Yigtwxx/PredictaLM
PredictaLM is a lightweight Turkish language model designed for next-word... |
|
Emerging |
| 28 |
GeeeekExplorer/transformers-patch
patches for huggingface transformers to save memory |
|
Emerging |
| 29 |
viralcode/superGPT
Train your own LLM from scratch |
|
Emerging |
| 30 |
hesamsheikh/llm-mechanics
Coding an LLM and its building blocks from scratch. |
|
Emerging |
| 31 |
JoelDeonDsouza/Zenpool_LLM
Zenpool is a compact, fine-tuned MLL (Mini Language Learner) model |
|
Emerging |
| 32 |
GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs
Multi GPU Fine Training LLMs using DeepSpeed and Accelerate. |
|
Emerging |
| 33 |
waltonfuture/InstructionGPT-4
InstructionGPT-4 |
|
Emerging |
| 34 |
Arezkiiiii/mini_llm
🚀 Build and understand a Large Language Model from scratch using PyTorch... |
|
Emerging |
| 35 |
microsoft/encoder-decoder-slm
Efficient encoder-decoder architecture for small language models (≤1B... |
|
Emerging |
| 36 |
ksm26/Pretraining-LLMs
Master the essential steps of pretraining large language models (LLMs).... |
|
Emerging |
| 37 |
myscience/x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024) |
|
Emerging |
| 38 |
Utshav-paudel/LLM-Zero-to-Hero
This repo contains the resources, projects and documentation of mine while... |
|
Emerging |
| 39 |
muna-ai/muna-predictors
Interesting Python functions compiled to run anywhere with Muna. |
|
Emerging |
| 40 |
Nikshaan/llm-from-scratch
Implementation of build a LLM from scratch by Sebastian Raschka. |
|
Emerging |
| 41 |
OpenVanguard/remma-o1
Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch... |
|
Emerging |
| 42 |
kmkrofficial/LiteGPT
LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and... |
|
Emerging |
| 43 |
Ajax0564/VyomAI
VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ... |
|
Emerging |
| 44 |
Shinichi0713/LLM-fundamental-study
this site is the fundamental page of LLM-mechanism |
|
Experimental |
| 45 |
ranpy13/Learning-LLM
Learning to build LLM from scratch, following rasbt/LLMs-from-scratch footsteps. |
|
Experimental |
| 46 |
KillerShoaib/RLM-From-Scratch
Implementation of Recursive Language Model paper from scratch |
|
Experimental |
| 47 |
harshtiwari01/llm-heatmap-visualizer
A set of scripts to generate full attention-head heatmaps for transformer-based LLMs |
|
Experimental |
| 48 |
AnkitaMungalpara/Building-LLM-From-Scratch
This repository provides a step-by-step guide to creating your own large... |
|
Experimental |
| 49 |
Michael-Jackson666/Zero2Hero-AI
From first principles to state-of-the-art: A hands-on journey implementing... |
|
Experimental |
| 50 |
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism |
|
Experimental |
| 51 |
SreeEswaran/Train-your-LLM
This repository contains code and resources for training, fine-tuning, and... |
|
Experimental |
| 52 |
MDalamin5/Build-and-Finetune-LLM-From-Scratch-Deploy-via-vLLM-AWS-GCP
A complete end-to-end learning repo covering everything from building Large... |
|
Experimental |
| 53 |
mtkaya/transformer-edge-optimization
Optimize Transformer models for edge devices |
|
Experimental |
| 54 |
simply-pouria/The-LMs-Book
My study notes, code implementations, etc. while reading The Hundred-Page... |
|
Experimental |
| 55 |
MaharshPatelX/multi-model-conda-environment
This GitHub repository provides a comprehensive guide on setting up a Conda... |
|
Experimental |
| 56 |
aryagxr/llm-from-scratch
Implementation of LLM ✨from scratch✨ |
|
Experimental |
| 57 |
musa-kal/smoll-language-model
Educational language model implementation from scratch, featuring Bigram and... |
|
Experimental |
| 58 |
adorow/LlmFromScratch
Notes and code from reading "Build a Large Language Model (From Scratch)" |
|
Experimental |
| 59 |
Tek233/Build_a_Large_Language_Model_From_Scratch
Testing and experimenting with code from Sebastian Raschka’s book Build a... |
|
Experimental |
| 60 |
mcbieda/llm-from-scratch
1. A simple implementation of the LLM gpt-2 "from scratch" that allows easy... |
|
Experimental |
| 61 |
Matonice/LLM_From_Scratch
Implementing all the parts and architecture of a large language model from scratch |
|
Experimental |
| 62 |
10-OASIS-01/Autoregressive-Language-Model
This project is a comprehensive implementation of a Transformer-based... |
|
Experimental |
| 63 |
filipsedivy/SLiM-CZ-V1
🇨🇿 Slavic Linguistic integrated Micro-model for Czechia |
|
Experimental |
| 64 |
AIDajiangtang/LLM-from-scratch
从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch |
|
Experimental |
| 65 |
villagecomputing/superpipe
Superpipe - optimized LLM pipelines for structured data |
|
Experimental |
| 66 |
CS-433/ml-project-2-mlp
Advancing Homepage2Vec with LLM-Generated Datasets for Multilingual Website... |
|
Experimental |
| 67 |
JerryPan2718/flexgpt
Tradeoff between runtime and RAM usage for large language model inference. |
|
Experimental |
| 68 |
fahadsid1770/Aesop-LLM
custom Bengali Language Model built from scratch using PyTorch, featuring... |
|
Experimental |
| 69 |
prasanna00019/Small-Language-Models
A collection of Small Language Models (SLMs) built from scratch in PyTorch. |
|
Experimental |
| 70 |
ictup/Build-a-LLM-model-from-scratch-simple
LLM pipeline: data→tokenizer→attention→GPT train/eval→instruction... |
|
Experimental |
| 71 |
sanyalsunny111/Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training |
|
Experimental |
| 72 |
CastorYu/train-hybrid-llm-from-scratch
A simplistic script for training your own hybrid llm (using autoregressive... |
|
Experimental |
| 73 |
daniel-mehta/CooperLM-354M
A 354M parameter GPT-2 model trained on filtered Wikipedia, BookCorpus, and... |
|
Experimental |
| 74 |
Ki-Seki/Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large... |
|
Experimental |
| 75 |
ashrithssreddy/llm-from-scratch
Building GPTs from the ground up. A hands-on journey through attention... |
|
Experimental |
| 76 |
MAlshurbaji/LLM-Playground-From-Scratch
A hands-on educational repository demonstrating how to build a miniature LLM... |
|
Experimental |
| 77 |
T-Mohamed-Shafeek/LLM-for-Language-Translation
This repository contains a simple and beginner-level noteboook which employs... |
|
Experimental |
| 78 |
jbergq/transformer-language-modeling
Notebook showing how to implement and train a transformer for language modeling |
|
Experimental |
| 79 |
gusye1234/gpt-adapter
Collections of algorithms for efficiently fine-tuning LLM |
|
Experimental |
| 80 |
sasa5linkar/Spacy-Serbian-Transformer
Using Jertehs corpuses to train spacy with tranformer and LLM |
|
Experimental |
| 81 |
FreezB11/PsyDuck
a 60M parameter LLM from scratch |
|
Experimental |
| 82 |
ajheshbasnet/model-reincarnated
a collection of re-implementations of renowned artificial intelligence... |
|
Experimental |
| 83 |
nachammai779/Large-Language-Models-from-Scratch
🧠 Building Large Language Models from the ground up — BPE tokenization,... |
|
Experimental |
| 84 |
Adwerse/Mini_LLM
🧠 Transformer built from scratch — RoPE, SwiGLU, KV-Cache, Flash Attention.... |
|
Experimental |
| 85 |
rudyon/pipeline
Training pipeline for LLMs in PyTorch. |
|
Experimental |
| 86 |
itxmjr/LLM-From-Scratch
A step-by-step Guide: Build a GPT-like LLM From Scratch using PyTorch |
|
Experimental |
| 87 |
Daegybyte/LLM_Training_Pipeline
Modular text-generation training pipeline using PyTorch, Hugging Face, and... |
|
Experimental |
| 88 |
Samya-S/Building-LLMs-from-scratch
A hands-on guide to implementing Large Language Models from scratch |
|
Experimental |
| 89 |
falseywinchnet/ConvexGPT
Mostly Convex ICNN based Large Language Model |
|
Experimental |
| 90 |
sugam24/LLM-From-Scratch
Building a Large Language Model from the ground up. |
|
Experimental |
| 91 |
AndrewKM210/micro-llm
A micro-LLM implementation that should be able to train and run on a standard laptop |
|
Experimental |
| 92 |
hichamcharhi/llm-from-scratch
Implémentez un LLM from scratch en Python — cours pédagogique en... |
|
Experimental |
| 93 |
insooeric/LLM_Small
Long Language Model from SCRATCH |
|
Experimental |
| 94 |
patrikwolf/ttt_theory
Specialization after Generalization |
|
Experimental |
| 95 |
capecoder08/llm-playground
Playing with tokenizers, transformers, and LLMs |
|
Experimental |
| 96 |
storytellingengineer/Learning_LLM_using_Pytorch
Hi, this repo is my learning to build LLMs from scratch | Foundational... |
|
Experimental |
| 97 |
Jkanishkha0305/LLMs-from-Scratch
A curated collection of Large Language Models(LLMs), Small Language... |
|
Experimental |
| 98 |
gkswamy98/sequence_model_il
Contains sequence-model implementations of on and off-policy imitation... |
|
Experimental |
| 99 |
omkarbg100/LLM-GPT
This is a LLM Model with full Architecture Coded ans is based On GPT-2 Weights |
|
Experimental |
| 100 |
juanprida/language-model-from-scratch
We train various transformers to compose music like L.A.S. |
|
Experimental |
| 101 |
shankezh/LLMScratchs
This repository contains a comprehensive implementation of the full... |
|
Experimental |
| 102 |
Imran-ml/GEFS-language-detector
GEFS Language Detection model deployed at Hugging Face |
|
Experimental |
| 103 |
Ahmadjamil888/CUSTOM-LLM-MODEL-DL
A advanced deep learning AI model meant to be trained on your desired... |
|
Experimental |
| 104 |
teatonedev/Llama3.2-From-Scratch
This repository serves as an educational resource for building an... |
|
Experimental |
| 105 |
0xnu/tiny_llm_trainer
The experiment implements a tiny language model trainer using PyTorch. |
|
Experimental |
| 106 |
jeffrey-fong/llms-from-scratch
A collection of language model architectures implemented from scratch on PyTorch |
|
Experimental |
| 107 |
GiuseppeBellamacina/Little_Language_Model
Implementation of a Transformer and training on Dante's Divina Commedia |
|
Experimental |
| 108 |
KarthikArumugam3/Pretraining_LLM
A demo of how to use Transformer models for pretraining LLMs |
|
Experimental |
| 109 |
myscience/jamba
Pytorch (Lightning) Implementation of the Jamba Language Model |
|
Experimental |
| 110 |
sabamadadi/PersianWiki-LLM-FromScratch
Implementation of a decoder-only transformer for Persian Wikipedia text... |
|
Experimental |
| 111 |
SyedAanif/build-llm
Learn about the entire process of building a Large Language Model (LLM) from scratch |
|
Experimental |