LLM Implementation Tutorials LLM Tools
Educational repositories and step-by-step guides for building LLMs from scratch using frameworks like PyTorch and Transformers. Includes basic implementations, minimal examples, and learning-focused projects. Does NOT include production LLM frameworks, pre-trained model fine-tuning, or inference optimization tools.
There are 49 LLM implementation tutorial tools tracked. One scores above 70 (verified tier). The highest-rated is Lightning-AI/litgpt at 72/100 with 13,225 stars. One of the top 10 is actively maintained.
Get all 49 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-implementation-tutorials&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
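The same request can be sketched in Python using only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the sketch returns the decoded payload without assuming any particular fields. Whether a `limit` above 20 returns all 49 projects is also an assumption to verify against the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="llm-tools",
              subcategory="llm-implementation-tutorials",
              limit=20):
    """Build the dataset URL (hypothetical helper, not part of any SDK)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch and decode the JSON payload.

    The response structure is not specified in this listing, so the
    decoded object is returned as-is for the caller to inspect.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_projects(url) to hit the network.
    print(build_url())
```

Calling `fetch_projects(build_url())` performs the actual request and counts against the daily quota described above.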
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | Lightning-AI/litgpt: 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. | | Verified |
| 2 | liangyuwang/Tiny-DeepSpeed: Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library | | Emerging |
| 3 | catherinesyeh/attention-viz: Visualizing query-key interactions in language + vision transformers (VIS 2023) | | Emerging |
| 4 | microsoft/Text2Grad: 🚀 Text2Grad: Converting natural language feedback into gradient signals for... | | Emerging |
| 5 | FareedKhan-dev/Building-llama3-from-scratch: LLaMA 3 is one of the most promising open-source models after Mistral, we... | | Emerging |
| 6 | huangjia2019/llm-gpt: From classic NLP to modern LLMs: building language models step by step.... | | Emerging |
| 7 | sondotpin/llm-scratch: ChatGPT-like LLM in PyTorch from scratch, step by step | | Emerging |
| 8 | FareedKhan-dev/train-tiny-llm: Train a 29M parameter GPT from scratch | | Emerging |
| 9 | CoinCheung/gdGPT: Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline... | | Emerging |
| 10 | better-py/learn-py: Test code written for research and verification of some Python libraries. | | Emerging |
| 11 | loke-x/jam-gpt: An experimental reimplementation of LLM models for research and development | | Emerging |
| 12 | RWKV-Wiki/MultilingualShareGPT: MultilingualShareGPT, the free multi-language corpus for LLM training | | Emerging |
| 13 | MuzzammilShah/Road-to-AI: A structured documentation hub for AI and ML concepts, based on Andrej... | | Emerging |
| 14 | wozeparrot/tinyrwkv: tinygrad port of the RWKV large language model. | | Emerging |
| 15 | logic-OT/Decoder-Only-LLM: This repository features a custom-built decoder-only language model (LLM)... | | Experimental |
| 16 | liangyuwang/train-large-model-from-scratch: A minimal, hackable pre-training stack for GPT-style language models | | Experimental |
| 17 | MistyozAI/CosmicFish: CosmicFish is a fully original, efficient transformer-based language model... | | Experimental |
| 18 | ag-sc/CRAM-Generation-LLM: Using ChatGPT & GPT-4 to generate CRAM & PyCRAM designators in a one-shot fashion | | Experimental |
| 19 | kreasof-ai/Homunculus-Project: Long-term project about a custom AI architecture. Consists of cutting-edge... | | Experimental |
| 20 | IAAR-Shanghai/Awesome-Attention-Heads: An awesome repository and a comprehensive survey on interpretability of LLM... | | Experimental |
| 21 | max1ab/smol-training-playbook-zh: Chinese translation of the Hugging Face article "The Smol Training Playbook" | | Experimental |
| 22 | wln20/Attention-Viewer: A plug-and-play tool for visualizing attention-score heatmaps in generative... | | Experimental |
| 23 | sheikhcoders/SheikhLM: Blazingly fast and remarkably powerful. | | Experimental |
| 24 | benthecoder/AI: Learning AI from scratch | | Experimental |
| 25 | njmarko/llm-gpt-sort: A new type of sorting algorithm. Use large language model (llm like gpt,... | | Experimental |
| 26 | hulkiciray/llm_from_scratch: Comprehensive fresh start for how LLMs work | | Experimental |
| 27 | snnclsr/chatgpt-from-scratch: A full-stack ChatGPT-like application built (almost) from scratch | | Experimental |
| 28 | Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers: A compact language model implementing HybridNorm and Fourier-based... | | Experimental |
| 29 | moorebrett0/microformer: A minimal, dual-memory transformer language model built from scratch in... | | Experimental |
| 30 | FranxYao/Language-Model-Pretraining-for-Text-Generation: LM pretraining for generation, reading list, resources, conference mappings. | | Experimental |
| 31 | bobo0810/MiniGPT-4-DeepSpeed: MiniGPT-4 accelerated with DeepSpeed, plus model scale-up and experimental analysis | | Experimental |
| 32 | UEFI-code/PyTorch_For_PoorGuys: A special PyTorch for poor guys who can't afford a big GPU | | Experimental |
| 33 | anafisa/Text2Text-Transformer: Large Multi-Language Models for News Translation | | Experimental |
| 34 | pprp/smol_training_zh: "The Smol Training Playbook": the secrets to building world-class large models | | Experimental |
| 35 | ikun-llm/ikun-2.5B: A large AI model that has "practiced for two and a half years" (actually 26M params; 2.5B = two and a half years); ikun meme-culture chatbot 🐔🏀 | | Experimental |
| 36 | llami-team/gpt-torch: Compress the HTML as much as possible for LLM inference. | | Experimental |
| 37 | ariannamethod/RRPRAM: RRPRAM: Recursive Resonant Pattern Recognition Attention Mechanism by Arianna Method | | Experimental |
| 38 | RajdeepKushwaha5/LLMs-from-scratch: This repository contains the code for developing, pretraining, and... | | Experimental |
| 39 | feljost/fgpt: Building a conversational LLM from scratch | | Experimental |
| 40 | ikun-llm/ikun-pretrain: Pretrain LLM from scratch 📚 | | Experimental |
| 41 | Antoinechss/LLM-from-scratch: Building an LLM (GPT-3-like architecture) from scratch in Python and training... | | Experimental |
| 42 | Bhardwaj-Saurabh/Build_A_Small_Language_Model: A detailed guide to build, train, fine-tune and deploy an SLM. This repo... | | Experimental |
| 43 | PalaashAgrawal/allmond: LLM Training made Quick and Easy | | Experimental |
| 44 | adityanaranje/SLM-From-Scratch: NanoChat is a decoder-only causal language model built from scratch, showing... | | Experimental |
| 45 | aayushsharma-io/tensorix: Tensorix is a minimalistic and highly compatible Python library designed to... | | Experimental |
| 46 | aaaastark/NBART-Multilingual-Translator: This repository contains a Python script that uses a pre-trained NBART... | | Experimental |
| 47 | rescuerz/LLMs-from-scratch: Implement a ChatGPT-like LLM in PyTorch from scratch, step by step | | Experimental |
| 48 | jvscursulim/build-a-llm-from-scratch: Studies about LLMs | | Experimental |
| 49 | B0-B/blowtorch-transformer-api: LLM bootstrap loader for local CPU/GPU inference with fully customizable chat. | | Experimental |