ai-glimpse/toyllm
ToyLLM: Learning LLM from Scratch
This project helps you understand how Large Language Models (LLMs) work by providing a hands-on environment to build and experiment with them from scratch. You input raw text and configuration parameters, and it outputs text generations and insights into the model's internal workings. This is for machine learning engineers, researchers, or advanced students interested in the foundational mechanics of modern language AI.
Use this if you want to deeply learn and implement core LLM concepts like transformer architectures, GPT-2, speculative sampling, and KV cache optimization.
Not ideal if you are looking for a pre-built, production-ready LLM solution or just want to use an LLM for immediate text generation without understanding its internal mechanics.
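Of the concepts the project covers, the KV cache is the easiest to sketch compactly. The idea: during autoregressive decoding, the attention keys and values of past tokens are computed once and cached, so each new step only processes the newest token. The sketch below is a minimal toy illustration of that idea (all names are hypothetical, not ToyLLM's actual API), using plain Python lists in place of real tensors.

```python
import math

def attention(q, keys, values):
    """Single-query scaled dot-product attention over cached keys/values."""
    d = len(q)
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
    m = max(scores)                      # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]  # softmax over cached positions
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(d)]

class KVCache:
    """Toy per-layer cache: append once per token, reuse on every later step."""
    def __init__(self):
        self.keys, self.values = [], []

    def step(self, q, k, v):
        # Only the new token's key/value is computed; past entries are reused,
        # turning O(n^2) recomputation per step into O(n) attention reads.
        self.keys.append(k)
        self.values.append(v)
        return attention(q, self.keys, self.values)

cache = KVCache()
out1 = cache.step([1.0, 0.0], [1.0, 0.0], [2.0, 0.0])  # first token attends to itself
out2 = cache.step([0.0, 1.0], [0.0, 1.0], [0.0, 3.0])  # second token reuses cached K/V
```

With a single cached entry the softmax weight is 1, so `out1` is exactly the first value vector; the second step attends over both cached positions without recomputing the first token's key or value.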
Stars: 25
Forks: 1
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ai-glimpse/toyllm"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
mosaicml/llm-foundry
LLM training code for Databricks foundation models
rickiepark/llm-from-scratch
Code repository for *Build a Large Language Model (From Scratch)* (Korean edition, Gilbut, 2025)