llms-from-scratch and llm-scratch-pytorch

These are **competitors** — both provide step-by-step PyTorch implementations of transformer-based language models from scratch, targeting the same learning objective with overlapping scope and no dependency relationship.

llms-from-scratch
45
Emerging
llm-scratch-pytorch
41
Emerging
Maintenance 10/25
Adoption 7/25
Maturity 11/25
Community 17/25
Maintenance 10/25
Adoption 9/25
Maturity 15/25
Community 7/25
Stars: 26
Forks: 10
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License: MIT
Stars: 100
Forks: 4
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License: MIT
No Package No Dependents
No Package No Dependents

About llms-from-scratch

analyticalrohit/llms-from-scratch

Build a ChatGPT like LLM from scratch in PyTorch, explained step by step.

This project offers a hands-on, step-by-step guide to building a large language model (LLM) similar to ChatGPT using PyTorch. It breaks down the complex architecture into understandable components, showing how text input is transformed into predicted words. This is for AI/ML developers and researchers who want to understand the inner workings of generative AI models.

AI development Machine Learning engineering Deep Learning research Generative AI Natural Language Processing

About llm-scratch-pytorch

skyloevil/llm-scratch-pytorch

lm-scratch-pytorch - The code is designed to be beginner-friendly, with a focus on understanding the fundamentals of PyTorch and implementing LLMs from scratch,step by step.

This project helps aspiring machine learning engineers and researchers understand how large language models (LLMs) like GPT-2 are built from the ground up using PyTorch. It guides you step-by-step through implementing the core components, starting from basic PyTorch concepts, all the way to optimizing performance with techniques like Flash Attention. You'll work with actual LLM architectures and gain practical knowledge of their internal workings.

deep-learning-education LLM-architecture PyTorch-development transformer-models AI-model-training

Scores updated daily from GitHub, PyPI, and npm data. How scores work