LLMs-from-scratch and llms-from-scratch-cn

These are complement resources serving different language communities—the English PyTorch implementation paired with a Chinese-language variant that covers multiple architectures (GLM4, Llama3, RWKV6)—allowing practitioners to learn LLM construction in their preferred language while referencing the same foundational concepts.

LLMs-from-scratch
66
Established
llms-from-scratch-cn
48
Emerging
Maintenance 17/25
Adoption 10/25
Maturity 16/25
Community 23/25
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 22/25
Stars: 87,892
Forks: 13,408
Downloads:
Commits (30d): 8
Language: Jupyter Notebook
License:
Stars: 4,010
Forks: 552
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License:
No Package No Dependents
Stale 6m No Package No Dependents

About LLMs-from-scratch

rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

This project provides the practical code and guidance to build your own custom GPT-like large language model (LLM) from the ground up. You'll learn how to take raw text data, process it, and train a functional LLM that can generate text or follow instructions. This is designed for AI practitioners, machine learning engineers, and researchers who want to deeply understand and implement LLMs.

AI development natural language processing machine learning engineering deep learning research custom model training

About llms-from-scratch-cn

datawhalechina/llms-from-scratch-cn

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

This project provides a hands-on guide to building large language models (LLMs) from scratch. You'll start with basic Python knowledge and learn to implement the core architectures of models like GLM4, Llama3, and RWKV6. This is ideal for machine learning engineers, AI researchers, or data scientists who want to deeply understand how these powerful models are constructed.

AI development Machine learning engineering Natural language processing Deep learning research Model architecture

Scores updated daily from GitHub, PyPI, and npm data. How scores work