FareedKhan-dev/train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
This project offers a clear path to building your own custom large language model (LLM). You provide a large dataset of text, and the system trains a language model that can then generate new, coherent text based on what it learned. This is for AI researchers, hobbyists, or developers who want to experiment with creating their own text-generating AI.
531 stars. No commits in the last 6 months.
Use this if you want to train a custom text-generating AI model from scratch using your own data and have access to a GPU.
Not ideal if you need a pre-trained, production-ready LLM or don't have the technical expertise to work with PyTorch and deep learning concepts.
Stars: 531
Forks: 108
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Aug 03, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/FareedKhan-dev/train-llm-from-scratch"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
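The same request can be made from a script. The following is a minimal Python sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the listing does not state. Only the URL shape is taken from the curl command above.

```python
import json
import urllib.request
from typing import Any
from urllib.parse import quote

# Base path copied from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside it.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> Any:
    """Fetch and decode the response (assumes the body is JSON)."""
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the network request.
    print(build_quality_url("FareedKhan-dev", "train-llm-from-scratch"))
```

Because unauthenticated access is limited to 100 requests/day, a script that polls many repositories would likely want to cache responses locally rather than refetch on every run.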
Related models
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
datawhalechina/llms-from-scratch-cn
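Build a large language model from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work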
geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language models. Index and...