FareedKhan-dev/train-llm-from-scratch

A straightforward method for training your LLM, from downloading data to generating text.

/ 100

Established

This project offers a clear path to building your own custom large language model (LLM). You provide a large dataset of text, and the system trains a language model that can then generate new, coherent text based on what it learned. This is for AI researchers, hobbyists, or developers who want to experiment with creating their own text-generating AI.

531 stars. No commits in the last 6 months.

Use this if you want to train a custom text-generating AI model from scratch using your own data and have access to a GPU.

Not ideal if you need a pre-trained, production-ready LLM or don't have the technical expertise to work with PyTorch and deep learning concepts.

AI research natural language processing generative AI deep learning custom language models

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

531

Forks

108

Language

Jupyter Notebook

License

MIT

Compare

train-llm-from-scratch and LLMs-from-scratch train-llm-from-scratch and llms-from-scratch

Related models

rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

facebookresearch/LayerSkip

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

kmeng01/rome

Locating and editing factual associations in GPT (NeurIPS 2022)

datawhalechina/llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using gpt and other language models. Index and...

Explore Transformer Models

All categories Trending Transformer directory Insights