FareedKhan-dev/train-llm-from-scratch

A straightforward method for training your LLM, from downloading data to generating text.

52
/ 100
Established

This project offers a clear path to building your own custom large language model (LLM). You provide a large dataset of text, and the system trains a language model that can then generate new, coherent text based on what it learned. This is for AI researchers, hobbyists, or developers who want to experiment with creating their own text-generating AI.

531 stars. No commits in the last 6 months.

Use this if you want to train a custom text-generating AI model from scratch using your own data and have access to a GPU.

Not ideal if you need a pre-trained, production-ready LLM or don't have the technical expertise to work with PyTorch and deep learning concepts.

AI research natural language processing generative AI deep learning custom language models
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

How are scores calculated?

Stars

531

Forks

108

Language

Jupyter Notebook

License

MIT

Last pushed

Aug 03, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/FareedKhan-dev/train-llm-from-scratch"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.