ranpy13/Learning-LLM

Learning to build LLM from scratch, following rasbt/LLMs-from-scratch footsteps.

28
/ 100
Experimental

This project provides the code and guidance to build your own large language models (LLMs) from the ground up. You'll put in foundational code and training data, and get out a custom, functional LLM capable of generating text and performing specific tasks. This is ideal for machine learning engineers, AI researchers, or data scientists looking to deepen their understanding of LLM architecture and training.

No commits in the last 6 months.

Use this if you are a machine learning practitioner who wants to learn the inner workings of large language models by implementing them yourself.

Not ideal if you are looking for a pre-built LLM to use out-of-the-box for applications without needing to understand its construction.

machine-learning-engineering natural-language-processing AI-research deep-learning model-development
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

8

Forks

1

Language

Jupyter Notebook

License

Last pushed

Sep 29, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ranpy13/Learning-LLM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.