ashworks1706/llm-from-scratch
A theoretical and practical deep dive into Large Language Models and their applications on Deep learning from scratch.
This project offers a deep dive into the inner workings of machine learning models, particularly large language models and image generation models. It provides complete, ground-up implementations of various architectures, training methods, and optimization techniques. Data scientists, machine learning engineers, and researchers can use this to gain a fundamental understanding of how these complex systems are built and operate.
Use this if you are a machine learning engineer, data scientist, or researcher who wants to build a deep, intuitive understanding of how cutting-edge AI models are constructed, trained, and optimized from their foundational components.
Not ideal if you are looking for a high-level library to quickly apply pre-built models or an application that solves an immediate business problem without requiring an understanding of the underlying mechanics.
Stars
10
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ashworks1706/llm-from-scratch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
limix-ldm-ai/LimiX
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence...
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/plur
PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets...
YalaLab/pillar-finetune
Finetuning framework for Pillar medical imaging models.
thuml/LogME
Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML...