HariomJangra/project-lumen
A 128M-parameter language model built from scratch for learning how large language models work.
Project Lumen helps AI researchers and developers understand how modern large language models work by providing a fully built, fully documented example. It takes raw text, processes it, and trains a language model capable of generating text or following instructions, so users can explore every step of development.
Use this if you are an AI researcher, student, or developer who wants to learn the internal mechanics of building a large language model from scratch.
Not ideal if you need an off-the-shelf, production-ready language model for immediate deployment in an application.
Stars: 8
Forks: 1
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Oct 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/HariomJangra/project-lumen"
Open to everyone: 100 requests/day with no key required; a free key raises the limit to 1,000/day.
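The curl call above can also be scripted. A minimal Python sketch, assuming only that the endpoint returns a JSON body (the response schema is not documented in this listing, so the result is returned as a plain dict):

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for a given GitHub owner/repo pair."""
    return f"{BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch the quality record; assumes the endpoint returns JSON."""
    with urllib.request.urlopen(quality_url(owner, repo)) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Network call; counts against the 100 requests/day anonymous limit.
    print(fetch_quality("HariomJangra", "project-lumen"))
```

Authenticated requests (the 1,000/day tier) are omitted here because the key-passing mechanism is not described in this listing.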
Higher-rated alternatives
NX-AI/xlstm: Official repository of the xLSTM.
sinanuozdemir/oreilly-hands-on-gpt-llm: Mastering the Art of Scalable and Efficient AI Model Deployment
DashyDashOrg/pandas-llm: Pandas-LLM
wxhcore/bumblecore: An LLM training framework built from the ground up, featuring a custom BumbleBee architecture...
MiniMax-AI/MiniMax-01: The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model &...