kmkrofficial/LiteGPT
LiteGPT: A 124M-parameter Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.
This project helps machine learning engineers and researchers train a small language model from scratch or fine-tune an existing one. It takes large text datasets, such as educational web data or instruction-following examples, and produces a compact custom language model that can generate text or follow instructions.
Use this if you need to create your own compact, custom language model for text generation or instruction following, without needing a massive, resource-intensive model.
Not ideal if you simply want to use an off-the-shelf language model for common tasks without any custom training or model development.
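The repository does not list LiteGPT's exact architecture here, but 124M parameters matches the standard GPT-2-small configuration (12 layers, 768-dim embeddings, 1024-token context, 50,257-token vocabulary, tied input/output embeddings). A minimal sketch, assuming that configuration, of where the parameters come from:

```python
def gpt_param_count(vocab=50257, ctx=1024, d=768, layers=12):
    """Parameter count for a GPT-2-style decoder with tied input/output embeddings."""
    emb = vocab * d + ctx * d                    # token + positional embedding tables
    ln = 2 * d                                   # LayerNorm scale + bias
    attn = (d * 3 * d + 3 * d) + (d * d + d)     # fused QKV projection + output projection
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)  # 4x-expansion feed-forward (both layers)
    block = 2 * ln + attn + mlp                  # one transformer block: 2 LayerNorms + attn + MLP
    return emb + layers * block + ln             # plus the final LayerNorm

print(gpt_param_count())  # 124,439,808 -> the "124M" in the model name
```

Note that the embedding table alone (about 38.6M parameters) accounts for nearly a third of the total at this scale, which is typical for small language models.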
Stars: 34
Forks: 5
Language: Python
License: —
Category: —
Last pushed: Dec 16, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kmkrofficial/LiteGPT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
mosaicml/llm-foundry
LLM training code for Databricks foundation models
rickiepark/llm-from-scratch
Code repository for the Korean edition of "Build a Large Language Model (From Scratch)" (Gilbut, 2025)