KennethanCeyer/diy-generative-ai-lm

Make your own generative AI language model from scratch (including pretraining and SFT with LoRA)


Experimental

This project helps software developers understand how large language models (LLMs) work by providing a hands-on kit for building one from scratch. You supply text corpora for pretraining and instruction datasets for fine-tuning, and it produces a trained generative language model. It is aimed at developers and AI researchers who want to learn the fundamental mechanics of LLMs.
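For readers unfamiliar with the LoRA technique named in the description, the following is a minimal pure-Python sketch of the idea: a frozen weight matrix is adjusted by a trainable low-rank product during fine-tuning. It is not taken from the repository; the function names `matvec` and `lora_forward`, the `alpha` scaling parameter, and all numeric values are assumptions chosen for illustration.

```python
# Illustrative sketch only: a LoRA-style low-rank update on a frozen weight matrix.
# All names and values here are hypothetical, not the repository's code.

def matvec(m, v):
    """Multiply matrix m (a list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def lora_forward(W, A, B, x, alpha=1.0):
    """Compute W x + (alpha / r) * B (A x), where r is the adapter rank.

    W: frozen d_out x d_in weights (not updated during fine-tuning)
    A: trainable r x d_in matrix
    B: trainable d_out x r matrix, commonly zero-initialised
    """
    r = len(A)                       # adapter rank
    base = matvec(W, x)              # output of the frozen layer
    delta = matvec(B, matvec(A, x))  # low-rank correction
    scale = alpha / r
    return [b + scale * d for b, d in zip(base, delta)]

W = [[1.0, 0.0], [0.0, 1.0]]   # frozen 2x2 identity weights
A = [[1.0, 1.0]]               # rank r = 1
B_zero = [[0.0], [0.0]]        # zero init: the adapter has no effect yet
B_trained = [[0.5], [-0.5]]    # made-up values standing in for a trained adapter
x = [2.0, 3.0]

print(lora_forward(W, A, B_zero, x))     # [2.0, 3.0]
print(lora_forward(W, A, B_trained, x))  # [4.5, 0.5]
```

With `B` at zero the layer behaves exactly like the pretrained one, which is why fine-tuning can start from the pretrained behaviour and train only the small `A` and `B` matrices.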

No commits in the last 6 months.

Use this if you are a software developer or AI researcher who wants to learn the underlying architecture and training process of generative AI language models by implementing one yourself.

Not ideal if you are looking for a ready-to-use generative AI model or an advanced library to build production-grade LLM applications without understanding the internals.

AI-development, machine-learning-engineering, natural-language-processing, deep-learning-education

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 5 / 25


Language: Python

License: none

Higher-rated alternatives

NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large...

alexiglad/EBT

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

vlm-run/vlmrun-hub

A hub for various industry-specific schemas to be used with VLMs.

HyperGAI/HPT

HPT - Open Multimodal LLMs from HyperGAI

yash9439/Falcon-Local-AI-Model

Explore this GitHub repository housing 3 versions of Falcon code for text generation. Each...
