KennethanCeyer/diy-generative-ai-lm
Build your own generative AI language model from scratch (including pretraining and SFT with LoRA)
This project helps software developers understand how large language models (LLMs) work by providing a hands-on kit to build one from scratch. You input textual corpora for pretraining and instruction datasets for fine-tuning, and it outputs a trained generative AI language model. Developers or AI researchers who want to learn the fundamental mechanics of LLMs will find this useful.
No commits in the last 6 months.
Use this if you are a software developer or AI researcher who wants to learn the underlying architecture and training process of generative AI language models by implementing one yourself.
Not ideal if you are looking for a ready-to-use generative AI model or an advanced library to build production-grade LLM applications without understanding the internals.
Stars: 16
Forks: 1
Language: Python
License: —
Category:
Last pushed: Feb 16, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/KennethanCeyer/diy-generative-ai-lm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
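The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions: that the URL path follows a category/owner/repository layout (inferred from the single curl example above), and that the endpoint returns a JSON body. The helper names `build_quality_url` and `fetch_quality` are hypothetical and not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, as the curl
    example suggests. Each segment is percent-encoded.
    """
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + path


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumes the response body is JSON; json.loads raises if it is not.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("generative-ai", "KennethanCeyer", "diy-generative-ai-lm")
    print(json.dumps(data, indent=2))
```

No key is sent here, so the request falls under the keyless 100 requests/day tier. How a key is supplied is not stated above, so the sketch leaves that out.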
Higher-rated alternatives
NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large...
alexiglad/EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
vlm-run/vlmrun-hub
A hub for various industry-specific schemas to be used with VLMs.
HyperGAI/HPT
HPT - Open Multimodal LLMs from HyperGAI
yash9439/Falcon-Local-AI-Model
Explore this GitHub repository housing 3 versions of Falcon code for text generation. Each...