loke-x/jam-gpt
An experimental reimplementation of LLMs for research and development
This project helps AI researchers and developers explore the inner workings of Large Language Models (LLMs). You can supply your own datasets to train and fine-tune experimental Generative Pretrained Transformer (GPT) models, gaining insight into their architecture and design. It is aimed at individuals building or experimenting with LLMs for research purposes.
No commits in the last 6 months.
Use this if you are an AI researcher or developer looking to experiment with LLM architectures, train custom models on your own data, and understand their underlying principles.
Not ideal if you need a pre-trained, production-ready LLM for immediate use, or if you are not interested in the details of model architecture and training.
Stars
21
Forks
3
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Aug 20, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/loke-x/jam-gpt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
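The same endpoint can be called from code. A minimal sketch in Python, assuming only the URL pattern shown in the curl example above; the JSON response schema is not documented here, so no specific fields are assumed:

```python
# Sketch of querying the pt-edge quality API from Python.
# Only the endpoint pattern from the curl example is assumed;
# the response schema is not documented here.
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"

def build_url(category: str, owner: str, repo: str) -> str:
    """Endpoint for one repository's quality data."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """GET the endpoint and parse the JSON body."""
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

# Example (performs a live network request):
#   data = fetch_quality("llm-tools", "loke-x", "jam-gpt")
#   print(json.dumps(data, indent=2))
```

Within the free tier (100 requests/day without a key), no authentication header is needed.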
Compare
Higher-rated alternatives
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
liangyuwang/Tiny-DeepSpeed
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers (VIS 2023)
microsoft/Text2Grad
🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its...