logic-OT/Decoder-Only-LLM

This repository features a custom-built decoder-only language model (LLM) with 37 million parameters 🔥. I train the model to ask questions about a given context.
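To give a feel for where a parameter count in the tens of millions comes from, here is a rough sketch that counts the weights of a generic GPT-style decoder-only transformer. The hyperparameters used below (vocabulary size, width, depth, context length) are hypothetical and are not taken from this repository; the layer layout (learned positional embeddings, 4x MLP expansion, tied output head) is likewise an assumption.

```python
def count_decoder_params(vocab_size: int, d_model: int, n_layers: int, max_len: int) -> int:
    """Count parameters of a generic GPT-style decoder (assumed layout, not the repo's)."""
    token_emb = vocab_size * d_model          # token embedding table
    pos_emb = max_len * d_model               # learned positional embeddings
    # Self-attention: Q, K, V and output projections, each d_model x d_model plus bias.
    attn = 4 * (d_model * d_model + d_model)
    # Feed-forward: d_model -> 4*d_model -> d_model, with biases.
    hidden = 4 * d_model
    mlp = (d_model * hidden + hidden) + (hidden * d_model + d_model)
    # Two LayerNorms per block, each with a scale and a shift vector.
    norms = 2 * (2 * d_model)
    per_layer = attn + mlp + norms
    final_norm = 2 * d_model
    # The output head is assumed to share weights with the token embedding (weight tying).
    return token_emb + pos_emb + n_layers * per_layer + final_norm


# Hypothetical configuration chosen only for illustration.
total = count_decoder_params(vocab_size=32_000, d_model=512, n_layers=6, max_len=512)
print(f"{total:,} parameters")
```

With these illustrative settings the embedding table and the transformer blocks contribute roughly equal shares, which is typical for small models with a large vocabulary.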


Experimental

This project helps machine learning engineers or researchers build and pretrain a custom language model for specific text generation tasks. You provide your own conversational dataset, and the model learns to generate relevant text based on a given context. It's designed for those who need to develop specialized LLMs rather than use off-the-shelf solutions.
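As a minimal sketch of how a context-to-text example is commonly prepared for a decoder-only model, the snippet below builds next-token inputs and labels from already-tokenized IDs. The function name `build_example`, the token IDs, and the choice to mask the context out of the loss are illustrative assumptions, not the repository's actual pipeline; the value -100 is the default `ignore_index` of PyTorch's cross-entropy loss.

```python
from typing import List, Tuple

IGNORE_INDEX = -100  # default ignore_index of PyTorch's cross-entropy loss


def build_example(context_ids: List[int], target_ids: List[int], eos_id: int) -> Tuple[List[int], List[int]]:
    """Build (inputs, labels) for next-token prediction on 'context + target + EOS'.

    Hypothetical helper: the loss is computed only on the target part, so the
    model learns to generate text conditioned on the context.
    """
    tokens = context_ids + target_ids + [eos_id]
    inputs = tokens[:-1]    # the model sees every token except the last
    shifted = tokens[1:]    # at each position it must predict the next token
    n_ctx = len(context_ids)
    # shifted[i] is tokens[i + 1], which is still a context token when i < n_ctx - 1.
    labels = [IGNORE_INDEX if i < n_ctx - 1 else tok for i, tok in enumerate(shifted)]
    return inputs, labels


# Made-up token IDs: a 3-token context followed by a 2-token question.
inputs, labels = build_example(context_ids=[5, 6, 7], target_ids=[8, 9], eos_id=0)
print(inputs)
print(labels)
```

Masking the context is one design choice among several; training on every token, including the context, is also common when pretraining from scratch.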

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher looking to experiment with or build a bespoke decoder-only large language model from scratch and train it on your own unique dataset.

Not ideal if you need a ready-to-use language model for general text generation or question answering without custom training or deep technical involvement.

natural-language-processing machine-learning-engineering custom-LLM-development text-generation-research deep-learning-experimentation

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 15 / 25


Language: Jupyter Notebook

License: none

Higher-rated alternatives

Lightning-AI/litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

liangyuwang/Tiny-DeepSpeed

Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library

catherinesyeh/attention-viz

Visualizing query-key interactions in language + vision transformers (VIS 2023)

microsoft/Text2Grad

🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...

FareedKhan-dev/Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's...
