All Transformer Models

7,795 models ranked by quality score · Page 2 of 78

Showing 101–200 of 7,795
# Model Score Tier
101 h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs....

63
Established
102 FastFlowLM/FastFlowLM

Run LLMs on AMD Ryzenโ„ข AI NPUs in minutes. Just like Ollama - but...

62
Established
103 mostlygeek/llama-swap

Reliable model swapping for any local OpenAI/Anthropic compatible server -...

62
Established
104 Goekdeniz-Guelmez/mlx-lm-lora

Train Large Language Models on MLX.

62
Established
105 cel-ai/celai

Open source framework designed to accelerate the development of omnichannel...

62
Established
106 p-e-w/heretic

Fully automatic censorship removal for language models

62
Established
107 eole-nlp/eole

Open language modeling toolkit based on PyTorch

62
Established
108 mlc-ai/mlc-llm

Universal LLM Deployment Engine with ML Compilation

62
Established
109 peremartra/Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

61
Established
110 jessevig/bertviz

BertViz: Visualize Attention in Transformer Models

61
Established
111 jncraton/languagemodels

Explore large language models in 512MB of RAM

61
Established
112 huggingface/optimum-habana

Easy and lightning fast training of ๐Ÿค— Transformers on Habana Gaudi processor (HPU)

61
Established
113 jakobdylanc/llmcord

Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama,...

61
Established
114 huggingface/audio-transformers-course

The Hugging Face Course on Transformers for Audio

61
Established
115 structuredllm/syncode

Efficient and general syntactical decoding for Large Language Models

61
Established
116 rickiepark/nlp-with-transformers

<ํŠธ๋žœ์Šคํฌ๋จธ๋ฅผ ํ™œ์šฉํ•œ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ> ์˜ˆ์ œ ์ฝ”๋“œ๋ฅผ ์œ„ํ•œ ์ €์žฅ์†Œ์ž…๋‹ˆ๋‹ค.

60
Established
117 inseq-team/inseq

Interpretability for sequence generation models ๐Ÿ› ๐Ÿ”

60
Established
118 zjunlp/EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

60
Established
119 NexaAI/nexa-sdk

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and...

60
Established
120 NX-AI/xlstm

Official repository of the xLSTM.

60
Established
121 deeppavlov/AutoIntent

Automated machine learning for text classification

60
Established
122 mukel/llama3.java

Practical Llama 3 inference in Java

59
Established
123 nyu-mll/jiant

jiant is an nlp toolkit

59
Established
124 NVIDIA-NeMo/Automodel

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging...

59
Established
125 lucidrains/simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical...

59
Established
126 cyberchitta/llm-context.py

Share code with LLMs via Model Context Protocol or clipboard. Rule-based...

59
Established
127 peremartra/optipfair

Structured pruning and bias visualization for Large Language Models. Tools...

59
Established
128 lucidrains/dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

59
Established
129 OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation...

59
Established
130 Mobile-Artificial-Intelligence/maid

Maid is a free and open source application for interfacing with llama.cpp...

59
Established
131 VainF/Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision...

59
Established
132 run-llama/LlamaIndexTS

Data framework for your LLM applications. Focus on server side solution

59
Established
133 zhihu/ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.

59
Established
134 arcee-ai/mergekit

Tools for merging pretrained large language models.

59
Established
135 EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs,...

58
Established
136 sign-language-translator/sign-language-translator

Python library & framework to build custom translators for the...

58
Established
137 NielsRogge/Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

58
Established
138 PacktPublishing/Mastering-NLP-from-Foundations-to-LLMs

Mastering NLP from Foundations to LLMs, Published by Packt

58
Established
139 mdsrqbl/omnihuman

AI model that understands text & humanoids.

58
Established
140 quic/efficient-transformers

This library empowers users to seamlessly port pretrained models and...

58
Established
141 changyeyu/LLM-RL-Visualized

๐ŸŒŸ100+ ๅŽŸๅˆ› LLM / RL ๅŽŸ็†ๅ›พ๐Ÿ“š๏ผŒใ€Šๅคงๆจกๅž‹็ฎ—ๆณ•ใ€‹ไฝœ่€…ๅทจ็Œฎ๏ผ๐Ÿ’ฅ๏ผˆ100+ LLM/RL Algorithm Maps ๏ผ‰

58
Established
142 TinyLLaVA/TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

58
Established
143 SalesforceAIResearch/uni2ts

Unified Training of Universal Time Series Forecasting Transformers

58
Established
144 ericmjl/llamabot

Pythonic class-based interface to LLMs

58
Established
145 argosopentech/argos-translate

Open-source offline translation library written in Python

58
Established
146 GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your...

57
Established
147 clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,...

57
Established
148 skyzh/tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems...

57
Established
149 adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

57
Established
150 dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with...

57
Established
151 yjg30737/pyqt-openai

VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting...

57
Established
152 kyegomez/MambaTransformer

Integrating Mamba/SSMs with Transformer for Enhanced Long Context and...

57
Established
153 thu-ml/SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves...

57
Established
154 mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

57
Established
155 microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

57
Established
156 microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large...

57
Established
157 jsksxs360/How-to-use-Transformers

Transformers ๅบ“ๅฟซ้€Ÿๅ…ฅ้—จๆ•™็จ‹

57
Established
158 HandsOnLLM/Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

57
Established
159 BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can...

57
Established
160 moment-timeseries-foundation-model/moment

MOMENT: A Family of Open Time-series Foundation Models, ICML'24

57
Established
161 floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

57
Established
162 Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

57
Established
163 mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

57
Established
164 SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and...

57
Established
165 stas00/ml-engineering

Machine Learning Engineering Open Book

57
Established
166 huggingface/transformers.js-examples

A collection of ๐Ÿค— Transformers.js demos and example applications

57
Established
167 Michael-A-Kuykendall/shimmy

โšก Python-free Rust inference server โ€” OpenAI-API compatible. GGUF +...

57
Established
168 labmlai/annotated_deep_learning_paper_implementations

๐Ÿง‘โ€๐Ÿซ 60+ Implementations/tutorials of deep learning papers with side-by-side...

56
Established
169 OpenMachine-ai/transformer-tricks

A collection of tricks and tools to speed up transformer models

56
Established
170 nrl-ai/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3,...

56
Established
171 analyticalrohit/AI-ML-Cheatsheets

All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep...

56
Established
172 OpenNMT/CTranslate2

Fast inference engine for Transformer models

56
Established
173 KimMeen/Time-LLM

[ICLR 2024] Official implementation of " ๐Ÿฆ™ Time-LLM: Time Series Forecasting...

56
Established
174 poloclub/transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with...

56
Established
175 underneathall/pinferencia

Python + Inference - Model Deployment library in Python. Simplest model...

56
Established
176 sgl-project/ome

Open Model Engine (OME) โ€” Kubernetes operator for LLM serving, GPU...

56
Established
177 kyegomez/Jamba

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

56
Established
178 mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

56
Established
179 huggingface/alignment-handbook

Robust recipes to align language models with human and AI preferences

56
Established
180 kyegomez/LFM

An open source implementation of LFMs from Liquid AI: Liquid Foundation Models

56
Established
181 label-sleuth/label-sleuth

Open source no-code system for text annotation and building of text classifiers

56
Established
182 mistralai/mistral-inference

Official inference library for Mistral models

56
Established
183 tattn/LocalLLMClient

Swift package to run local LLMs on iOS, macOS, Linux

56
Established
184 tue-mps/eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask...

56
Established
185 TIGER-AI-Lab/MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task...

56
Established
186 avikumart/LLM-GenAI-Transformers-Notebooks

An repository containing all the LLM notebooks with tutorial and projects

56
Established
187 galilai-group/stable-pretraining

Reliable, minimal and scalable library for pretraining foundation and world models

56
Established
188 Mobile-Artificial-Intelligence/llama_sdk

lcpp is a dart implementation of llama.cpp used by the mobile artificial...

56
Established
189 hyunwoongko/nanoRLHF

nanoRLHF: from-scratch journey into how LLMs and RLHF really work.

56
Established
190 louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2026 with little...

56
Established
191 kyegomez/MambaByte

Implementation of MambaByte in "MambaByte: Token-free Selective State Space...

56
Established
192 niedev/RTranslator

Open source real-time translation app for Android that runs locally

56
Established
193 sauravpanda/BrowserAI

Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser

56
Established
194 nerdai/llms-from-scratch-rs

A comprehensive Rust translation of the code from Sebastian Raschka's Build...

56
Established
195 lucidrains/alphagenome

Implementation of AlphaGenome, Deepmind's updated genomic attention model

55
Established
196 kyegomez/LFM2

A simple and minimal open source implementation of "Introducing LFM2: The...

55
Established
197 Shivanandroy/simpleT5

simpleT5 is built on top of PyTorch-lightningโšก๏ธ and Transformers๐Ÿค— that lets...

55
Established
198 RBLN-SW/optimum-rbln

โšก A seamless integration of HuggingFace Transformers & Diffusers with RBLN...

55
Established
199 shibing624/textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM,...

55
Established
200 rickiepark/llm-from-scratch

<๋ฐ‘๋ฐ”๋‹ฅ๋ถ€ํ„ฐ ๋งŒ๋“ค๋ฉด์„œ ๊ณต๋ถ€ํ•˜๋Š” LLM>(๊ธธ๋ฒ—, 2025)์˜ ์ฝ”๋“œ ์ €์žฅ์†Œ

55
Established