All Transformer Models

7,795 models ranked by quality score

Showing 1–100 of 7,795
# Model Score Tier
1 huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

90
Verified
2 vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

87
Verified
3 huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine...

87
Verified
4 sgl-project/sglang

SGLang is a high-performance serving framework for large language models and...

87
Verified
5 Dao-AILab/flash-attention

Fast and memory-efficient exact attention

86
Verified
6 vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

83
Verified
7 ModelCloud/GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support...

83
Verified
8 AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

82
Verified
9 unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss,...

81
Verified
10 qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and...

81
Verified
11 Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models...

81
Verified
12 openvinotoolkit/nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

80
Verified
13 alibaba/MNN

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,...

80
Verified
14 huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

80
Verified
15 lucidrains/x-transformers

A concise but complete full-attention transformer with a set of promising...

79
Verified
16 LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

79
Verified
17 sgl-project/SpecForge

Train speculative decoding models effortlessly and port them smoothly to...

79
Verified
18 modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...

78
Verified
19 huggingface/optimum

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and...

77
Verified
20 microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing...

77
Verified
21 oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any...

77
Verified
22 bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

77
Verified
23 ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

77
Verified
24 linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

77
Verified
25 xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you...

76
Verified
26 SwanHubX/SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and...

76
Verified
27 tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It...

76
Verified
28 fla-org/flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

76
Verified
29 intel/auto-round

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed...

75
Verified
30 intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity;...

74
Verified
31 AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We...

74
Verified
32 tenstorrent/tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.

73
Verified
33 PaddlePaddle/FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on...

73
Verified
34 cvs-health/uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package...

73
Verified
35 filipstrand/mflux

MLX native implementations of state-of-the-art generative image models

73
Verified
36 NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

73
Verified
37 withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp....

73
Verified
38 adapter-hub/adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

72
Verified
39 pytorch/ao

PyTorch native quantization and sparsity for training and inference

71
Verified
40 amazon-science/chronos-forecasting

Chronos: Pretrained Models for Time Series Forecasting

71
Verified
41 MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

71
Verified
42 ExtensityAI/symbolicai

A neurosymbolic perspective on LLMs

71
Verified
43 alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

70
Verified
44 jd-opensource/xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

69
Established
45 cubist38/mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for...

69
Established
46 NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and...

69
Established
47 PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

69
Established
48 NVIDIA/sphinx-llm

LLM extensions for Sphinx Documentation

69
Established
49 agentscope-ai/Trinity-RFT

Trinity-RFT is a general-purpose, flexible and scalable framework designed...

69
Established
50 huggingface/optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

68
Established
51 gpustack/gpustack

Performance-optimized AI inference on your GPUs. Unlock superior throughput...

68
Established
52 transformerlab/transformerlab-app

The open source research environment for AI researchers to seamlessly train,...

68
Established
53 huggingface/text-generation-inference

Large Language Model Text Generation Inference

68
Established
54 hassancs91/SimplerLLM

Simplify interactions with Large Language Models

68
Established
55 ARahim3/mlx-tune

Bringing the Unsloth experience to Mac users via Apple's MLX framework

68
Established
56 shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training...

68
Established
57 meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with...

67
Established
58 lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

67
Established
59 leondgarse/keras_cv_attention_models

Keras...

67
Established
60 InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

67
Established
61 webis-de/small-text

Active Learning for Text Classification in Python

67
Established
62 mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others....

67
Established
63 jingyaogong/minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

67
Established
64 hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

67
Established
65 kanishkamisra/minicons

Utility for behavioral and representational analyses of Language Models

67
Established
66 ScalaConsultants/Aspect-Based-Sentiment-Analysis

💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)

67
Established
67 OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on...

66
Established
68 Tongjilibo/bert4torch

An elegent pytorch implement of transformers

66
Established
69 THU-BPM/MarkLLM

MarkLLM: An Open-Source Toolkit for LLM Watermarking.(EMNLP 2024 System...

66
Established
70 zhudotexe/kani

kani (カニ) is a highly hackable microframework for tool-calling language...

66
Established
71 mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

66
Established
72 rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

66
Established
73 InternLM/lagent

A lightweight framework for building LLM-based agents

65
Established
74 ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving...

65
Established
75 rasbt/reasoning-from-scratch

Implement a reasoning LLM in PyTorch from scratch, step by step

65
Established
76 EricLBuehler/mistral.rs

Fast, flexible LLM inference

65
Established
77 tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

65
Established
78 SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

65
Established
79 lightonai/pylate

Late Interaction Models Training & Retrieval

65
Established
80 sintel-dev/sigllm

Using Large Language Models for Time Series Anomaly Detection

65
Established
81 ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA,...

65
Established
82 huggingface/transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly...

65
Established
83 kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

64
Established
84 azukds/tubular

Python package implementing ML feature engineering and pre-processing for...

64
Established
85 kyegomez/BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language...

64
Established
86 stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to...

64
Established
87 huggingface/course

The Hugging Face course on Transformers

64
Established
88 mybigday/llama.rn

React Native binding of llama.cpp

64
Established
89 mindspore-lab/mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore...

64
Established
90 UbiquitousLearning/mllm

Fast Multimodal LLM on Mobile Devices

64
Established
91 ModelTC/LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models...

64
Established
92 bodaay/HuggingFaceModelDownloader

Simple go utility to download HuggingFace Models and Datasets

63
Established
93 OpenNMT/OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

63
Established
94 mosaicml/llm-foundry

LLM training code for Databricks foundation models

63
Established
95 NVlabs/MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid...

63
Established
96 allenai/dolma

Data and tools for generating and inspecting OLMo pre-training data.

63
Established
97 bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible...

63
Established
98 scaleapi/llm-engine

Scale LLM Engine public repository

63
Established
99 NVIDIA/kvpress

LLM KV cache compression made easy

63
Established
100 csinva/imodelsX

Interpret text data with LLMs (sklearn compatible).

63
Established
1 2 3 76 77 78 Next »