The Transformer Directory

Quality-scored directory of 7,795 transformer models, updated daily. Every model scored on maintenance, adoption, maturity, and community signals.

Transformer models and tools for fine-tuning, quantisation, inference optimisation, and deployment of attention-based architectures.

Verified

43

70–100

Established

341

50–69

Emerging

2,607

30–49

Experimental

4,804

10–29

Top models by quality score

# Model Score
1 huggingface/tokenizers

πŸ’₯ Fast State-of-the-Art Tokenizers optimized for Research and Production

90
2 vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

87
3 huggingface/transformers

πŸ€— Transformers: the model-definition framework for state-of-the-art machine...

87
4 sgl-project/sglang

SGLang is a high-performance serving framework for large language models and...

87
5 Dao-AILab/flash-attention

Fast and memory-efficient exact attention

86
6 vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

83
7 ModelCloud/GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support...

83
8 AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

82
9 unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. πŸ¦₯ Train OpenAI gpt-oss,...

81
10 qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and...

81
11 Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models...

81
12 openvinotoolkit/nncf

Neural Network Compression Framework for enhanced OpenVINOβ„’ inference

80
13 alibaba/MNN

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,...

80
14 huggingface/peft

πŸ€— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

80
15 lucidrains/x-transformers

A concise but complete full-attention transformer with a set of promising...

79
16 LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

79
17 sgl-project/SpecForge

Train speculative decoding models effortlessly and port them smoothly to...

79
18 modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...

78
19 huggingface/optimum

πŸš€ Accelerate inference and training of πŸ€— Transformers, Diffusers, TIMM and...

77
20 microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing...

77

Browse by category

Transformer Architecture Tutorials

313 models

Local LLM Deployment

257 models

LoRA QLoRA Fine-tuning

230 models

Llm Fine Tuning

212 models

LLM Training Experimentation

173 models

LLM Inference Engines

164 models

ML Foundations Curricula

154 models

Llm Learning Resources

151 models

GPT2 Pretraining Fine-tuning

149 models

Interactive AI Chat UIs

137 models

RLHF Alignment Training

123 models

Review Sentiment Classification

116 models

Llm Implementation Tutorials

111 models

Multimodal Vision Language

110 models

Text Summarization Transformers

107 models

Llm Frameworks Libraries

104 models

Multilingual LLM Adaptation

101 models

Transformer Frameworks Wrappers

94 models

Mathematical Reasoning Transformers

94 models

3D Vision Transformers

85 models

Conversational Chatbot Applications

82 models

Question Answering Systems

81 models

HuggingFace Learning Resources

77 models

LLM Quantization Methods

75 models

Llm Scaling Architecture

74 models

BERT Model Implementations

73 models

Time Series Forecasting Transformers

66 models

Vision Language Models

66 models

Messaging Platform Chatbots

65 models

LLM Terminal Automation

64 models

Transformer Architecture Education

63 models

Transformer Interpretability Mechanistic

63 models

Llm Finetuning Frameworks

62 models

Neural Machine Translation

61 models

Text Classification Transformers

60 models

NLP Learning Coursework

58 models

Llm Reasoning Research

57 models

Model Evaluation Diagnostics

57 models

Llm Interpretability Explainability

54 models

Hate Speech Detection

54 models

Medical Image Segmentation Transformers

53 models

Power Transformer Design

53 models

LLM Implementation From Scratch

52 models

AI-Powered Business Analytics

51 models

OCR Document Extraction

49 models

Transformer Training Optimization

48 models

Multi-agent Orchestration

47 models

Domain Specific Benchmarks

47 models

Llama Model Implementations

45 models

Math Reasoning Datasets

44 models

Llm Compression Optimization

44 models

Vision Transformer Implementations

44 models

Evaluation Frameworks Metrics

44 models

Emotion Detection Transformers

44 models

LLM Benchmark Leaderboards

42 models

Browser-Based ML Inference

41 models

Prompt Engineering Security

41 models

Llm Knowledge Distillation

38 models

Streamlit LLM Interfaces

38 models

Text to Image Generation

37 models

Protein Transformers ML

37 models

Llm Domain Datasets

37 models

Diffusion Language Models

36 models

Multimodal Fusion Transformers

36 models

Vision Language Instruction Tuning

35 models

ViT Image Classification

35 models

Korean Language Models

33 models

Instruction Tuning Datasets

33 models

Image Captioning Transformers

33 models

Medical Image Diagnosis Transformers

32 models

Therapeutic Chatbot Applications

32 models

Academic Thesis Repositories

32 models

Music Generation Transformers

31 models

Creative Text Generation

30 models

Llm Evaluation Benchmarking

30 models

Named Entity Recognition

29 models

Financial Return Prediction

29 models

Semantic Textual Similarity

28 models

Llm Fine Tuning Frameworks

27 models

Machine Translation Transformers

27 models

Resume Job Matching

27 models

Sparse Attention Optimization

26 models

Llm Hallucination Mitigation

26 models

T5 mT5 Fine-tuning

26 models

Graph Transformers

26 models

Fake News Detection

26 models

Uncategorized

25 models

Multimodal Vision Language Models

25 models

Llm Knowledge Editing

25 models

Llm Recommendation Systems

25 models

Attention Mechanism Implementations

24 models

Audio Classification Transformers

24 models

Essay Scoring Grading

24 models

ML API Deployment

24 models

Llm Inference Serving

23 models

BLIP Image Captioning

23 models

Llm Bias Evaluation

23 models

Text to Speech TTS

23 models

Mixture Of Experts Llms

23 models

Llm Research Curation

23 models

Semantic Search Retrieval

23 models

Llm Quantization Techniques

22 models

Text Clustering Topic Modeling

22 models

CLIP Image Embeddings

22 models

Vision Transformer Classification

22 models

Llm Framework Abstractions

21 models

Llm Cuda Optimization

21 models

Clinical Llm Tools

21 models

Text Classification

21 models

Multi-provider LLM Interfaces

20 models

Llm Docker Deployments

20 models

Molecular Generation Transformers

20 models

Parameter Efficient Adapters

19 models

LLM Pruning Compression

19 models

Direct Preference Optimization

19 models

Graph Language Models

19 models

Llm Thesis Research

19 models

Tokenizer Libraries

18 models

Apple Silicon Llm Inference

18 models

Speculative Decoding Algorithms

18 models

Recommendation Systems Transformers

18 models

Bias Detection Transformers

18 models

Gpt Model Fine Tuning

17 models

Object Detection Transformers

17 models

Whisper Speech Transcription

17 models

Code Model Training

16 models

Llm Robot Planning

16 models

Cybersecurity Threat Detection

16 models

Indic Language Translation

15 models

Text Summarization Tools

14 models

Llm Knowledge Graph Generation

14 models

Llm Evaluation Platforms

14 models

Financial Sentiment Analysis

14 models

Gpt Multilingual Training

13 models

Synthetic Data Generation

13 models

Llm Experimentation Labs

13 models

Transformer Implementation Education

12 models

Retrieval Augmented Generation

12 models

Llm Function Calling

11 models

Rust Llm Infrastructure

11 models

Llm Data Labeling

11 models

Mistral Ai Tools

11 models

AI-Powered SaaS Startups

11 models

Langchain Application Development

11 models

Ml Inference Benchmarking

10 models

Llm Orchestration Platforms

10 models

AI Content Detection

10 models

Safety Robustness Evaluation

10 models

YouTube Video Summarization

10 models

Protein Design Llms

10 models

Study Aid Generators

10 models

Spam Detection Transformers

10 models

Llm Comparison Evaluation

10 models

Compositional Reasoning Embeddings

10 models

Kv Cache Optimization

9 models

Code Completion Copilots

9 models

Multimodal Visual Grounding

9 models

Llm Agent Training Gyms

9 models

Generative Ai Learning

9 models

Ai Generated Text Detection

9 models

Llm Fine Tuning Optimization

9 models

Jailbreak Attacks Analysis

9 models

Gemma Model Fine Tuning

9 models

Model Fine Tuning Methods

9 models

Clinical Text Classification

9 models

Llm Translation Tools

8 models

PHP AI SDKs

8 models

Nlp Learning Resources

8 models

Disaster Tweet Classification

8 models

Chain Of Thought Reasoning

8 models

Vision Transformer Optimization

8 models

PII Redaction Anonymization

7 models

Llm Evaluation Frameworks

7 models

Gpt Implementation Tutorials

6 models

Mixup Augmentation Frameworks

6 models

Bert Model Frameworks

6 models

Ollama Chat Interfaces

6 models

Llm Chatbot Interfaces

6 models

Vulnerability Detection Llm

6 models

Wav2Vec2 Speech Recognition

6 models

Document Data Extraction

6 models

Prompt Engineering Techniques

5 models

Nlp Fundamentals Tutorials

5 models

Competitive Agent Games

5 models

Gpt2 Language Models

5 models

Clip Vision Language

5 models

Ai Music Generation

5 models

Qwen Llm Ecosystem

5 models

Agent Memory Systems

5 models

Ai Stock Analysis

5 models

Structured Output Enforcement

5 models

Langchain Integration Patterns

5 models

Rust Agent Frameworks

5 models

Task Oriented Dialogue Systems

5 models

Llm Pentest Automation

5 models

Langchain Learning Fundamentals

5 models

Llm Orchestration Routing

4 models

State Space Model Architectures

4 models

Llm Serialization Formats

4 models

Machine Translation Systems

4 models

Llm Pricing Comparison

4 models

Knowledge Distillation Compression

4 models

Llm Provider Sdks

4 models

Kubernetes Llm Serving

4 models

Rag Qa Systems

4 models

Image Captioning Tools

4 models

Multimodal Rag Systems

3 models

Chatglm Fine Tuning

3 models

Local Voice Assistants

3 models

Chemistry Llm Benchmarks

3 models

Julia Ml Frameworks

3 models

Chatgpt Api Tutorials

3 models

Agent Memory Infrastructure

3 models

Graph Neural Networks

3 models

Text Translation Tools

3 models

Distributed Training Frameworks

3 models

Text Tokenization Libraries

3 models

Neural Data Compression

3 models

Variational Autoencoders Nlp

3 models

Jax Ml Frameworks

3 models

Ollama Go Clients

3 models

Ml Benchmarking Frameworks

3 models

Adversarial Nlp Robustness

3 models

Pdf Qa Systems

3 models

Llm Data Visualization

3 models

Llm Request Routing

3 models

Nano Gpt Variants

3 models

Model Compression Optimization

3 models

Rust Onnx Runtime

3 models

Llm Interview Preparation

3 models

Ml Learning Resources

3 models

Langchain Framework Guides

3 models

Explainability Interpretability Frameworks

2 models

End To End Asr Frameworks

2 models

Llm Chat Interfaces

2 models

Protein Language Models

2 models

Semantic Segmentation Techniques

2 models

Llm Chatbot Applications

2 models

Image Caption Generation

2 models

Multi Agent Debate Systems

2 models

Langchain Framework Learning

2 models

Go Ml Bindings

2 models

Trajectory Prediction Ml

2 models

Llm Cost Tracking

2 models

Peptide Property Prediction

2 models

Hugging Face Tutorials

2 models

Defect Detection Quality Forensics

2 models

Local Rag Frameworks

2 models

Langchain Prompt Templates

2 models

Advanced Summarization Methods

2 models

Hybrid Retrieval Optimization

2 models

Node Llm Client Sdks

2 models

Agi Consciousness Philosophy

2 models

Ml Project Collections

2 models

Huggingface Hub Clients

2 models

Agent Memory Architectures

2 models

Agent Orchestration Platforms

2 models

Agent Cost Governance

2 models

Stable Diffusion Tools

2 models

Blackroad Os Ecosystem

2 models

Academic Paper Analysis

2 models

Ai Supply Chain Optimization

2 models

Langchain Tool Building

2 models

Pdf Document Chatbots

2 models

Financial News Sentiment

2 models

Ai Document Summarization

2 models

Langchain Tool Integrations

2 models

Semantic Book Recommenders

2 models

Stock Price Forecasting

2 models

Tokenization Libraries

2 models

Pdf Question Answering

2 models

Next Word Prediction

2 models

Video Editing Diffusion

1 models

Financial Ai Agents

1 models

Ai Image Generation Platforms

1 models

Content Based Recommendation

1 models

Computer Vision Learning

1 models

Loss Function Implementations

1 models

Lightweight Training Utilities

1 models

Speech Ai Coursework

1 models

Time Series Forecasting

1 models

Chatbot Nlp Frameworks

1 models

Energy Sector Forecasting

1 models

Ai Powered Search Engines

1 models

Character Motion Animation

1 models

Ai Presentation Generation

1 models

Feature Selection Frameworks

1 models

Sign Language Recognition

1 models

Legal Document Analysis

1 models

Ios Nlp Frameworks

1 models

Speaker Diarization Embedding

1 models

Generative Ai Learning Projects

1 models

Kaggle Competition Solutions

1 models

Local Ai Workstations

1 models

Nlp Education Courses

1 models

Session Context Memory

1 models

Rna Structure Learning

1 models

Compositional T2I Generation

1 models

Lora Training Tools

1 models

Ai Subtitle Translation

1 models

Black Box Optimization

1 models

Ai Video Generation

1 models

Causal Inference Nlp

1 models

Mcp Demo Examples

1 models

Self Supervised Learning

1 models

Healthcare Ai Diagnostics

1 models

Lottery Number Prediction

1 models

Generative Ai Platforms

1 models

Low Light Image Restoration

1 models

Game Playing Agents

1 models

Medical Image Segmentation

1 models

Prompt Optimization Systems

1 models

Paper Implementation Collections

1 models

World Models Frameworks

1 models

Fact Checking Systems

1 models

Quantum Nlp Processing

1 models

Spiking Neural Networks

1 models

Advanced Prompt Protocols

1 models

Image Generation Mcp

1 models

Speech Synthesis Diffusion

1 models

Music Similarity Embeddings

1 models

Data Pipeline Frameworks

1 models

Kolmogorov Arnold Networks

1 models

Domain Adaptation Frameworks

1 models

Text To Speech Frameworks

1 models

Self Hosted Embedding Servers

1 models

Multimodal Search Engines

1 models

Streamlit Langchain Apps

1 models

Text To Sql Rag

1 models

Diffusion Model Frameworks

1 models

Telegram Ai Assistants

1 models

Prompt Engineering Optimization

1 models

Ml Project Portfolios

1 models

Hate Speech Content Moderation

1 models

Llm Json Streaming

1 models

Streamlit Chatbot Apps

1 models

Content To Markdown

1 models

Codebase Context Extraction

1 models

Memory Augmented Architectures

1 models

Model Confidence Calibration

1 models

Variational Autoencoder Implementations

1 models

Javascript Ml Libraries

1 models

Membership Inference Attacks

1 models

Multi Agent Frameworks

1 models

Ai Security Training Labs

1 models

Multi Modal Ai Assistants

1 models

Langgraph Agentic Systems

1 models

Rust Nlp Bindings

1 models

Eeg Brain Signal Processing

1 models

Natural Language Sql Builders

1 models

Diffusion Web Interfaces

1 models

Streamlit App Templates

1 models

Cli Llm Interfaces

1 models

Video Content Intelligence

1 models

Reading Comprehension Qa

1 models

Edge Device Ml Frameworks

1 models

Mental Health Risk Detection

1 models

Ai Agent Memory Systems

1 models

Cold Email Generation

1 models

Blog Content Generation

1 models

Ruby Llm Frameworks

1 models

Keyword Speech Recognition

1 models

Apple Foundation Models

1 models

Ios Speech Frameworks

1 models

Spring Ai Applications

1 models

Financial News Rag

1 models

Langchain Application Tutorials

1 models

Code Context Packaging

1 models

Traffic Signal Optimization

1 models

Market Research Agents

1 models

Knowledge Graph Question Answering

1 models

Ai Service Sdks

1 models

Extractive Question Answering

1 models

Youtube Transcript Summarization

1 models

Diffusion Adversarial Robustness

1 models

Diffusion Deployment Serving

1 models

Mental Health Chatbots

1 models

Llm Internals Visualization

1 models

Nutrition Ai Apps

1 models

Natural Language Database Agents

1 models

Autogen Framework Implementations

1 models

Book Recommendation Systems

1 models

Ai Investment Analysis

1 models

Pii Redaction Tools

1 models

Youtube Video Qa

1 models

Voice Command Assistants

1 models

Embedding Model Tuning

1 models

Model Inference Serving

1 models

Text To Sql Generation

1 models

Langgraph Agent Implementations

1 models

Qa System Implementations

1 models

Ai Powered App Builders

1 models

Bias Measurement Evaluation

1 models

Rag Document Qa

1 models

Copilot Chat Extensions

1 models

Ai Literacy Education

1 models

Multi Agent Ai Systems

1 models

Agent Observability Debugging

1 models

Ml Development Environments

1 models

Ai Video Creation

1 models

Multi Task Learning

1 models

Onnx Model Deployment

1 models

Emoji Generation Ml

1 models

Image Captioning

1 models

Document Intelligence Extraction

1 models

Satellite Imagery Ml

1 models

Bert Model Deployment

1 models

Natural Language Sql

1 models

Ai Learning Collections

1 models

Gpt Rag Foundations

1 models

Medical Rag Chatbots

1 models

Generative Ai Projects

1 models

Quantum Machine Learning

1 models

Video Diffusion Models

1 models