The Transformer Directory
Quality-scored directory of 7,795 transformer models, updated daily. Every model scored on maintenance, adoption, maturity, and community signals.
Transformer models and tools for fine-tuning, quantisation, inference optimisation, and deployment of attention-based architectures.
43
70β100
341
50β69
2,607
30β49
4,804
10β29
Top models by quality score
| # | Model | Score |
|---|---|---|
| 1 |
huggingface/tokenizers
π₯ Fast State-of-the-Art Tokenizers optimized for Research and Production |
|
| 2 |
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs |
|
| 3 |
huggingface/transformers
π€ Transformers: the model-definition framework for state-of-the-art machine... |
|
| 4 |
sgl-project/sglang
SGLang is a high-performance serving framework for large language models and... |
|
| 5 |
Dao-AILab/flash-attention
Fast and memory-efficient exact attention |
|
| 6 |
vllm-project/vllm-omni
A framework for efficient model inference with omni-modality models |
|
| 7 |
ModelCloud/GPTQModel
LLM model quantization (compression) toolkit with hw acceleration support... |
|
| 8 |
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM! |
|
| 9 |
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss,... |
|
| 10 |
qubvel-org/segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and... |
|
| 11 |
Blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models... |
|
| 12 |
openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINOβ’ inference |
|
| 13 |
alibaba/MNN
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,... |
|
| 14 |
huggingface/peft
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. |
|
| 15 |
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising... |
|
| 16 |
LMCache/LMCache
Supercharge Your LLM with the Fastest KV Cache Layer |
|
| 17 |
sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to... |
|
| 18 |
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,... |
|
| 19 |
huggingface/optimum
π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and... |
|
| 20 |
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing... |
|
Browse by category
Transformer Architecture Tutorials
313 models
Local LLM Deployment
257 models
LoRA QLoRA Fine-tuning
230 models
Llm Fine Tuning
212 models
LLM Training Experimentation
173 models
LLM Inference Engines
164 models
ML Foundations Curricula
154 models
Llm Learning Resources
151 models
GPT2 Pretraining Fine-tuning
149 models
Interactive AI Chat UIs
137 models
RLHF Alignment Training
123 models
Review Sentiment Classification
116 models
Llm Implementation Tutorials
111 models
Multimodal Vision Language
110 models
Text Summarization Transformers
107 models
Llm Frameworks Libraries
104 models
Multilingual LLM Adaptation
101 models
Transformer Frameworks Wrappers
94 models
Mathematical Reasoning Transformers
94 models
3D Vision Transformers
85 models
Conversational Chatbot Applications
82 models
Question Answering Systems
81 models
HuggingFace Learning Resources
77 models
LLM Quantization Methods
75 models
Llm Scaling Architecture
74 models
BERT Model Implementations
73 models
Time Series Forecasting Transformers
66 models
Vision Language Models
66 models
Messaging Platform Chatbots
65 models
LLM Terminal Automation
64 models
Transformer Architecture Education
63 models
Transformer Interpretability Mechanistic
63 models
Llm Finetuning Frameworks
62 models
Neural Machine Translation
61 models
Text Classification Transformers
60 models
NLP Learning Coursework
58 models
Llm Reasoning Research
57 models
Model Evaluation Diagnostics
57 models
Llm Interpretability Explainability
54 models
Hate Speech Detection
54 models
Medical Image Segmentation Transformers
53 models
Power Transformer Design
53 models
LLM Implementation From Scratch
52 models
AI-Powered Business Analytics
51 models
OCR Document Extraction
49 models
Transformer Training Optimization
48 models
Multi-agent Orchestration
47 models
Domain Specific Benchmarks
47 models
Llama Model Implementations
45 models
Math Reasoning Datasets
44 models
Llm Compression Optimization
44 models
Vision Transformer Implementations
44 models
Evaluation Frameworks Metrics
44 models
Emotion Detection Transformers
44 models
LLM Benchmark Leaderboards
42 models
Browser-Based ML Inference
41 models
Prompt Engineering Security
41 models
Llm Knowledge Distillation
38 models
Streamlit LLM Interfaces
38 models
Text to Image Generation
37 models
Protein Transformers ML
37 models
Llm Domain Datasets
37 models
Diffusion Language Models
36 models
Multimodal Fusion Transformers
36 models
Vision Language Instruction Tuning
35 models
ViT Image Classification
35 models
Korean Language Models
33 models
Instruction Tuning Datasets
33 models
Image Captioning Transformers
33 models
Medical Image Diagnosis Transformers
32 models
Therapeutic Chatbot Applications
32 models
Academic Thesis Repositories
32 models
Music Generation Transformers
31 models
Creative Text Generation
30 models
Llm Evaluation Benchmarking
30 models
Named Entity Recognition
29 models
Financial Return Prediction
29 models
Semantic Textual Similarity
28 models
Llm Fine Tuning Frameworks
27 models
Machine Translation Transformers
27 models
Resume Job Matching
27 models
Sparse Attention Optimization
26 models
Llm Hallucination Mitigation
26 models
T5 mT5 Fine-tuning
26 models
Graph Transformers
26 models
Fake News Detection
26 models
Uncategorized
25 models
Multimodal Vision Language Models
25 models
Llm Knowledge Editing
25 models
Llm Recommendation Systems
25 models
Attention Mechanism Implementations
24 models
Audio Classification Transformers
24 models
Essay Scoring Grading
24 models
ML API Deployment
24 models
Llm Inference Serving
23 models
BLIP Image Captioning
23 models
Llm Bias Evaluation
23 models
Text to Speech TTS
23 models
Mixture Of Experts Llms
23 models
Llm Research Curation
23 models
Semantic Search Retrieval
23 models
Llm Quantization Techniques
22 models
Text Clustering Topic Modeling
22 models
CLIP Image Embeddings
22 models
Vision Transformer Classification
22 models
Llm Framework Abstractions
21 models
Llm Cuda Optimization
21 models
Clinical Llm Tools
21 models
Text Classification
21 models
Multi-provider LLM Interfaces
20 models
Llm Docker Deployments
20 models
Molecular Generation Transformers
20 models
Parameter Efficient Adapters
19 models
LLM Pruning Compression
19 models
Direct Preference Optimization
19 models
Graph Language Models
19 models
Llm Thesis Research
19 models
Tokenizer Libraries
18 models
Apple Silicon Llm Inference
18 models
Speculative Decoding Algorithms
18 models
Recommendation Systems Transformers
18 models
Bias Detection Transformers
18 models
Gpt Model Fine Tuning
17 models
Object Detection Transformers
17 models
Whisper Speech Transcription
17 models
Code Model Training
16 models
Llm Robot Planning
16 models
Cybersecurity Threat Detection
16 models
Indic Language Translation
15 models
Text Summarization Tools
14 models
Llm Knowledge Graph Generation
14 models
Llm Evaluation Platforms
14 models
Financial Sentiment Analysis
14 models
Gpt Multilingual Training
13 models
Synthetic Data Generation
13 models
Llm Experimentation Labs
13 models
Transformer Implementation Education
12 models
Retrieval Augmented Generation
12 models
Llm Function Calling
11 models
Rust Llm Infrastructure
11 models
Llm Data Labeling
11 models
Mistral Ai Tools
11 models
AI-Powered SaaS Startups
11 models
Langchain Application Development
11 models
Ml Inference Benchmarking
10 models
Llm Orchestration Platforms
10 models
AI Content Detection
10 models
Safety Robustness Evaluation
10 models
YouTube Video Summarization
10 models
Protein Design Llms
10 models
Study Aid Generators
10 models
Spam Detection Transformers
10 models
Llm Comparison Evaluation
10 models
Compositional Reasoning Embeddings
10 models
Kv Cache Optimization
9 models
Code Completion Copilots
9 models
Multimodal Visual Grounding
9 models
Llm Agent Training Gyms
9 models
Generative Ai Learning
9 models
Ai Generated Text Detection
9 models
Llm Fine Tuning Optimization
9 models
Jailbreak Attacks Analysis
9 models
Gemma Model Fine Tuning
9 models
Model Fine Tuning Methods
9 models
Clinical Text Classification
9 models
Llm Translation Tools
8 models
PHP AI SDKs
8 models
Nlp Learning Resources
8 models
Disaster Tweet Classification
8 models
Chain Of Thought Reasoning
8 models
Vision Transformer Optimization
8 models
PII Redaction Anonymization
7 models
Llm Evaluation Frameworks
7 models
Gpt Implementation Tutorials
6 models
Mixup Augmentation Frameworks
6 models
Bert Model Frameworks
6 models
Ollama Chat Interfaces
6 models
Llm Chatbot Interfaces
6 models
Vulnerability Detection Llm
6 models
Wav2Vec2 Speech Recognition
6 models
Document Data Extraction
6 models
Prompt Engineering Techniques
5 models
Nlp Fundamentals Tutorials
5 models
Competitive Agent Games
5 models
Gpt2 Language Models
5 models
Clip Vision Language
5 models
Ai Music Generation
5 models
Qwen Llm Ecosystem
5 models
Agent Memory Systems
5 models
Ai Stock Analysis
5 models
Structured Output Enforcement
5 models
Langchain Integration Patterns
5 models
Rust Agent Frameworks
5 models
Task Oriented Dialogue Systems
5 models
Llm Pentest Automation
5 models
Langchain Learning Fundamentals
5 models
Llm Orchestration Routing
4 models
State Space Model Architectures
4 models
Llm Serialization Formats
4 models
Machine Translation Systems
4 models
Llm Pricing Comparison
4 models
Knowledge Distillation Compression
4 models
Llm Provider Sdks
4 models
Kubernetes Llm Serving
4 models
Rag Qa Systems
4 models
Image Captioning Tools
4 models
Multimodal Rag Systems
3 models
Chatglm Fine Tuning
3 models
Local Voice Assistants
3 models
Chemistry Llm Benchmarks
3 models
Julia Ml Frameworks
3 models
Chatgpt Api Tutorials
3 models
Agent Memory Infrastructure
3 models
Graph Neural Networks
3 models
Text Translation Tools
3 models
Distributed Training Frameworks
3 models
Text Tokenization Libraries
3 models
Neural Data Compression
3 models
Variational Autoencoders Nlp
3 models
Jax Ml Frameworks
3 models
Ollama Go Clients
3 models
Ml Benchmarking Frameworks
3 models
Adversarial Nlp Robustness
3 models
Pdf Qa Systems
3 models
Llm Data Visualization
3 models
Llm Request Routing
3 models
Nano Gpt Variants
3 models
Model Compression Optimization
3 models
Rust Onnx Runtime
3 models
Llm Interview Preparation
3 models
Ml Learning Resources
3 models
Langchain Framework Guides
3 models
Explainability Interpretability Frameworks
2 models
End To End Asr Frameworks
2 models
Llm Chat Interfaces
2 models
Protein Language Models
2 models
Semantic Segmentation Techniques
2 models
Llm Chatbot Applications
2 models
Image Caption Generation
2 models
Multi Agent Debate Systems
2 models
Langchain Framework Learning
2 models
Go Ml Bindings
2 models
Trajectory Prediction Ml
2 models
Llm Cost Tracking
2 models
Peptide Property Prediction
2 models
Hugging Face Tutorials
2 models
Defect Detection Quality Forensics
2 models
Local Rag Frameworks
2 models
Langchain Prompt Templates
2 models
Advanced Summarization Methods
2 models
Hybrid Retrieval Optimization
2 models
Node Llm Client Sdks
2 models
Agi Consciousness Philosophy
2 models
Ml Project Collections
2 models
Huggingface Hub Clients
2 models
Agent Memory Architectures
2 models
Agent Orchestration Platforms
2 models
Agent Cost Governance
2 models
Stable Diffusion Tools
2 models
Blackroad Os Ecosystem
2 models
Academic Paper Analysis
2 models
Ai Supply Chain Optimization
2 models
Langchain Tool Building
2 models
Pdf Document Chatbots
2 models
Financial News Sentiment
2 models
Ai Document Summarization
2 models
Langchain Tool Integrations
2 models
Semantic Book Recommenders
2 models
Stock Price Forecasting
2 models
Tokenization Libraries
2 models
Pdf Question Answering
2 models
Next Word Prediction
2 models
Video Editing Diffusion
1 models
Financial Ai Agents
1 models
Ai Image Generation Platforms
1 models
Content Based Recommendation
1 models
Computer Vision Learning
1 models
Loss Function Implementations
1 models
Lightweight Training Utilities
1 models
Speech Ai Coursework
1 models
Time Series Forecasting
1 models
Chatbot Nlp Frameworks
1 models
Energy Sector Forecasting
1 models
Ai Powered Search Engines
1 models
Character Motion Animation
1 models
Ai Presentation Generation
1 models
Feature Selection Frameworks
1 models
Sign Language Recognition
1 models
Legal Document Analysis
1 models
Ios Nlp Frameworks
1 models
Speaker Diarization Embedding
1 models
Generative Ai Learning Projects
1 models
Kaggle Competition Solutions
1 models
Local Ai Workstations
1 models
Nlp Education Courses
1 models
Session Context Memory
1 models
Rna Structure Learning
1 models
Compositional T2I Generation
1 models
Lora Training Tools
1 models
Ai Subtitle Translation
1 models
Black Box Optimization
1 models
Ai Video Generation
1 models
Causal Inference Nlp
1 models
Mcp Demo Examples
1 models
Self Supervised Learning
1 models
Healthcare Ai Diagnostics
1 models
Lottery Number Prediction
1 models
Generative Ai Platforms
1 models
Low Light Image Restoration
1 models
Game Playing Agents
1 models
Medical Image Segmentation
1 models
Prompt Optimization Systems
1 models
Paper Implementation Collections
1 models
World Models Frameworks
1 models
Fact Checking Systems
1 models
Quantum Nlp Processing
1 models
Spiking Neural Networks
1 models
Advanced Prompt Protocols
1 models
Image Generation Mcp
1 models
Speech Synthesis Diffusion
1 models
Music Similarity Embeddings
1 models
Data Pipeline Frameworks
1 models
Kolmogorov Arnold Networks
1 models
Domain Adaptation Frameworks
1 models
Text To Speech Frameworks
1 models
Self Hosted Embedding Servers
1 models
Multimodal Search Engines
1 models
Streamlit Langchain Apps
1 models
Text To Sql Rag
1 models
Diffusion Model Frameworks
1 models
Telegram Ai Assistants
1 models
Prompt Engineering Optimization
1 models
Ml Project Portfolios
1 models
Hate Speech Content Moderation
1 models
Llm Json Streaming
1 models
Streamlit Chatbot Apps
1 models
Content To Markdown
1 models
Codebase Context Extraction
1 models
Memory Augmented Architectures
1 models
Model Confidence Calibration
1 models
Variational Autoencoder Implementations
1 models
Javascript Ml Libraries
1 models
Membership Inference Attacks
1 models
Multi Agent Frameworks
1 models
Ai Security Training Labs
1 models
Multi Modal Ai Assistants
1 models
Langgraph Agentic Systems
1 models
Rust Nlp Bindings
1 models
Eeg Brain Signal Processing
1 models
Natural Language Sql Builders
1 models
Diffusion Web Interfaces
1 models
Streamlit App Templates
1 models
Cli Llm Interfaces
1 models
Video Content Intelligence
1 models
Reading Comprehension Qa
1 models
Edge Device Ml Frameworks
1 models
Mental Health Risk Detection
1 models
Ai Agent Memory Systems
1 models
Cold Email Generation
1 models
Blog Content Generation
1 models
Ruby Llm Frameworks
1 models
Keyword Speech Recognition
1 models
Apple Foundation Models
1 models
Ios Speech Frameworks
1 models
Spring Ai Applications
1 models
Financial News Rag
1 models
Langchain Application Tutorials
1 models
Code Context Packaging
1 models
Traffic Signal Optimization
1 models
Market Research Agents
1 models
Knowledge Graph Question Answering
1 models
Ai Service Sdks
1 models
Extractive Question Answering
1 models
Youtube Transcript Summarization
1 models
Diffusion Adversarial Robustness
1 models
Diffusion Deployment Serving
1 models
Mental Health Chatbots
1 models
Llm Internals Visualization
1 models
Nutrition Ai Apps
1 models
Natural Language Database Agents
1 models
Autogen Framework Implementations
1 models
Book Recommendation Systems
1 models
Ai Investment Analysis
1 models
Pii Redaction Tools
1 models
Youtube Video Qa
1 models
Voice Command Assistants
1 models
Embedding Model Tuning
1 models
Model Inference Serving
1 models
Text To Sql Generation
1 models
Langgraph Agent Implementations
1 models
Qa System Implementations
1 models
Ai Powered App Builders
1 models
Bias Measurement Evaluation
1 models
Rag Document Qa
1 models
Copilot Chat Extensions
1 models
Ai Literacy Education
1 models
Multi Agent Ai Systems
1 models
Agent Observability Debugging
1 models
Ml Development Environments
1 models
Ai Video Creation
1 models
Multi Task Learning
1 models
Onnx Model Deployment
1 models
Emoji Generation Ml
1 models
Image Captioning
1 models
Document Intelligence Extraction
1 models
Satellite Imagery Ml
1 models
Bert Model Deployment
1 models
Natural Language Sql
1 models
Ai Learning Collections
1 models
Gpt Rag Foundations
1 models
Medical Rag Chatbots
1 models
Generative Ai Projects
1 models
Quantum Machine Learning
1 models
Video Diffusion Models
1 models