All Transformer Models
7,795 models ranked by quality score · Page 4 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 301 |
AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on... |
|
Established |
| 302 |
fixie-ai/ultravox
A fast multimodal LLM for real-time voice |
|
Established |
| 303 |
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin |
|
Established |
| 304 |
monologg/JointBERT
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification... |
|
Established |
| 305 |
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need". |
|
Established |
| 306 |
ai-forever/ru-gpts
Russian GPT3 models. |
|
Established |
| 307 |
daviddaytw/react-native-transformers
Run local LLM from Huggingface in React-Native or Expo using onnxruntime. |
|
Established |
| 308 |
NiuTrans/LaTeXTrans
A tool for translating the content of LaTeX documents into various other... |
|
Established |
| 309 |
zjunlp/EasyInstruct
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs. |
|
Established |
| 310 |
abhimishra91/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks |
|
Established |
| 311 |
bshao001/ChatLearner
A chatbot implemented in TensorFlow based on the seq2seq model, with certain... |
|
Established |
| 312 |
vitoplantamura/OnnxStream
Lightweight inference library for ONNX files, written in C++. It can run... |
|
Established |
| 313 |
grammarly/gector
Official implementation of the papers "GECToR โ Grammatical Error... |
|
Established |
| 314 |
LoicGrobol/zeldarose
Train transformer-based models. |
|
Established |
| 315 |
ikergarcia1996/Easy-Translate
Easy-Translate is a script for translating large text files with a SINGLE... |
|
Established |
| 316 |
lone-cloud/gerbil
A desktop app for running Large Language Models locally. |
|
Established |
| 317 |
rllm-team/rllm
Pytorch Library for Relational Table Learning with LLMs. |
|
Established |
| 318 |
tylerelyt/LLM-Workshop
๐ Learn Large Language Model development through hands-on projects and... |
|
Established |
| 319 |
kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Documentย Q&A |
|
Established |
| 320 |
tensorgi/TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)... |
|
Established |
| 321 |
Nicolepcx/transformers-the-definitive-guide
This is the official repository for the book Transformers - The Definitive Guide |
|
Established |
| 322 |
telekom/mltb2
Machine Learning Toolbox 2 |
|
Established |
| 323 |
DashyDashOrg/pandas-llm
Pandas-LLM |
|
Established |
| 324 |
EricFillion/happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP... |
|
Established |
| 325 |
rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used |
|
Established |
| 326 |
Rishit-dagli/Fast-Transformer
An implementation of Additive Attention |
|
Established |
| 327 |
CVHub520/X-AnyLabeling-Server
A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling |
|
Established |
| 328 |
shreyansh26/Annotated-ML-Papers
Annotations of the interesting ML papers I read |
|
Established |
| 329 |
kyegomez/LongNet
Implementation of plug in and play Attention from "LongNet: Scaling... |
|
Established |
| 330 |
kyegomez/PALI3
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:... |
|
Established |
| 331 |
beehive-lab/GPULlama3.java
GPU-accelerated Llama3.java inference in pure Java using TornadoVM. |
|
Established |
| 332 |
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from... |
|
Established |
| 333 |
chanind/frame-semantic-transformer
Frame Semantic Parser based on T5 and FrameNet |
|
Established |
| 334 |
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs)... |
|
Established |
| 335 |
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models.... |
|
Established |
| 336 |
symfony/ai-platform
PHP library for interacting with AI platform provider. |
|
Established |
| 337 |
abelriboulot/onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at... |
|
Established |
| 338 |
GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with... |
|
Established |
| 339 |
monologg/KoELECTRA
Pretrained ELECTRA Model for Korean |
|
Established |
| 340 |
pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers |
|
Established |
| 341 |
opendilab/LightRFT
LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement... |
|
Established |
| 342 |
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which... |
|
Established |
| 343 |
lonePatient/Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for... |
|
Established |
| 344 |
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models. |
|
Established |
| 345 |
avilum/minrlm
Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla... |
|
Established |
| 346 |
Strvm/meta-ai-api
Llama 3 API 70B & 405B (MetaAI Reverse Engineered) |
|
Established |
| 347 |
kyegomez/SwitchTransformers
Implementation of Switch Transformers from the paper: "Switch Transformers:... |
|
Established |
| 348 |
microsoft/sarathi-serve
A low-latency & high-throughput serving engine for LLMs |
|
Established |
| 349 |
zemlyansky/gpt-tfjs
GPT in TensorFlow.js |
|
Established |
| 350 |
pengzhangzhi/Open-dLLM
Open diffusion language model for code generation โ releasing pretraining,... |
|
Established |
| 351 |
tensorchord/modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and... |
|
Established |
| 352 |
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM |
|
Established |
| 353 |
salesforce/TransmogrifAI
TransmogrifAI (pronounced trฤns-mลgหrษ-fฤซ) is an AutoML library for building... |
|
Established |
| 354 |
Troyanovsky/Local-LLM-Comparison-Colab-UI
Compare the performance of different LLM that can be deployed locally on... |
|
Established |
| 355 |
gordicaleksa/pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've... |
|
Established |
| 356 |
bytefer/ollama-ocr
Implementing OCR with a local visual model run by ollama. |
|
Established |
| 357 |
ridgerchu/matmulfreellm
Implementation for MatMul-free LM. |
|
Established |
| 358 |
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully... |
|
Established |
| 359 |
gitkaz/mlx_gguf_server
This is a FastAPI based LLM server. Load multiple LLM models (MLX or... |
|
Established |
| 360 |
jina-ai/rungpt
An open-source cloud-native of large multi-modal models (LMMs) serving framework. |
|
Established |
| 361 |
camenduru/text-generation-webui-colab
A colab gradio web UI for running Large Language Models |
|
Established |
| 362 |
Imalwayshere/Open-Detector
BERT-based AI-generated academic text detection model |
|
Established |
| 363 |
appvision-ai/fast-bert
Super easy library for BERT based NLP models |
|
Established |
| 364 |
EfficientMoE/MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models. |
|
Established |
| 365 |
adrienpetralia/NILMFormer
[KDD 2025] NILMFormer: A Sequence-To-Sequence Non-Stationarity Aware... |
|
Established |
| 366 |
bytedance/video-SALMONN-2
video-SALMONN 2 is a powerful audio-visual large language model (LLM) that... |
|
Established |
| 367 |
Gen-Verse/dLLM-RL
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for... |
|
Established |
| 368 |
cruiseresearchgroup/SensorLLM
[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language... |
|
Established |
| 369 |
modelscope/easydistill
a toolkit on knowledge distillation for large language models |
|
Established |
| 370 |
BioinfoMachineLearning/DeepInteract
A geometric deep learning framework (Geometric Transformers) for predicting... |
|
Established |
| 371 |
polakowo/gpt2bot
Your new Telegram buddy powered by transformers |
|
Established |
| 372 |
MDGrey33/pyvisionai
The PyVisionAI Official Repo |
|
Established |
| 373 |
ml4fp/2025-lbnl
ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics... |
|
Established |
| 374 |
keith2018/TinyGPT
Tiny C++ LLM inference implementation from scratch |
|
Established |
| 375 |
microsoft/rat-sql
A relation-aware semantic parsing model from English to SQL |
|
Established |
| 376 |
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert,... |
|
Established |
| 377 |
bbruceyuan/LLMs-Zero-to-Hero
ไปๆ ๅๅฐๅๅฐๅคงๆจกๅ๏ผLLM๏ผๅคง่ฑ้~ ๆฌข่ฟๅ ณๆณจๅ็ปญ๏ผ๏ผ๏ผ |
|
Established |
| 378 |
HPAI-BSC/TuRTLe
TuRTLe: A Unified Evaluation of LLMs for RTL Generation ๐ข (MLCAD 2025) |
|
Established |
| 379 |
uber-research/PPLM
Plug and Play Language Model implementation. Allows to steer topic and... |
|
Established |
| 380 |
sammcj/ingest
Parse files (e.g. code repos) and websites to clipboard or a file for... |
|
Established |
| 381 |
SamsungSAILMontreal/nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting... |
|
Established |
| 382 |
yotambraun/APDTFlow
APDTFlow is a modern and extensible forecasting framework for time series... |
|
Established |
| 383 |
olivkoch/nano-trm
An implementation of Tiny Recursive Models (TRM) |
|
Established |
| 384 |
BeRo1985/pasllm
PasLLM - LLM inference engine in Object Pascal (synced from my private work... |
|
Established |
| 385 |
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo |
|
Emerging |
| 386 |
CPJKU/wechsel
Code for WECHSEL: Effective initialization of subword embeddings for... |
|
Emerging |
| 387 |
JIA-Lab-research/MGM-Omni
MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech |
|
Emerging |
| 388 |
iusztinpaul/hands-on-llms
๐ฆ ๐๐ฒ๐ฎ๐ฟ๐ป about ๐๐๐ ๐, ๐๐๐ ๐ข๐ฝ๐, and ๐๐ฒ๐ฐ๐๐ผ๐ฟ ๐๐๐ for free by designing, training,... |
|
Emerging |
| 389 |
stair-lab/mlhp
Machine Learning from Human Preferences |
|
Emerging |
| 390 |
yuanzhoulvpi2017/zero_nlp
ไธญๆnlp่งฃๅณๆนๆก(ๅคงๆจกๅใๆฐๆฎใๆจกๅใ่ฎญ็ปใๆจ็) |
|
Emerging |
| 391 |
deep-symbolic-mathematics/LLM-SR
[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on... |
|
Emerging |
| 392 |
matlab-deep-learning/transformer-models
Deep Learning Transformer models in MATLAB |
|
Emerging |
| 393 |
asahi417/lmppl
Calculate perplexity on a text with pre-trained language models. Support MLM... |
|
Emerging |
| 394 |
hkproj/pytorch-llama
LLaMA 2 implemented from scratch in PyTorch |
|
Emerging |
| 395 |
AmpereComputingAI/ampere_model_library
AML's goal is to make benchmarking of various AI architectures on Ampere... |
|
Emerging |
| 396 |
mead-ml/mead-baseline
Deep-Learning Model Exploration and Development for NLP |
|
Emerging |
| 397 |
minggnim/nlp-models
A repository for training transformer based models |
|
Emerging |
| 398 |
pbloem/former
Simple transformer implementation from scratch in pytorch. (archival, latest... |
|
Emerging |
| 399 |
CLUEbenchmark/CLUE
ไธญๆ่ฏญ่จ็่งฃๆต่ฏๅบๅ Chinese Language Understanding Evaluation Benchmark: datasets,... |
|
Emerging |
| 400 |
google-research/bigbird
Transformers for Longer Sequences |
|
Emerging |