All Transformer Models
7,795 models ranked by quality score · Page 6 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 501 |
dali92002/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022 |
|
Emerging |
| 502 |
zackshen/gguf
a GGUF file parser |
|
Emerging |
| 503 |
noahho/CAAFE
Semi-automatic feature engineering process using Language Models and your... |
|
Emerging |
| 504 |
conceptofmind/LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding... |
|
Emerging |
| 505 |
Tzohar/PassLLM
World's most accurate password guessing AI tool. A PyTorch implementation of... |
|
Emerging |
| 506 |
kenhktsui/anyclassifier
One Line To Build Zero-Data Classifiers in Minutes |
|
Emerging |
| 507 |
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the... |
|
Emerging |
| 508 |
awslabs/mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020) |
|
Emerging |
| 509 |
mim-solutions/bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text... |
|
Emerging |
| 510 |
rxn4chemistry/rxn-onmt-models
Training of OpenNMT-based RXN models |
|
Emerging |
| 511 |
x-tabdeveloping/turftopic
Robust and fast topic models with sentence-transformers. |
|
Emerging |
| 512 |
Gleghorn-Lab/Protify
Low code molecular property prediction |
|
Emerging |
| 513 |
jobergum/browser-ml-inference
Edge Inference in Browser with Transformer NLP model |
|
Emerging |
| 514 |
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
|
Emerging |
| 515 |
lorenzorovida/FHE-BERT-Tiny
Source code for the paper "Transformer-based Language Models and Homomorphic... |
|
Emerging |
| 516 |
dorarad/gansformer
Generative Adversarial Transformers |
|
Emerging |
| 517 |
dusty-nv/NanoLLM
Optimized local inference for LLMs with HuggingFace-like APIs for... |
|
Emerging |
| 518 |
kyegomez/SimplifiedTransformers
SimplifiedTransformer simplifies transformer block without affecting... |
|
Emerging |
| 519 |
jackaduma/Recurrent-LLM
The open-source LLM implementation of paper: RecurrentGPT: Interactive... |
|
Emerging |
| 520 |
chuanyangjin/MMToM-QA
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind... |
|
Emerging |
| 521 |
geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language... |
|
Emerging |
| 522 |
monologg/KoBERT-Transformers
KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed) |
|
Emerging |
| 523 |
qcri/LLMeBench
Benchmarking Large Language Models |
|
Emerging |
| 524 |
vinjn/llm-metahuman
An open solution for AI-powered photorealistic digital humans. |
|
Emerging |
| 525 |
The-AI-Summer/self-attention-cv
Implementation of various self-attention mechanisms focused on computer... |
|
Emerging |
| 526 |
The-Swarm-Corporation/MedGuard
MedGuard is a robust, production-grade Python library that ensures HIPAA... |
|
Emerging |
| 527 |
back2matching/turboquant
First open-source TurboQuant KV cache compression for LLM inference. Drop-in... |
|
Emerging |
| 528 |
ycq091044/BIOT
BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models. |
|
Emerging |
| 529 |
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning |
|
Emerging |
| 530 |
soulteary/docker-llama2-chat
Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (... |
|
Emerging |
| 531 |
xusenlinzy/api-for-open-llm
Openai style api for open large language models, using LLMs just as chatgpt!... |
|
Emerging |
| 532 |
cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention... |
|
Emerging |
| 533 |
svdrecbd/mhc-mlx
MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by... |
|
Emerging |
| 534 |
ARM-software/keyword-transformer
Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769 |
|
Emerging |
| 535 |
r2d4/rellm
Exact structure out of any language model completion. |
|
Emerging |
| 536 |
mlabonne/llm-datasets
Curated list of datasets and tools for post-training. |
|
Emerging |
| 537 |
Zefan-Cai/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models |
|
Emerging |
| 538 |
bobazooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
|
Emerging |
| 539 |
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models |
|
Emerging |
| 540 |
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI |
|
Emerging |
| 541 |
jhkchan/translategemma-cli
Local CLI for Google's TranslateGemma translation models with multi-platform... |
|
Emerging |
| 542 |
davidpirogov/toon-llm
Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization... |
|
Emerging |
| 543 |
LM-Kit/lm-kit-net-samples
.NET samples for LM-Kit.NET |
|
Emerging |
| 544 |
showlab/Show-o
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer... |
|
Emerging |
| 545 |
jeya-maria-jose/TransWeather
Pytorch Code for the paper TransWeather - CVPR 2022 |
|
Emerging |
| 546 |
cztomsik/ava
All-in-one desktop app for running LLMs locally. |
|
Emerging |
| 547 |
AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow
Implementation for "Improving Language Understanding by Generative... |
|
Emerging |
| 548 |
sinanuozdemir/oreilly-llm-rl-alignment
This training offers an intensive exploration into the frontier of... |
|
Emerging |
| 549 |
leaderj1001/BottleneckTransformers
Bottleneck Transformers for Visual Recognition |
|
Emerging |
| 550 |
Uminosachi/open-llm-webui
This repository contains a web application designed to execute relatively... |
|
Emerging |
| 551 |
prrao87/tweet-stance-prediction
Applying NLP transfer learning techniques to predict Tweet stance toward a topic |
|
Emerging |
| 552 |
mirpo/fastapi-gen
Build LLM-enabled FastAPI applications without build configuration. |
|
Emerging |
| 553 |
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language... |
|
Emerging |
| 554 |
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V... |
|
Emerging |
| 555 |
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction... |
|
Emerging |
| 556 |
vectorch-ai/ScaleLLM
A high-performance inference system for large language models, designed for... |
|
Emerging |
| 557 |
The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the... |
|
Emerging |
| 558 |
Cardinal-Operations/ORLM
ORLM: Training Large Language Models for Optimization Modeling |
|
Emerging |
| 559 |
willyfh/graph-transformer
An unofficial implementation of Graph Transformer (Masked Label Prediction:... |
|
Emerging |
| 560 |
NVlabs/Eagle
Eagle: Frontier Vision-Language Models with Data-Centric Strategies |
|
Emerging |
| 561 |
kyegomez/MHMoE
Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch |
|
Emerging |
| 562 |
tigerchen52/query_level_uncertainty
query-level uncertainty in LLMs |
|
Emerging |
| 563 |
DaoD/INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large... |
|
Emerging |
| 564 |
jiwidi/Behavior-Sequence-Transformer-Pytorch
This is a pytorch implementation for the BST model from Alibaba... |
|
Emerging |
| 565 |
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using... |
|
Emerging |
| 566 |
locuslab/wanda
A simple and effective LLM pruning approach. |
|
Emerging |
| 567 |
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings) |
|
Emerging |
| 568 |
codewithdark-git/Building-LLMs-from-scratch
This repository guides you through the process of building a GPT-style Large... |
|
Emerging |
| 569 |
sagorbrur/bangla-bert
Bangla-Bert is a pretrained bert model for Bengali language |
|
Emerging |
| 570 |
Event-AHU/Medical_Image_Analysis
Foundation models based medical image analysis |
|
Emerging |
| 571 |
kyegomez/SingLoRA
This repository provides a minimal, single-file implementation of SingLoRA... |
|
Emerging |
| 572 |
DmitryNekrasov/ai-code-completion-idea-plugin
Implementation of IntelliJ IDEA code completion plugin using a local LLM. |
|
Emerging |
| 573 |
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调 |
|
Emerging |
| 574 |
kayoyin/transformer-slt
Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop) |
|
Emerging |
| 575 |
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization. |
|
Emerging |
| 576 |
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs... |
|
Emerging |
| 577 |
raymin0223/mixture_of_recursions
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive... |
|
Emerging |
| 578 |
fashn-AI/fashn-human-parser
Human parsing model for fashion and virtual try-on applications |
|
Emerging |
| 579 |
AviSoori1x/makeMoE
From scratch implementation of a sparse mixture of experts language model... |
|
Emerging |
| 580 |
xNul/chat-llama-discord-bot
A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other... |
|
Emerging |
| 581 |
chaitjo/learning-tsp
Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021) |
|
Emerging |
| 582 |
davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON
Segmentation deep learning ALgorithm based on MONai toolbox: single and... |
|
Emerging |
| 583 |
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA... |
|
Emerging |
| 584 |
JIA-Lab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral) |
|
Emerging |
| 585 |
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified... |
|
Emerging |
| 586 |
mit-han-lab/lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention |
|
Emerging |
| 587 |
FudanDISC/DISC-LawLLM
[中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language... |
|
Emerging |
| 588 |
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023) |
|
Emerging |
| 589 |
voidful/TFkit
🤖📇 handling multiple nlp task in one pipeline |
|
Emerging |
| 590 |
quantium-ai/research
Research experiments exploring uncommon quant techniques. |
|
Emerging |
| 591 |
j-min/VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021) |
|
Emerging |
| 592 |
MagedSaeed/generate-sequences
A python package made to generate sequences (greedy and beam-search) from... |
|
Emerging |
| 593 |
KRR-Oxford/HierarchyTransformers
Language Models as Hierarchy Encoders |
|
Emerging |
| 594 |
THU-SI/Spatial-MLLM
[NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM... |
|
Emerging |
| 595 |
KristiyanVachev/Leaf-Question-Generation
Easy to use and understand multiple-choice question generation algorithm... |
|
Emerging |
| 596 |
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation,... |
|
Emerging |
| 597 |
verifai/multiLLM
🚀 Invoke multiple large language models concurrently and the rank results.... |
|
Emerging |
| 598 |
bytedance/byteir
A model compilation solution for various hardware |
|
Emerging |
| 599 |
thu-nics/MoA
[CoLM'25] The official implementation of the paper |
|
Emerging |
| 600 |
palewire/first-llm-classifier
Learn how journalists use large-language models to organize and analyze... |
|
Emerging |