All Transformer Models
7,795 models ranked by quality score · Page 21 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2001 |
Ereboas/MagiCodec
A single-layer, streaming codec model providing SOTA audio quality and... |
|
Emerging |
| 2002 |
vlarine/transformers-ru
A list of pretrained Transformer models for the Russian language. |
|
Emerging |
| 2003 |
Yog-Sotho/LLM-fine-tuner
Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes.... |
|
Emerging |
| 2004 |
nsidn98/LLaMAR
Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics |
|
Emerging |
| 2005 |
surrey-nlp/NLP-2026
Labs for COM3029/COMM061 at University of Surrey |
|
Emerging |
| 2006 |
hitz-zentroa/whisper-lm
Add n-gram and large language model (LLM) support to Whisper models. |
|
Emerging |
| 2007 |
UIC-InDeXLab/RSR
An Efficient Matrix Multiplication Algorithm for Accelerating Inference in... |
|
Emerging |
| 2008 |
JayZhang42/SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large... |
|
Emerging |
| 2009 |
arrmansa/Basic-UI-for-GPT-Neo-with-low-vram
A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum) |
|
Emerging |
| 2010 |
achimoraites/machine-learning-playground
Having fun with ML |
|
Emerging |
| 2011 |
yzGuu830/efficient-speech-codec
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector... |
|
Emerging |
| 2012 |
Baran-phys/Tropical-Attention
[NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic... |
|
Emerging |
| 2013 |
asigalov61/Orchestrator
Local windowed attention multi-instrumental music transformer tailored for... |
|
Emerging |
| 2014 |
marcobombieri/do-LLM-dream-of-ontologies
Repository containing code and dataset of the paper "Do LLM Dream Of Ontologies?" |
|
Emerging |
| 2015 |
HUBioDataLab/SELFormer
SELFormer: Molecular Representation Learning via SELFIES Language Models |
|
Emerging |
| 2016 |
sichunluo/RecRanker
[TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for... |
|
Emerging |
| 2017 |
krnel-ai/krnel-graph
Lightweight representation engineering dataflow operations for agent developers. |
|
Emerging |
| 2018 |
turboline-ai/tsln-python
Time Series Lean Notation for python, it is designed to maximize the token... |
|
Emerging |
| 2019 |
OpenNLPLab/TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM |
|
Emerging |
| 2020 |
researchim-ai/models-at-home
training models at home |
|
Emerging |
| 2021 |
ShelbyJenkins/llm_utils
llm_utils: Basic LLM tools, best practices, and minimal abstraction. |
|
Emerging |
| 2022 |
robertvacareanu/llm4regression
Examining how large language models (LLMs) perform across various synthetic... |
|
Emerging |
| 2023 |
GURPREETKAURJETHRA/Perfect-LLM-Model-Finder
Perfect LLM Model Finder is a tool designed to simplify the overwhelming... |
|
Emerging |
| 2024 |
jackaduma/Alpaca-LoRA-RLHF-PyTorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer... |
|
Emerging |
| 2025 |
Beomi/exbert-transformers
exBERT on Transformers🤗 |
|
Emerging |
| 2026 |
deepmancer/vlm-toolbox
Vision-Language Models Toolbox: Your all-in-one solution for multimodal... |
|
Emerging |
| 2027 |
amazon-science/recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" |
|
Emerging |
| 2028 |
Yigtwxx/PredictaLM
PredictaLM is a lightweight Turkish language model designed for next-word... |
|
Emerging |
| 2029 |
declare-lab/Auto-Scaling
[Arxiv 2024] Official Implementation of the paper: "Towards Robust... |
|
Emerging |
| 2030 |
teelinsan/parallel-decoding
Repository of the paper "Accelerating Transformer Inference for Translation... |
|
Emerging |
| 2031 |
TheBrainLab/SGLFormer
Spiking Global-Local Fusion Transformer |
|
Emerging |
| 2032 |
moharamfatema/graduation-project
Video vision transformers for hierarchical anomaly detection in video scenes. |
|
Emerging |
| 2033 |
ngoanpv/llama2_vietnamese
A fine-tuned Large Language Model (LLM) for the Vietnamese language based on... |
|
Emerging |
| 2034 |
TIGER-AI-Lab/General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] |
|
Emerging |
| 2035 |
Akshint0407/Automated-Answer-Checker
AI-powered grading system for educators 🔹 Streamlit web app that automates... |
|
Emerging |
| 2036 |
he-h/rhythm
[NeurIPS 2025] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for... |
|
Emerging |
| 2037 |
THUDM/Multilingual-GLM
The multilingual variant of GLM, a general language model trained with... |
|
Emerging |
| 2038 |
JerryYLi/valhalla-nmt
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for... |
|
Emerging |
| 2039 |
bminixhofer/tokenkit
A toolkit implementing advanced methods to transfer models and model... |
|
Emerging |
| 2040 |
iamgmujtaba/llama3.2-webUI
LLaMa 3.2 Multimodal Web UI is a user-friendly interface for interacting... |
|
Emerging |
| 2041 |
RenzeLou/Muffin
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following |
|
Emerging |
| 2042 |
srvCodes/continual_learning_with_vit
Code for our CVPR 2022 workshop paper "Towards Exemplar-Free Continual... |
|
Emerging |
| 2043 |
InternRobotics/PointLLM
[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large... |
|
Emerging |
| 2044 |
DEV-D-GR8/SignSense
This repository contains a transformer-based model for real-time American... |
|
Emerging |
| 2045 |
xf-zhao/LoT
Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought... |
|
Emerging |
| 2046 |
Tanveer81/ReVisionLLM
This is the official implementation of ReVisionLLM: Recursive... |
|
Emerging |
| 2047 |
zjunlp/ModelKinship
Exploring Model Kinship for Merging Large Language Models |
|
Emerging |
| 2048 |
OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat... |
|
Emerging |
| 2049 |
NVlabs/NFT
Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging... |
|
Emerging |
| 2050 |
Bruce-Lee-LY/decoding_attention
Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using... |
|
Emerging |
| 2051 |
ai8hyf/llm_split_recall_test
Split and Recall: A simple and efficient benchmark to evaluate in-context... |
|
Emerging |
| 2052 |
nlp-with-transformers/website
Website for the Natural Language Processing with Transformers book |
|
Emerging |
| 2053 |
AIFEG/BenchLMM
[ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large... |
|
Emerging |
| 2054 |
hemangjoshi37a/hjAlgos
AI based algorithmic trading platform for zerodha users |
|
Emerging |
| 2055 |
thushv89/packt_nlp_tensorflow_2
This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2) |
|
Emerging |
| 2056 |
gsarti/t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP |
|
Emerging |
| 2057 |
Wang-ML-Lab/llm-continual-learning-survey
[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey |
|
Emerging |
| 2058 |
leftmove/cria
Run LLMs locally with as little friction as possible. |
|
Emerging |
| 2059 |
GeeeekExplorer/transformers-patch
patches for huggingface transformers to save memory |
|
Emerging |
| 2060 |
senadkurtisi/pytorch-image-captioning
Transformer & CNN Image Captioning model in PyTorch. |
|
Emerging |
| 2061 |
nlpodyssey/gotokenizers
Go implementation of today's most used tokenizers |
|
Emerging |
| 2062 |
BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling
Repository hosting code to reproduce our paper (with Stanford and... |
|
Emerging |
| 2063 |
BoHuangLab/Protein-Localization-Transformer
Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein... |
|
Emerging |
| 2064 |
deep-diver/segformer-tf-transformers
This repository demonstrates how to use TensorFlow based SegFormer model in... |
|
Emerging |
| 2065 |
raghavagps/pptstab
PPTStab: Designing of thermostable proteins with a desired melting temperature |
|
Emerging |
| 2066 |
opendatalab/UrBench
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A... |
|
Emerging |
| 2067 |
vicuna-tools/vicuna-installation-guide
The "vicuna-installation-guide" provides step-by-step instructions for... |
|
Emerging |
| 2068 |
GURPREETKAURJETHRA/PaliGemma-Inference-and-Fine-Tuning
PaliGemma Inference and Fine Tuning |
|
Emerging |
| 2069 |
calpt/awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient... |
|
Emerging |
| 2070 |
fattorib/fusedswiglu
Fused SwiGLU Triton kernels |
|
Emerging |
| 2071 |
umbertocappellazzo/Llama-AVSR
Official Pytorch implementation of "Large Language Models are Strong... |
|
Emerging |
| 2072 |
UCSC-REAL/TokenCleaning
[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained... |
|
Emerging |
| 2073 |
nipunsadvilkar/roberta-base-mr
RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x ... |
|
Emerging |
| 2074 |
maxi-w/llama2-chat-interface
Gradio Chat Interface for Llama 2 |
|
Emerging |
| 2075 |
worldbank/LLMs-Practical-Guide
A practical introduction to Generative AI and LLMs, equipping professionals... |
|
Emerging |
| 2076 |
HacktivSpace/multidisciplinary-deepfake-detection
A solution for deepfake detection across multiple modalities, including... |
|
Emerging |
| 2077 |
tgautam03/Transformers
A Gentle Introduction to Transformers Neural Network |
|
Emerging |
| 2078 |
xmindflow/MSA-2Net
[BMVC 2024] Official repository of the paper titled "MSA^2 Net: Multi-scale... |
|
Emerging |
| 2079 |
saddam213/LLamaStack
ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp |
|
Emerging |
| 2080 |
ziqipang/RandAR
[CVPR 2025 (Oral)] Open implementation of "RandAR" |
|
Emerging |
| 2081 |
ziqipang/LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are... |
|
Emerging |
| 2082 |
sam575/axial-gan
Code for "Simultaneous Face Hallucination and Translation for Thermal to... |
|
Emerging |
| 2083 |
thongnt99/learned-sparse-retrieval
Unified Learned Sparse Retrieval Framework |
|
Emerging |
| 2084 |
zjunlp/NLPCC2024_RegulatingLLM
[NLPCC 2024] Shared Task 10: Regulating Large Language Models |
|
Emerging |
| 2085 |
FareedKhan-dev/gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python |
|
Emerging |
| 2086 |
akjindal53244/Arithmo
Small and Efficient Mathematical Reasoning LLMs |
|
Emerging |
| 2087 |
declare-lab/CICERO
The purpose of this repository is to introduce new dialogue-level... |
|
Emerging |
| 2088 |
AI4LIFE-GROUP/LLM_Explainer
Code for paper: Are Large Language Models Post Hoc Explainers? |
|
Emerging |
| 2089 |
Wangbiao2/R1-Track
R1-Track: Direct Application of MLLMs to Visual Object Tracking via... |
|
Emerging |
| 2090 |
qizhou000/UniEdit
[NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for... |
|
Emerging |
| 2091 |
zhchen18/ToMBench
ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024. |
|
Emerging |
| 2092 |
BatsResearch/planetarium
Dataset and benchmark for assessing LLMs in translating natural language... |
|
Emerging |
| 2093 |
gustavecortal/gpt-j-fine-tuning-example
Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression |
|
Emerging |
| 2094 |
otto-de/TRON
⚡️ Implementation of TRON: Transformer Recommender using Optimized... |
|
Emerging |
| 2095 |
yyDing1/ScaleQuest
[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective... |
|
Emerging |
| 2096 |
linydub/azureml-greenai-txtsum
Samples for fine-tuning HuggingFace models with AzureML |
|
Emerging |
| 2097 |
SkywalkerLuke/TransHLA
TransHLA: A hybrid transformer model for peptide-HLA epitope detection. |
|
Emerging |
| 2098 |
aj-naik/Text-Summarization
Abstractive and Extractive Text summarization using Transformers. |
|
Emerging |
| 2099 |
XavierZXY/Zero2Hero
从0到1学习大模型 |
|
Emerging |
| 2100 |
viralcode/superGPT
Train your own LLM from scratch |
|
Emerging |