All Transformer Models

7,795 models ranked by quality score · Page 21 of 78

Showing 2001–2100 of 7,795
# Model Score Tier
2001 Ereboas/MagiCodec

A single-layer, streaming codec model providing SOTA audio quality and...

35
Emerging
2002 vlarine/transformers-ru

A list of pretrained Transformer models for the Russian language.

35
Emerging
2003 Yog-Sotho/LLM-fine-tuner

Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes....

35
Emerging
2004 nsidn98/LLaMAR

Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics

35
Emerging
2005 surrey-nlp/NLP-2026

Labs for COM3029/COMM061 at University of Surrey

35
Emerging
2006 hitz-zentroa/whisper-lm

Add n-gram and large language model (LLM) support to Whisper models.

35
Emerging
2007 UIC-InDeXLab/RSR

An Efficient Matrix Multiplication Algorithm for Accelerating Inference in...

35
Emerging
2008 JayZhang42/SLED

SLED: Self Logits Evolution Decoding for Improving Factuality in Large...

35
Emerging
2009 arrmansa/Basic-UI-for-GPT-Neo-with-low-vram

A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)

35
Emerging
2010 achimoraites/machine-learning-playground

Having fun with ML

35
Emerging
2011 yzGuu830/efficient-speech-codec

[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector...

35
Emerging
2012 Baran-phys/Tropical-Attention

[NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic...

35
Emerging
2013 asigalov61/Orchestrator

Local windowed attention multi-instrumental music transformer tailored for...

35
Emerging
2014 marcobombieri/do-LLM-dream-of-ontologies

Repository containing code and dataset of the paper "Do LLM Dream Of Ontologies?"

35
Emerging
2015 HUBioDataLab/SELFormer

SELFormer: Molecular Representation Learning via SELFIES Language Models

35
Emerging
2016 sichunluo/RecRanker

[TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for...

35
Emerging
2017 krnel-ai/krnel-graph

Lightweight representation engineering dataflow operations for agent developers.

35
Emerging
2018 turboline-ai/tsln-python

Time Series Lean Notation for python, it is designed to maximize the token...

35
Emerging
2019 OpenNLPLab/TransnormerLLM

Official implementation of TransNormerLLM: A Faster and Better LLM

35
Emerging
2020 researchim-ai/models-at-home

training models at home

35
Emerging
2021 ShelbyJenkins/llm_utils

llm_utils: Basic LLM tools, best practices, and minimal abstraction.

35
Emerging
2022 robertvacareanu/llm4regression

Examining how large language models (LLMs) perform across various synthetic...

35
Emerging
2023 GURPREETKAURJETHRA/Perfect-LLM-Model-Finder

Perfect LLM Model Finder is a tool designed to simplify the overwhelming...

35
Emerging
2024 jackaduma/Alpaca-LoRA-RLHF-PyTorch

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer...

35
Emerging
2025 Beomi/exbert-transformers

exBERT on Transformers🤗

35
Emerging
2026 deepmancer/vlm-toolbox

Vision-Language Models Toolbox: Your all-in-one solution for multimodal...

35
Emerging
2027 amazon-science/recode

Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"

35
Emerging
2028 Yigtwxx/PredictaLM

PredictaLM is a lightweight Turkish language model designed for next-word...

35
Emerging
2029 declare-lab/Auto-Scaling

[Arxiv 2024] Official Implementation of the paper: "Towards Robust...

35
Emerging
2030 teelinsan/parallel-decoding

Repository of the paper "Accelerating Transformer Inference for Translation...

35
Emerging
2031 TheBrainLab/SGLFormer

Spiking Global-Local Fusion Transformer

35
Emerging
2032 moharamfatema/graduation-project

Video vision transformers for hierarchical anomaly detection in video scenes.

35
Emerging
2033 ngoanpv/llama2_vietnamese

A fine-tuned Large Language Model (LLM) for the Vietnamese language based on...

35
Emerging
2034 TIGER-AI-Lab/General-Reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]

35
Emerging
2035 Akshint0407/Automated-Answer-Checker

AI-powered grading system for educators 🔹 Streamlit web app that automates...

35
Emerging
2036 he-h/rhythm

[NeurIPS 2025] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for...

35
Emerging
2037 THUDM/Multilingual-GLM

The multilingual variant of GLM, a general language model trained with...

35
Emerging
2038 JerryYLi/valhalla-nmt

Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for...

35
Emerging
2039 bminixhofer/tokenkit

A toolkit implementing advanced methods to transfer models and model...

35
Emerging
2040 iamgmujtaba/llama3.2-webUI

LLaMa 3.2 Multimodal Web UI is a user-friendly interface for interacting...

35
Emerging
2041 RenzeLou/Muffin

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

35
Emerging
2042 srvCodes/continual_learning_with_vit

Code for our CVPR 2022 workshop paper "Towards Exemplar-Free Continual...

35
Emerging
2043 InternRobotics/PointLLM

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large...

35
Emerging
2044 DEV-D-GR8/SignSense

This repository contains a transformer-based model for real-time American...

35
Emerging
2045 xf-zhao/LoT

Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought...

35
Emerging
2046 Tanveer81/ReVisionLLM

This is the official implementation of ReVisionLLM: Recursive...

35
Emerging
2047 zjunlp/ModelKinship

Exploring Model Kinship for Merging Large Language Models

35
Emerging
2048 OpenBMB/VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat...

35
Emerging
2049 NVlabs/NFT

Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging...

35
Emerging
2050 Bruce-Lee-LY/decoding_attention

Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using...

35
Emerging
2051 ai8hyf/llm_split_recall_test

Split and Recall: A simple and efficient benchmark to evaluate in-context...

35
Emerging
2052 nlp-with-transformers/website

Website for the Natural Language Processing with Transformers book

35
Emerging
2053 AIFEG/BenchLMM

[ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large...

35
Emerging
2054 hemangjoshi37a/hjAlgos

AI based algorithmic trading platform for zerodha users

35
Emerging
2055 thushv89/packt_nlp_tensorflow_2

This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)

35
Emerging
2056 gsarti/t5-flax-gcp

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

35
Emerging
2057 Wang-ML-Lab/llm-continual-learning-survey

[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey

35
Emerging
2058 leftmove/cria

Run LLMs locally with as little friction as possible.

35
Emerging
2059 GeeeekExplorer/transformers-patch

patches for huggingface transformers to save memory

35
Emerging
2060 senadkurtisi/pytorch-image-captioning

Transformer & CNN Image Captioning model in PyTorch.

35
Emerging
2061 nlpodyssey/gotokenizers

Go implementation of today's most used tokenizers

35
Emerging
2062 BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling

Repository hosting code to reproduce our paper (with Stanford and...

35
Emerging
2063 BoHuangLab/Protein-Localization-Transformer

Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein...

35
Emerging
2064 deep-diver/segformer-tf-transformers

This repository demonstrates how to use TensorFlow based SegFormer model in...

35
Emerging
2065 raghavagps/pptstab

PPTStab: Designing of thermostable proteins with a desired melting temperature

35
Emerging
2066 opendatalab/UrBench

[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A...

35
Emerging
2067 vicuna-tools/vicuna-installation-guide

The "vicuna-installation-guide" provides step-by-step instructions for...

35
Emerging
2068 GURPREETKAURJETHRA/PaliGemma-Inference-and-Fine-Tuning

PaliGemma Inference and Fine Tuning

35
Emerging
2069 calpt/awesome-adapter-resources

Collection of Tools and Papers related to Adapters / Parameter-Efficient...

35
Emerging
2070 fattorib/fusedswiglu

Fused SwiGLU Triton kernels

35
Emerging
2071 umbertocappellazzo/Llama-AVSR

Official Pytorch implementation of "Large Language Models are Strong...

35
Emerging
2072 UCSC-REAL/TokenCleaning

[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained...

35
Emerging
2073 nipunsadvilkar/roberta-base-mr

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x ...

35
Emerging
2074 maxi-w/llama2-chat-interface

Gradio Chat Interface for Llama 2

35
Emerging
2075 worldbank/LLMs-Practical-Guide

A practical introduction to Generative AI and LLMs, equipping professionals...

35
Emerging
2076 HacktivSpace/multidisciplinary-deepfake-detection

A solution for deepfake detection across multiple modalities, including...

35
Emerging
2077 tgautam03/Transformers

A Gentle Introduction to Transformers Neural Network

35
Emerging
2078 xmindflow/MSA-2Net

[BMVC 2024] Official repository of the paper titled "MSA^2 Net: Multi-scale...

35
Emerging
2079 saddam213/LLamaStack

ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp

35
Emerging
2080 ziqipang/RandAR

[CVPR 2025 (Oral)] Open implementation of "RandAR"

35
Emerging
2081 ziqipang/LM4VisualEncoding

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are...

35
Emerging
2082 sam575/axial-gan

Code for "Simultaneous Face Hallucination and Translation for Thermal to...

35
Emerging
2083 thongnt99/learned-sparse-retrieval

Unified Learned Sparse Retrieval Framework

35
Emerging
2084 zjunlp/NLPCC2024_RegulatingLLM

[NLPCC 2024] Shared Task 10: Regulating Large Language Models

35
Emerging
2085 FareedKhan-dev/gpt4o-from-scratch

Implementation of a GPT-4o like Multimodal from Scratch using Python

35
Emerging
2086 akjindal53244/Arithmo

Small and Efficient Mathematical Reasoning LLMs

35
Emerging
2087 declare-lab/CICERO

The purpose of this repository is to introduce new dialogue-level...

35
Emerging
2088 AI4LIFE-GROUP/LLM_Explainer

Code for paper: Are Large Language Models Post Hoc Explainers?

35
Emerging
2089 Wangbiao2/R1-Track

R1-Track: Direct Application of MLLMs to Visual Object Tracking via...

35
Emerging
2090 qizhou000/UniEdit

[NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for...

35
Emerging
2091 zhchen18/ToMBench

ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.

35
Emerging
2092 BatsResearch/planetarium

Dataset and benchmark for assessing LLMs in translating natural language...

35
Emerging
2093 gustavecortal/gpt-j-fine-tuning-example

Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression

35
Emerging
2094 otto-de/TRON

⚡️ Implementation of TRON: Transformer Recommender using Optimized...

35
Emerging
2095 yyDing1/ScaleQuest

[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective...

35
Emerging
2096 linydub/azureml-greenai-txtsum

Samples for fine-tuning HuggingFace models with AzureML

35
Emerging
2097 SkywalkerLuke/TransHLA

TransHLA: A hybrid transformer model for peptide-HLA epitope detection.

35
Emerging
2098 aj-naik/Text-Summarization

Abstractive and Extractive Text summarization using Transformers.

35
Emerging
2099 XavierZXY/Zero2Hero

从0到1学习大模型

35
Emerging
2100 viralcode/superGPT

Train your own LLM from scratch

35
Emerging
« Prev 1 2 3 19 20 21 22 23 76 77 78 Next »