All Transformer Models
7,795 models ranked by quality score · Page 13 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 1201 |
ai4co/parco
[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization |
|
Emerging |
| 1202 |
hongyehu/Machine_Learning_Quantum_State_Tomography
An **unofficial** pytorch implementation of using generative models to do... |
|
Emerging |
| 1203 |
cdli-gh/Semi-Supervised-NMT-for-Sumerian-English
Exploring the Limits of Low-Resource Neural Machine Translation |
|
Emerging |
| 1204 |
asigalov61/Allegro-Music-Transformer
Full-attention multi-instrumental music transformer featuring asymmetrical... |
|
Emerging |
| 1205 |
clabrugere/scratch-llm
Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch,... |
|
Emerging |
| 1206 |
SamsungSAILMontreal/ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse... |
|
Emerging |
| 1207 |
amazon-science/unified-ept
A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021 |
|
Emerging |
| 1208 |
PediaMedAI/AggPose
[IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation... |
|
Emerging |
| 1209 |
lin-tan/clm
For our ICSE23 paper "Impact of Code Language Models on Automated Program... |
|
Emerging |
| 1210 |
muhtalhakhan/Hacktoberfest2024
Hacktoberfest 2024 🧑🏻💻 OPEN FIRST Pull Request 🎉 |
|
Emerging |
| 1211 |
VPGTrans/VPGTrans
Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA,... |
|
Emerging |
| 1212 |
JackZeng0208/llama.cpp-android-tutorial
llama.cpp tutorial on Android phone |
|
Emerging |
| 1213 |
bodeby/torchstack
🫧 probability-level model ensembling for transformers |
|
Emerging |
| 1214 |
developer239/llama.cpp-ts
llama.cpp 🦙 LLM inference in TypeScript |
|
Emerging |
| 1215 |
GAIR-NLP/ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality... |
|
Emerging |
| 1216 |
l294265421/alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)... |
|
Emerging |
| 1217 |
skit-ai/SpeechLLM
This repository contains the training, inference, evaluation code for... |
|
Emerging |
| 1218 |
amitkedia007/Financial-Fraud-Detection-Using-LLMs
The aim of this dissertation is to assess the effectiveness of LLMs such as ... |
|
Emerging |
| 1219 |
luuyin/OWL
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity... |
|
Emerging |
| 1220 |
yuanzhoulvpi2017/DocumentSearch
基于sentence transformers和chatglm实现的文档搜索工具 |
|
Emerging |
| 1221 |
vmicheli/delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024 |
|
Emerging |
| 1222 |
IAAR-Shanghai/Grimoire
Grimoire is All You Need for Enhancing Large Language Models |
|
Emerging |
| 1223 |
aJupyter/ThinkLLM
ThinkLLM:🚀 轻量、高效的大语言模型算法实现 |
|
Emerging |
| 1224 |
trrahul/llama2.cs
Inference Llama 2 in one file of pure C# |
|
Emerging |
| 1225 |
ShiZhengyan/DePT
[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed... |
|
Emerging |
| 1226 |
wuwangzhang1216/prometheus
Fully automatic censorship removal for language models. LoRA abliteration +... |
|
Emerging |
| 1227 |
OpenSparseLLMs/LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of... |
|
Emerging |
| 1228 |
ukairia777/pytorch-nlp-tutorial
pytorch를 사용하여 텍스트 전처리부터 RAG, 에이전트, LLM 파인튜닝을 정리한 Deep Learning NLP 저장소입니다. |
|
Emerging |
| 1229 |
architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction
Exploring the potential of fine-tuning Large Language Models (LLMs) like... |
|
Emerging |
| 1230 |
molbal/llm-text-completion-finetune
Guide on text completion large language model fine-tuning, including example... |
|
Emerging |
| 1231 |
prajjwal1/fluence
A deep learning library based on Pytorch focussed on low resource language... |
|
Emerging |
| 1232 |
JulesBelveze/bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ |
|
Emerging |
| 1233 |
kingabzpro/using-llama3-locally
Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface. |
|
Emerging |
| 1234 |
TrustedLLM/LLMDet
LLMDet is a text detection tool that can identify which generated sources... |
|
Emerging |
| 1235 |
rasbt/blog-finetuning-llama-adapters
Supplementary material for "Understanding Parameter-Efficient Finetuning of... |
|
Emerging |
| 1236 |
Bindwell/PLAPT
Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding... |
|
Emerging |
| 1237 |
jonrbates/turing
A PyTorch library for simulating Turing machines with neural networks, based... |
|
Emerging |
| 1238 |
eduard23144/locoformer
🤖 Explore LocoFormer, a Transformer-XL model that enhances robot locomotion... |
|
Emerging |
| 1239 |
Traffic-Alpha/LLM-Assisted-Light
This repository contains the code for the paper "LLM-Assisted Light:... |
|
Emerging |
| 1240 |
hans00/react-native-transformers-example
Example of transformers.js on React Native |
|
Emerging |
| 1241 |
Omid-Nejati/BEFUnet
A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation |
|
Emerging |
| 1242 |
ziplab/LIT
[AAAI 2022] This is the official PyTorch implementation of "Less is More:... |
|
Emerging |
| 1243 |
chenmozhijin/BSRoformer.cpp
GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation... |
|
Emerging |
| 1244 |
ShuntaroOkuma/adapt-gauge-core
Measure LLM adaptation efficiency — how fast models learn from few examples |
|
Emerging |
| 1245 |
MetaGLM/FinGLM
FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。 |
|
Emerging |
| 1246 |
ECNU-ICALK/EduChat
An open-source educational chat model from ICALK, East China Normal... |
|
Emerging |
| 1247 |
neulab/knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling... |
|
Emerging |
| 1248 |
yang-ai-lab/SleepLM
SleepLM: Natural-Language Intelligence for Human Sleep |
|
Emerging |
| 1249 |
zyds/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube |
|
Emerging |
| 1250 |
YadaYuki/transformer-from-scratch
Transformer from scratch 🙊 (English to Japanese Translator by PyTorch) |
|
Emerging |
| 1251 |
saqib1707/gpt2-from-scratch
PyTorch Implementation of GPT-2 |
|
Emerging |
| 1252 |
infocusp/llm_seminar_series
Material for the series of seminars on Large Language Models |
|
Emerging |
| 1253 |
metriccoders/one-line-llm-tuner
This repository is the source code for fine tuning any LLM in just one line 🔥 |
|
Emerging |
| 1254 |
zejia-lin/BulletServe
Boosting GPU utilization for LLM serving via dynamic spatial-temporal... |
|
Emerging |
| 1255 |
geobrain-ai/geogalactica
Code and datasets for paper "GeoGalactica: A Scientific Large Language Model... |
|
Emerging |
| 1256 |
stanleylsx/llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、D... |
|
Emerging |
| 1257 |
xjywhu/Awesome-Multimodal-LLM-for-Code
Multimodal Large Language Models for Code Generation under Multimodal Scenarios |
|
Emerging |
| 1258 |
jha-lab/acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers |
|
Emerging |
| 1259 |
ChristophReich1996/Swin-Transformer-V2
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up... |
|
Emerging |
| 1260 |
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 |
|
Emerging |
| 1261 |
VITA-Group/Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank... |
|
Emerging |
| 1262 |
aniketmaurya/llm-inference
Large Language Model (LLM) Inference API and Chatbot |
|
Emerging |
| 1263 |
monologg/KoBigBird
🦅 Pretrained BigBird Model for Korean (up to 4096 tokens) |
|
Emerging |
| 1264 |
samestrin/llm-pdf-ocr-api
A Python-based REST API for PDF OCR using AI models with PyTorch and... |
|
Emerging |
| 1265 |
AllenXiangX/SnowflakeNet
(TPAMI 2023) Snowflake Point Deconvolution for Point Cloud Completion and... |
|
Emerging |
| 1266 |
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer |
|
Emerging |
| 1267 |
DannyArends/DLLM
A minimal, clean D language interface for running LLM inference using... |
|
Emerging |
| 1268 |
AlexandrosChrtn/llama-fine-tune-guide
Fine-tune the newly released Llama-3.2 lightweight models. |
|
Emerging |
| 1269 |
iverly/llamafile-docker
Distribute and run llamafile/LLMs with a single docker image. |
|
Emerging |
| 1270 |
rednote-hilab/dots.llm1
The official repository of the dots.llm1 base and instruct models proposed... |
|
Emerging |
| 1271 |
google-deepmind/gemma_penzai
A JAX Research Toolkit for Visualizing, Manipulating, and Understanding... |
|
Emerging |
| 1272 |
vicgalle/zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions |
|
Emerging |
| 1273 |
git-cloner/llama-lora-fine-tuning
llama fine-tuning with lora |
|
Emerging |
| 1274 |
RedHatResearch/conext24-NetConfEval
Benchmark for evaluating LLMs in network configuration problems. |
|
Emerging |
| 1275 |
ymcui/Chinese-Mixtral
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs) |
|
Emerging |
| 1276 |
takashiishida/paper2slides
Transform any arXiv papers into slides using LLMs |
|
Emerging |
| 1277 |
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] |
|
Emerging |
| 1278 |
gusye1234/llm-as-function
Embed your LLM into a python function |
|
Emerging |
| 1279 |
sanjibnarzary/awesome-llm
Curated list of open source and openly accessible large language models |
|
Emerging |
| 1280 |
modelscope/mcore-bridge
MCore-Bridge: Providing Megatron-Core model definitions for state-of-the-art... |
|
Emerging |
| 1281 |
Sachithx/EntroPE
This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder... |
|
Emerging |
| 1282 |
softengg-manoj/dreamer4
🌟 Implement Dreamer 4 for training agents within scalable world models,... |
|
Emerging |
| 1283 |
sisinflab/Ducho
Ducho is a Python framework aimed to extract multimodal features used in... |
|
Emerging |
| 1284 |
abaheti95/LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving... |
|
Emerging |
| 1285 |
harishdeivanayagam/rowfill
Open-source spreadsheets platform for deep research and document processing |
|
Emerging |
| 1286 |
johnmai-dev/NotebookMLX
📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) |
|
Emerging |
| 1287 |
JKevin17/TM-LLM
The official code for "(ISCC 2025) Network Traffic Matrix Imputation via... |
|
Emerging |
| 1288 |
sodascience/workshop_llm_data_collection
This repository contains the code and slides for our workshop on data... |
|
Emerging |
| 1289 |
KolosalAI/kolosal-cli
Super lightweight Ollama + Qwen Code alternative to run Llama 3.3,... |
|
Emerging |
| 1290 |
YuweiYin/FinPT
FinPT: Financial Risk Prediction with Profile Tuning on Pretrained Foundation Models |
|
Emerging |
| 1291 |
tpoisonooo/llama.onnx
LLaMa/RWKV onnx models, quantization and testcase |
|
Emerging |
| 1292 |
prismformore/Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting... |
|
Emerging |
| 1293 |
bigcode-project/selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation |
|
Emerging |
| 1294 |
upb-lea/mag-net-hub
MagNet Toolkit - Certified Models of the MagNet Challenge |
|
Emerging |
| 1295 |
sunnynguyen-ai/llm-attention-visualizer
Interactive tool for analyzing attention patterns in transformer models with... |
|
Emerging |
| 1296 |
nlpodyssey/cybertron
Cybertron: the home planet of the Transformers in Go |
|
Emerging |
| 1297 |
xNul/code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local... |
|
Emerging |
| 1298 |
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind:... |
|
Emerging |
| 1299 |
swordlidev/Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey |
|
Emerging |
| 1300 |
zyushun/Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More... |
|
Emerging |