All Transformer Models
7,795 models ranked by quality score · Page 11 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 1001 | asigalov61/SuperPiano<br>Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for... | | Emerging |
| 1002 | johnmai-dev/ChatMLX<br>🤖✨ChatMLX is a modern, open-source, high-performance chat application for... | | Emerging |
| 1003 | softmax1/Flash-Attention-Softmax-N<br>CUDA and Triton implementations of Flash Attention with SoftmaxN. | | Emerging |
| 1004 | ashleykleynhans/text-generation-docker<br>Docker image for the Text Generation Web UI: A Gradio web UI for Large... | | Emerging |
| 1005 | RobertCsordas/transformer_generalization<br>The official repository for our paper "The Devil is in the Detail: Simple... | | Emerging |
| 1006 | harleyszhang/llm_note<br>LLM notes, including model inference, transformer model structure, and llm... | | Emerging |
| 1007 | TextGeneratorio/text-generator.io<br>Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io | | Emerging |
| 1008 | shushanxingzhe/transformers_ner<br>Add CRF or LSTM+CRF for huggingface transformers bert to perform better on... | | Emerging |
| 1009 | Gunale0926/SORSA<br>SORSA: Singular Values and Orthonormal Regularized Singular Vectors... | | Emerging |
| 1010 | a-tokyo/ai-zero-shot-classifier<br>🧠 leverage advanced AI embeddings to perform multilingual zero-shot text... | | Emerging |
| 1011 | ahmetkumass/yolo-gen<br>Train YOLO + VLM with one command. Auto-generate vision-language training... | | Emerging |
| 1012 | ariannamethod/nanollama<br>Train Llama 3 models from scratch. Any scale, any personality. By Arianna Method. | | Emerging |
| 1013 | xmindflow/Awesome-Transformer-in-Medical-Imaging<br>[MedIA Journal] An ultimately comprehensive paper list of Vision... | | Emerging |
| 1014 | sinanuozdemir/oreilly-ai-pipelines<br>Designing and Deploying LLM Pipelines | | Emerging |
| 1015 | bilibili/Index-1.9B<br>A lightweight multilingual LLM | | Emerging |
| 1016 | monologg/GoEmotions-Korean<br>Korean version of GoEmotions Dataset 😍😢😱 | | Emerging |
| 1017 | SensAI-PT/LLaMa2lang<br>Convenience scripts to finetune (chat-)LLaMa3 and other models for any language | | Emerging |
| 1018 | xyjigsaw/LLM-Pretrain-SFT<br>Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed) | | Emerging |
| 1019 | monologg/DistilKoBERT<br>Distillation of KoBERT from SKTBrain (Lightweight KoBERT) | | Emerging |
| 1020 | HKUDS/OpenGraph<br>[EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models" | | Emerging |
| 1021 | HyperCluster-Tech/manimator<br>Transform research papers and mathematical concepts into stunning visual... | | Emerging |
| 1022 | josStorer/selfhostedAI<br>A collection of one-click self-hosted AI | | Emerging |
| 1023 | Rishit-dagli/Conformer<br>An implementation of Conformer: Convolution-augmented Transformer for Speech... | | Emerging |
| 1024 | tatsu-lab/alpaca_farm<br>A simulation framework for RLHF and alternatives. Develop your RLHF method... | | Emerging |
| 1025 | gitctrlx/llama.go<br>Llama from scratch in Go. | | Emerging |
| 1026 | menon92/BangalASR<br>Transformer based Bangla Speech Recognition \| Encoder Decoder Architecture | | Emerging |
| 1027 | ai-forever/mgpt<br>Multilingual Generative Pretrained Model | | Emerging |
| 1028 | amirfeder/CausaLM<br>CausaLM: Causal Model Explanation Through Counterfactual Language Models | | Emerging |
| 1029 | NVlabs/GroupViT<br>Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges... | | Emerging |
| 1030 | mojivalipour/symbolicgpt<br>Symbolic regression is the task of identifying a mathematical expression... | | Emerging |
| 1031 | oxpig/CaLM<br>Protein language model trained on coding DNA | | Emerging |
| 1032 | grctest/FastAPI-BitNet<br>Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker. | | Emerging |
| 1033 | aimclub/FEDOT.LLM<br>LLM-based prototype for nexgen AutoML | | Emerging |
| 1034 | LLukas22/llm-rs-python<br>Unofficial python bindings for the rust llm library. 🐍❤️🦀 | | Emerging |
| 1035 | mmaaz60/EdgeNeXt<br>[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently... | | Emerging |
| 1036 | CAMeL-Lab/CAMeLBERT<br>Code and models for "The Interplay of Variant, Size, and Task Type in Arabic... | | Emerging |
| 1037 | garyb9/twitter-llm-bot<br>Fully automatic asynchronous AI operated Twitter bot using Large Language... | | Emerging |
| 1038 | therealoliver/Deepdive-llama3-from-scratch<br>Achieve the llama3 inference step-by-step, grasp the core concepts, master... | | Emerging |
| 1039 | thuml/Flowformer<br>About Code release for "Flowformer: Linearizing Transformers with... | | Emerging |
| 1040 | LowinLi/fastgpt<br>⚡ boost inference speed of GPT models in transformers by onnxruntime | | Emerging |
| 1041 | sedthh/BeatLearning<br>Open Source Generative AI Models for Automatic Rhythm Game Beatmap... | | Emerging |
| 1042 | ayaka14732/llama-2-jax<br>JAX implementation of the Llama 2 model | | Emerging |
| 1043 | biodatlab/thonburian-whisper<br>Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo... | | Emerging |
| 1044 | dmanuel64/codablellm<br>A framework for creating and curating high-quality code datasets tailored... | | Emerging |
| 1045 | hyperonym/basaran<br>Basaran is an open-source alternative to the OpenAI text completion API. It... | | Emerging |
| 1046 | jankais3r/LLaMA_MPS<br>Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs. | | Emerging |
| 1047 | woodRock/fishy-business<br>Machine Learning for Rapid Evaporative Ionization Mass Spectrometry for... | | Emerging |
| 1048 | NVIDIA/Cosmos-Tokenizer<br>A suite of image and video neural tokenizers | | Emerging |
| 1049 | sangmichaelxie/doremi<br>Pytorch implementation of DoReMi, a method for optimizing the data mixture... | | Emerging |
| 1050 | gotzmann/llama.go<br>llama.go is like llama.cpp in pure Golang! | | Emerging |
| 1051 | declare-lab/flan-alpaca<br>This repository contains code for extending the Stanford Alpaca synthetic... | | Emerging |
| 1052 | LISA-ITMO/LLM-resume-moderator<br>Automates moderation of Russian-language resumes using an LLM. For... | | Emerging |
| 1053 | ZinYY/Online_RLHF<br>A PyTorch implementation of the paper "Provably Efficient Online RLHF with... | | Emerging |
| 1054 | sayakpaul/robustness-vit<br>Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022). | | Emerging |
| 1055 | waltonfuture/Diabetica<br>[SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks | | Emerging |
| 1056 | Dicklesworthstone/llm_introspective_compression_and_metacognition<br>A novel approach for transformer model introspection that enables saving,... | | Emerging |
| 1057 | ZO-Bench/ZO-LLM<br>[ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization... | | Emerging |
| 1058 | zjunlp/KnowledgeCircuits<br>[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers | | Emerging |
| 1059 | liuqidong07/LLM-ESR<br>[NeurIPS'24 Spotlight] The official implementation code of LLM-ESR. | | Emerging |
| 1060 | efeslab/fiddler<br>[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration | | Emerging |
| 1061 | harleyszhang/lite_llama<br>A light llama-like llm inference framework based on the triton kernel. | | Emerging |
| 1062 | shivendrra/SmallLanguageModel<br>a LLM cookbook, for building your own from scratch, all the way from... | | Emerging |
| 1063 | sail-sg/understand-r1-zero<br>Understanding R1-Zero-Like Training: A Critical Perspective | | Emerging |
| 1064 | A-baoYang/alpaca-7b-chinese<br>Finetune LLaMA-7B with Chinese instruction datasets | | Emerging |
| 1065 | taufeeque9/codebook-features<br>Sparse and discrete interpretability tool for neural networks | | Emerging |
| 1066 | omron-sinicx/crystalframer<br>The official code repository for "Rethinking the role of frames for... | | Emerging |
| 1067 | nickduran/align2-linguistic-alignment<br>ALIGN 2.0: Modern Python package for multi-level linguistic alignment... | | Emerging |
| 1068 | RobbenRibery/TuoTuo<br>TuoTuo is a Topic Modeling library for Researchers and Engineers | | Emerging |
| 1069 | golsun/DialogRPT<br>EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" | | Emerging |
| 1070 | westlake-repl/IDvs.MoRec<br>End-to-end Training for Multimodal Recommendation Systems | | Emerging |
| 1071 | nuhmanpk/quick-llama<br>Run Ollama models on Google Colab | | Emerging |
| 1072 | leehanchung/lora-instruct<br>Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA | | Emerging |
| 1073 | macabdul9/AnyGen<br>A Unified and Minimalist Pipeline for Generating Outputs with LLMs... | | Emerging |
| 1074 | thunlp/InfLLM<br>The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... | | Emerging |
| 1075 | sinanuozdemir/foundations-of-gen-ai<br>Transformer Architectures for Generative AI | | Emerging |
| 1076 | Azure99/BlossomData<br>A fluent, scalable, and easy-to-use LLM data processing framework. | | Emerging |
| 1077 | Knuckles-Team/genius-chatbot<br>Chatbot that uses any desired hugging face model or allows for scalable... | | Emerging |
| 1078 | RobertCsordas/modules<br>The official repository for our paper "Are Neural Nets Modular? Inspecting... | | Emerging |
| 1079 | Beomi/InfiniTransformer<br>Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No... | | Emerging |
| 1080 | AdrianBZG/llama-multimodal-vqa<br>Multimodal Instruction Tuning for Llama 3 | | Emerging |
| 1081 | jshilong/GPT4RoI<br>(ECCVW 2025) GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | | Emerging |
| 1082 | adaptivetokensampling/ATS<br>Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral... | | Emerging |
| 1083 | jxiw/MambaInLlama<br>[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and... | | Emerging |
| 1084 | ictnlp/LLaVA-Mini<br>LLaVA-Mini is a unified large multimodal model (LMM) that can support the... | | Emerging |
| 1085 | Pyenb/Ollama-models<br>A collection of zipped Ollama models for offline use. Simply download,... | | Emerging |
| 1086 | declare-lab/instruct-eval<br>This repository contains code to quantitatively evaluate instruction-tuned... | | Emerging |
| 1087 | poloclub/LLM-Attributor<br>LLM Attributor: Attribute LLM's Generated Text to Training Data | | Emerging |
| 1088 | audioku/meta-transfer-learning<br>Implementation of meta-transfer-learning for ASR and LM (ACL 2020) | | Emerging |
| 1089 | eugenehp/bitnet-cpp-rs<br>Rust bindings for bitnet.cpp based on llama-cpp-4 | | Emerging |
| 1090 | zjysteven/mink-plus-plus<br>[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training... | | Emerging |
| 1091 | JohnMachado11/Build-a-Large-Language-Model-from-Scratch<br>Building a GPT-like LLM from scratch with PyTorch. | | Emerging |
| 1092 | mlpc-ucsd/BLIVA<br>(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich... | | Emerging |
| 1093 | Eamon2009/Transformer-language-model<br>An educational implementation of a GPT-style language model built from... | | Emerging |
| 1094 | okuvshynov/slowllama<br>Finetune llama2-70b and codellama on MacBook Air without quantization | | Emerging |
| 1095 | Beomi/KcELECTRA<br>🤗 Korean Comments ELECTRA: ELECTRA model trained on Korean comments | | Emerging |
| 1096 | zai-org/GLM-Edge<br>GLM Series Edge Models | | Emerging |
| 1097 | louisbrulenaudet/tsdae<br>Transformer-based Denoising AutoEncoder for Sentence Transformers... | | Emerging |
| 1098 | datadreamer-dev/DataDreamer<br>DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 | | Emerging |
| 1099 | HarderThenHarder/transformers_tasks<br>⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,... | | Emerging |
| 1100 | Kakz/prometheus-llm<br>PrometheusLLM is a unique transformer architecture inspired by dignity and... | | Emerging |