All Transformer Models
7,795 models ranked by quality score · Page 22 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2101 | dmis-lab/Monet · [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers | | Emerging |
| 2102 | AdrianBZG/LLM-distributed-finetune · Efficiently tune any LLM from HuggingFace using distributed training... | | Emerging |
| 2103 | lucasjinreal/Namo-R1 · A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from... | | Emerging |
| 2104 | wehos/awesome-graph-transformer · Papers about graph transformers. | | Emerging |
| 2105 | logic-OT/BobVLM · BobVLM – A 1.5B multimodal model built from scratch and pre-trained on a... | | Emerging |
| 2106 | ExplainableML/Vision_by_Language · [ICLR 2024] Official repository for "Vision-by-Language for Training-Free... | | Emerging |
| 2107 | daviden1013/llm-ie · A comprehensive toolkit that provides building blocks for LLM-based named... | | Emerging |
| 2108 | ExplainableML/WaffleCLIP · Official repository for the ICCV 2023 paper: "Waffling around for... | | Emerging |
| 2109 | YeonwooSung/vision-search · Image search engine | | Emerging |
| 2110 | TencentARC/ST-LLM · [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language... | | Emerging |
| 2111 | eliahuhorwitz/Spectral-DeTuning · Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning... | | Emerging |
| 2112 | davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection · Code for Video Deepfake Detector from "MINTIME: Multi-Identity... | | Emerging |
| 2113 | DestroyerDarkNess/fastvlm-webgpu · Real-time video captioning powered by FastVLM | | Emerging |
| 2114 | EvilFreelancer/rugpt3-custom · Pre-training custom ruGPT3 model on books written by F.M. Dostoevski | | Emerging |
| 2115 | cifkao/context-probing · Black-box language model explanation by context length probing | | Emerging |
| 2116 | DCQN-axiomatics/DCQN-Matrix-Axiomatik-LLM-Protocol · A strict, deterministic LLM protocol for loading, reading and activating the... | | Emerging |
| 2117 | monk1337/auto-ollama · run ollama & gguf easily with a single command | | Emerging |
| 2118 | bloomberg/MixCE-acl2023 · Implementation of MixCE method described in ACL 2023 paper by Zhang et al. | | Emerging |
| 2119 | X-iZhang/CCD · 📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive... | | Emerging |
| 2120 | henrikalbihn/gliner-as-a-service · GLiNER model in a FastAPI microservice. | | Emerging |
| 2121 | pymc-labs/transpailer · LLM-based, self-correcting transpiler. Supports JAX, PyTorch, Rust, PyMC, Stan. | | Emerging |
| 2122 | nareshis21/Truelarge-RT · Android inference engine running 20B+ parameter LLMs on 4GB-8GB RAM devices.... | | Emerging |
| 2123 | MNoorFawi/curlora · The code repository for the CURLoRA research paper. Stable LLM continual... | | Emerging |
| 2124 | PathologyFoundation/plip · Pathology Language and Image Pre-Training (PLIP) is the first vision and... | | Emerging |
| 2125 | Nikityyy/lille · A powerful 130-million-parameter model trained from scratch as part of a... | | Emerging |
| 2126 | rezazad68/transdeeplab · TransDeepLab: Convolution-Free Transformer-based DeepLab v3+ for Medical... | | Emerging |
| 2127 | aws-samples/sample-for-multi-modal-document-to-json-with-sagemaker-ai · This open-source project delivers a complete pipeline for converting... | | Emerging |
| 2128 | will-thompson-k/tldr-transformers · The "tl;dr" on a few notable transformer papers (pre-2022). | | Emerging |
| 2129 | Saivineeth147/llm-testlab · Comprehensive Testing Tool for Large Language Models | | Emerging |
| 2130 | arcee-ai/PruneMe · Automated Identification of Redundant Layer Blocks for Pruning in Large... | | Emerging |
| 2131 | SakanaAI/evo-memory · Code to train and evaluate Neural Attention Memory Models to obtain... | | Emerging |
| 2132 | HaoAreYuDong/MachineLearningLM · Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML | | Emerging |
| 2133 | ManashJKonwar/NLP-Transformers · Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks | | Emerging |
| 2134 | GithubX-F/DynaMO-RL · Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization... | | Emerging |
| 2135 | IDSIA/modern-srwm · Official repository for the paper "A Modern Self-Referential Weight Matrix... | | Emerging |
| 2136 | tsinghua-fib-lab/ANeurIPS2024_SPV-MIA · [NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language... | | Emerging |
| 2137 | SolomonB14D3/knowledge-fidelity · Behavioral auditing & repair toolkit for LLMs. Measures 8 dimensions via... | | Emerging |
| 2138 | amajee11us/TabGLM · [AAAI' 25] Tabular Graph-Text Representation Learning with Consistency Minimization | | Emerging |
| 2139 | c00k1ez/plain-transformers · Transformer models implementation for training from scratch. | | Emerging |
| 2140 | abacaj/transformers-docker · Run, build, test transformer models using docker | | Emerging |
| 2141 | pymc-labs/transalchemy · LLM-based, self-correcting transpiler. Supports JAX, PyTorch, Rust, PyMC, Stan. | | Emerging |
| 2142 | linonetwo/langchain-alpaca · Run Alpaca LLM in LangChain | | Emerging |
| 2143 | gitctrlx/llama.cu · Llama from scratch in CUDA with Flash Attention. | | Emerging |
| 2144 | CLAIRE-Labo/quantile-reward-policy-optimization · Official codebase for "Quantile Reward Policy Optimization: Alignment with... | | Emerging |
| 2145 | CristianCristanchoT/chivito · Implementation of a Llama-based LLM fine-tuned in Spanish using... | | Emerging |
| 2146 | BubbleJoe-BrownU/TransformerHub · This is a repository of transformer-like models, including Transformer, GPT,... | | Emerging |
| 2147 | kuvaus/llama-chat · Simple chat program for LLaMa models | | Emerging |
| 2148 | AkiRusProd/numpy-transformer · A numpy implementation of the Transformer model in "Attention is All You Need" | | Emerging |
| 2149 | Hon-Wong/VoRA · [Fully open] [Encoder-free MLLM] Vision as LoRA | | Emerging |
| 2150 | cxcscmu/Montessori-Instruct · Official repository for Montessori-Instruct: Generate Influential Training... | | Emerging |
| 2151 | vivy-yi/awesome-llm-training-inference · Curated list of LLM training and inference frameworks, tools, and resources.... | | Emerging |
| 2152 | an-yongqi/systematic-outliers · [ICLR 2025] Systematic Outliers in Large Language Models. | | Emerging |
| 2153 | shahrukhx01/bert-probe · BERT Probe: A python package for probing attention based robustness to... | | Emerging |
| 2154 | jorgemunozl/Finetunning-Llama-Vision-11b · Inference and fine-tuning of a VLM (Llama Vision 11b) using the Unsloth,... | | Emerging |
| 2155 | rasbt/gradient-accumulation-blog · Finetuning BLOOM on a single GPU using gradient-accumulation | | Emerging |
| 2156 | theodo-group/GenossGPT · One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT... | | Emerging |
| 2157 | RahulSChand/gpu_poor · Calculate token/s & GPU memory requirement for any LLM. Supports... | | Emerging |
| 2158 | antoninodimaggio/Hugging-Captions · Generate realistic Instagram captions using transformers 🤗 | | Emerging |
| 2159 | czg1225/CoDe · [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive... | | Emerging |
| 2160 | itsqyh/Awesome-LMMs-Mechanistic-Interpretability · A curated collection of resources focused on the Mechanistic... | | Emerging |
| 2161 | jmnolte/HCCNet · Early prediction of liver cancer using longitudinal MRI | | Emerging |
| 2162 | fermyon/ai-examples · A collection of serverless apps that show how Fermyon's Serverless AI... | | Emerging |
| 2163 | Ramseths/app-llama2 · Generative AI - LLaMA 2 7B & LangChain, to generate stories based on a genre. | | Emerging |
| 2164 | forgi86/sysid-transformers · Code to reproduce the results of the paper In-context learning for... | | Emerging |
| 2165 | chelsea0x3b/llama-dfdx · LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! | | Emerging |
| 2166 | ausboss/Local-LLM-Langchain · Load local LLMs effortlessly in a Jupyter notebook for testing purposes... | | Emerging |
| 2167 | leliuga/cohere-configurations · Co:Here Inference configurations | | Emerging |
| 2168 | LuluW8071/Text-Sentiment-Analysis · Text Sentiment Analysis with RNN Models + Additive Attention and Transformers | | Emerging |
| 2169 | taishan1994/LLM-Quantization · Notes summarizing LLM quantization. | | Emerging |
| 2170 | Zishan-Shao/FlashSVD · Welcome to FlashSVD, an activation-aware inference system for SVD-based... | | Emerging |
| 2171 | iKernels/transformers-lightning · A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses... | | Emerging |
| 2172 | datatrigger/nlp_hugging_face · Text classification with the transformers library from Hugging Face, by... | | Emerging |
| 2173 | xmindflow/MMCFormer · [MIDL 2023] MMCFormer: Missing Modality Compensation Transformer for Brain... | | Emerging |
| 2174 | dougeeai/llama-cpp-python-wheels · Pre-built wheels for llama-cpp-python across platforms and CUDA versions | | Emerging |
| 2175 | damianoduranti/LLMknowextra · LLM-Driven Knowledge Extraction: Results in Temporal and Description Logics... | | Emerging |
| 2176 | jakobtroidl/neuron-shape-reasoning · PyTorch Implementation of Global Neuron Shape Reasoning with Point Affinity... | | Emerging |
| 2177 | RAHB-REALTORS-Association/email-autodrafts · Email Auto-ReplAI is a Python tool that uses AI to automate drafting... | | Emerging |
| 2178 | ZJLAB-AMMI/LLM4Teach · Python code to implement LLM4Teach, a policy distillation approach for... | | Emerging |
| 2179 | NiuTrans/LaMaTE · Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... | | Emerging |
| 2180 | Pengxin-Guo/FedSA-LoRA · Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025] | | Emerging |
| 2181 | Tebmer/Awesome-Knowledge-Distillation-of-LLMs · This repository collects papers for "A Survey on Knowledge Distillation of... | | Emerging |
| 2182 | ModelTC/QLLM · [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate... | | Emerging |
| 2183 | dhruvdcoder/xlm-core · XLM is a modular, research-friendly framework for developing and comparing... | | Emerging |
| 2184 | 5aharsh/collama · Run Ollama LLM models in Google Colab for free | | Emerging |
| 2185 | InternLM/OREAL · Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning | | Emerging |
| 2186 | tojiboyevf/image_captioning · Deep Learning Final project 2022 | | Emerging |
| 2187 | dsindex/iclassifier · reference pytorch code for intent classification | | Emerging |
| 2188 | liuqidong07/LEADER-pytorch · [arXiv'24] The official implementation code of LEADER. | | Emerging |
| 2189 | forgi86/sysid-transformers-transfer · Code of the paper "On the adaptation of in-context learners for system... | | Emerging |
| 2190 | AntonioGr7/pratical-llms · A collection of hands-on notebooks for LLM practitioners | | Emerging |
| 2191 | ksm26/Open-Source-Models-with-Hugging-Face · "Open Source Models with Hugging Face" course empowers you with the skills... | | Emerging |
| 2192 | EasierMTL/chinese-translation-app · Chinese to English Translation Full Stack Web App + Automated Load Testing... | | Emerging |
| 2193 | ictnlp/TruthX · Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large... | | Emerging |
| 2194 | guyoung/AIMatrices · AIMatrices is a lightweight, high-performance, scalable, and open source AI... | | Emerging |
| 2195 | GURPREETKAURJETHRA/Ollama-UseCases · This repo brings numerous use cases from the Open Source Ollama | | Emerging |
| 2196 | ASSERT-KTH/repairllama · RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program... | | Emerging |
| 2197 | haoliuhl/instructrl · Instruction Following Agents with Multimodal Transformers | | Emerging |
| 2198 | WayneMao/RoboMatrix · The Official Implementation of RoboMatrix | | Emerging |
| 2199 | Infini-AI-Lab/Sequoia · scalable and robust tree-based speculative decoding algorithm | | Emerging |
| 2200 | mechramc/Orion · Local AI runtime for training & running small LLMs directly on Apple Neural... | | Emerging |