All Transformer Models
7,795 models ranked by quality score · Page 34 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3301 |
LaMP-Benchmark/LaMP
Codes for papers on Large Language Models Personalization (LaMP) |
|
Experimental |
| 3302 |
kaalen/tiny-assistant
Experimenting with smaller LLMs that can run on commodity hardware like... |
|
Experimental |
| 3303 |
DanHrmti/SenTransformer-VAE-pytorch
Sentence VAE using the Transformer encoder-decoder architecture. |
|
Experimental |
| 3304 |
jasonacox/ProtosAI
A Study in Artificial Intelligence - Simple scripts that explore... |
|
Experimental |
| 3305 |
sabrinaherbst/distilbert_question_answering
Implements a Q&A ML model usuing DistilBERT. |
|
Experimental |
| 3306 |
Yusuf270200101/DeepAnalyze
🔍 Empower data scientists with DeepAnalyze, a tool that leverages large... |
|
Experimental |
| 3307 |
khairulislam/Timeseries-Explained
Interpreting Deep Learning timeseries models using Local Interpretation methods |
|
Experimental |
| 3308 |
LSquaredM/mutual_info_scaling_law
(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... |
|
Experimental |
| 3309 |
HROlive/Advanced-Deep-Learning-with-Transformers
Workshop that will take you from Graph Neural Networks (GNNs) to... |
|
Experimental |
| 3310 |
andylolu2/jax-vqvae-gpt
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem. |
|
Experimental |
| 3311 |
oooranz/GraDe
📐 Not All Features Deserve Attention: Graph-Guided Dependency Learning for... |
|
Experimental |
| 3312 |
ImMohammadHosseini/MKP-RL
:sparkles: Solve multi_dimensional multiple knapsack problem using... |
|
Experimental |
| 3313 |
Uokoroafor/transformer_from_scratch
This is a PyTorch implementation of the Transformer model in the paper... |
|
Experimental |
| 3314 |
WooooDyy/MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique... |
|
Experimental |
| 3315 |
zhuang-li/SCAR
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response... |
|
Experimental |
| 3316 |
debugst1ck/tarp
🦠 Antimicrobial resistance prediction using transformer models. |
|
Experimental |
| 3317 |
othmanelhoufi/LM-for-FactChecking
An automated solution for fact-checking using available claims and fake-news... |
|
Experimental |
| 3318 |
plutonium-239/memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation |
|
Experimental |
| 3319 |
philogicae/ai-notebooks-colab
Useful colab notebooks to try out Stable Diffusion, LLM, etc. |
|
Experimental |
| 3320 |
nishantb06/smolLM
Reverse Engineering SmolLM2 model and training it from scratch |
|
Experimental |
| 3321 |
hurui200320/llama-cpp-kt
The Kotlin wrapper of llama.cpp, powered by JNA |
|
Experimental |
| 3322 |
LastBotInc/llama2j
Pure Java Llama2 inference with optional multi-GPU CUDA implementation |
|
Experimental |
| 3323 |
AmericanPresidentJimmyCarter/yal-discord-bot
Yet Another LLaMA/ALPACA Discord Bot |
|
Experimental |
| 3324 |
AshutoshKulkarni4998/UMWTransformer
Inference code for "Unified Multi-Weather Transformer for Multi-Weather... |
|
Experimental |
| 3325 |
atomlayer/llama_cute_voice_assistant
Llama cute voice assistant |
|
Experimental |
| 3326 |
DDDOH/LLM_News
LOLA_ LLM-Assisted Online Learning Algorithm for Content Experiments |
|
Experimental |
| 3327 |
MusfiqDehan/Llama2-Finetuned-for-Translation
Fine-Tuned Llama-2 For Machine Translation |
|
Experimental |
| 3328 |
harshtiwari01/llm-heatmap-visualizer
A set of scripts to generate full attention-head heatmaps for transformer-based LLMs |
|
Experimental |
| 3329 |
snexus/nlp-question-answering-system
Question answering system with transformers |
|
Experimental |
| 3330 |
codessian/epistemic-confidence-layer
Model-agnostic trust protocol for calibrated, auditable AI |
|
Experimental |
| 3331 |
snsn3/policy-LLM
Finetuning an LLM for heavy policy work |
|
Experimental |
| 3332 |
AnkitaMungalpara/Building-LLM-From-Scratch
This repository provides a step-by-step guide to creating your own large... |
|
Experimental |
| 3333 |
tobifinn/ensemble_transformer
Official PyTorch implementation of "Self-Attentive Ensemble Transformer:... |
|
Experimental |
| 3334 |
AdrienneDeganutti/DANTE-AD
"DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description"... |
|
Experimental |
| 3335 |
Scientific-Computing-Lab/Tokompiler
Scope is all you need: Transforming LLMs for HPC Code |
|
Experimental |
| 3336 |
isaacus-dev/terge
An easy-to-use Python library for merging PyTorch models. |
|
Experimental |
| 3337 |
scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers |
|
Experimental |
| 3338 |
declare-lab/KNOT
This repository contains the implementation of the paper -- KNOT: Knowledge... |
|
Experimental |
| 3339 |
andyngdz/exogen_backend
ExoGen Backend |
|
Experimental |
| 3340 |
ictnlp/LSG
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers... |
|
Experimental |
| 3341 |
gersongerardcruz/extractive_and_abstractive_text_summarization
A combination of extractive and abstractive text summarization for... |
|
Experimental |
| 3342 |
Guest400123064/ezgatr
Geometric Algebra Transformer Made Easy |
|
Experimental |
| 3343 |
cui-shaobo/causal-strength
evaluating the causal strength between cause and effect |
|
Experimental |
| 3344 |
richardsonlima/synapsense
SynapSense: Python In-Context Learning for Large Language Models SynapSense... |
|
Experimental |
| 3345 |
sparkle-reasoning/sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs... |
|
Experimental |
| 3346 |
JianxXiong/AAPO
Implementation of AAPO (Arxiv: 2505.14264v2) paper |
|
Experimental |
| 3347 |
affjljoo3581/starcoder-jax
a Jax/Flax inference code of StarCoder |
|
Experimental |
| 3348 |
LoupFireYT/llm.c
🛠️ Explore GPT-2 with llm.c, a lightweight C implementation that simplifies... |
|
Experimental |
| 3349 |
Quotify-Bot/quotify-frontend
AI-powered inspirational quote generator |
|
Experimental |
| 3350 |
Orfeous/llamacpp.net
C#/.NET binding of llama.cpp |
|
Experimental |
| 3351 |
ziansu/prorec
Official Implementation of NeurIPS 2024 paper - Source Code Foundation... |
|
Experimental |
| 3352 |
emagod/LLM-Forecast
🚀 Integrate ARIMA and Large Language Models for accurate forecasting with... |
|
Experimental |
| 3353 |
navamai/navamai
Use NavamAI to supercharge your productivity and workflow with personal,... |
|
Experimental |
| 3354 |
robjsliwa/llama-agent
Fun project to run your own LLM chat bot using llama.cpp |
|
Experimental |
| 3355 |
jesusvilela/IGBundle-LLM
IGBundle LLM is an experimental framework for adapting Large Language Models... |
|
Experimental |
| 3356 |
newfull5/NLLB-200-Distilled-350M-en-ko
nllb-200 distilled 350M for English to Korean translation |
|
Experimental |
| 3357 |
rachel-pai/T5Elasticsearch
Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers. |
|
Experimental |
| 3358 |
nmamie/HiveLLM
This project evaluated the collective intelligence potential of... |
|
Experimental |
| 3359 |
Framstag/LLMAnalysisJinni
A tool to implement complex analysis tasks using an LLM in cases where you... |
|
Experimental |
| 3360 |
imSanko/Image_Caption_Generator_With_Transformers
This repository contains code for generating captions for images using a... |
|
Experimental |
| 3361 |
IsmaelMousa/TTL
Full-stack simulator for a todo task list application using FastAPI, I built... |
|
Experimental |
| 3362 |
Yellow4Submarine7/LLMDoctor
🩺 Token-Level Flow-Guided Preference Optimization for Efficient Test-Time... |
|
Experimental |
| 3363 |
bernardoleite/question-generation-t5-pytorch-lightning
Question Generation for English and Portuguese, using the T5 model,... |
|
Experimental |
| 3364 |
rekalantar/MedSegmentAnything_SAM_LungCT
The code to finetune SAM with bounding box prompt for segmentation of the lungs on CT |
|
Experimental |
| 3365 |
sno2/bertml
Use common pre-trained ML models in Deno! |
|
Experimental |
| 3366 |
CeMOS-IS/GenFormer
[ICPR 2024] Official repository of the paper "GenFormer - Generated Images... |
|
Experimental |
| 3367 |
codepawl/turboquant-torch
Unofficial PyTorch implementation of TurboQuant (Google Research, ICLR... |
|
Experimental |
| 3368 |
AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna
This repository contains code to fine-tune a Llama-7B-Uncensored model using... |
|
Experimental |
| 3369 |
gsarti/pecore
Materials for "Quantifying the Plausibility of Context Reliance in Neural... |
|
Experimental |
| 3370 |
aerosta/rewardhackwatch
Runtime detector for reward hacking and misalignment in LLM agents (89.7% F1... |
|
Experimental |
| 3371 |
JarvisPei/MemDLM
MemDLM: Memory-enhanced Diffusion Language Model |
|
Experimental |
| 3372 |
shub-garg/Vision-Transformer-VIT-for-MNIST
This repository implements a Vision Transformer (ViT) to classify... |
|
Experimental |
| 3373 |
szheng3/Rust-server-pre-trained-models
Rust server that summarizes text with pre-trained models |
|
Experimental |
| 3374 |
mpociot/llamero
A GUI application to easily try out Facebook's LLaMA models. |
|
Experimental |
| 3375 |
Hyun-Ryu/clover
Official code for "Divide and Translate: Compositional First-Order Logic... |
|
Experimental |
| 3376 |
JamesVorder/python-tddpp
This LLM generates code based on tests, and makes sure they pass. |
|
Experimental |
| 3377 |
Wells-the-Doctor/leaxer
🌟 Build and deploy local AI models with Leaxer for real-time interaction,... |
|
Experimental |
| 3378 |
eshoyuan/WeChat-LLM
WeChat-LLM: Build a LLM that Mirrors Your Chat Style Using WeChat... |
|
Experimental |
| 3379 |
OpenNLG/OpenBA-v2
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing... |
|
Experimental |
| 3380 |
maggiesong7/FullyAttentional
Fully Attentional Network for Semantic Segmentation [AAAI 2022] |
|
Experimental |
| 3381 |
Michael-Jackson666/Zero2Hero-AI
From first principles to state-of-the-art: A hands-on journey implementing... |
|
Experimental |
| 3382 |
hydropix/AutoDescribe-Images
Tool to automatically generate text descriptions for images using Ollama... |
|
Experimental |
| 3383 |
ibnaleem/mixtral.py
A Python module for running the Mixtral-8x7B language model with... |
|
Experimental |
| 3384 |
jankstar/pydocu
fastapi server for classification of documents and extraction of data |
|
Experimental |
| 3385 |
aidendorian/Marcella-60M-SLM
A 66M parameter decoder-only transformer language model implemented from... |
|
Experimental |
| 3386 |
yul091/GraphLogAD
Codebase for the ICKG 2023 paper: "GLAD: Content-aware Dynamic Graphs For... |
|
Experimental |
| 3387 |
muhammad-fiaz/EMSUGI
EMSUGI is a future prediction & analysis project on various factor like... |
|
Experimental |
| 3388 |
li-plus/nanoRLHF
Train a tiny LLaMA model from scratch to repeat your words using... |
|
Experimental |
| 3389 |
CanvaChen/chinese-llama-tokenizer
目标:构建一个更符合语言学的小而美的 llama 分词器,支持中英日三国语言 |
|
Experimental |
| 3390 |
Losif01/text-preprocessing-to-transformers-NLP-notes
This repo is my personal notes from the Stanford NLP course, and i currently... |
|
Experimental |
| 3391 |
isaaccorley/segmenter-pytorch
PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation"... |
|
Experimental |
| 3392 |
UCSC-VLAA/Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal... |
|
Experimental |
| 3393 |
ilya16/deephumor
DeepHumor: Image-based Meme Generation using Deep Learning |
|
Experimental |
| 3394 |
merlerm/In-Context-Symbolic-Regression
Official code implementation for the ACL 2024 Student Research Workshop... |
|
Experimental |
| 3395 |
yyy01/PAC
The official implementation of the paper "Data Contamination Calibration for... |
|
Experimental |
| 3396 |
Y-Research-SBU/CSR
Official Repository for CSR - ICML 2025 Oral |
|
Experimental |
| 3397 |
Type-Here/med-vix-ray
A Knowledge-Guided Model for CXR classification |
|
Experimental |
| 3398 |
erfanzar/OST-OpenSourceTransformers
OST Collection: An AI-powered suite of models that predict the next word... |
|
Experimental |
| 3399 |
ArtificialZeng/transformers-Explained
官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。 |
|
Experimental |
| 3400 |
januverma/transformers-for-sequential-recommendation
Notebooks on using transformers for sequential recommendation tasks |
|
Experimental |