All Transformer Models
7,795 models ranked by quality score · Page 50 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 4901 |
Eric-he-cn/Qwen3-QLoRA-News
This project enables the model to directly generate structured summaries... |
|
Experimental |
| 4902 |
frikishaan/pytorch-transformers
This repository contains the original transformers model implementation code. |
|
Experimental |
| 4903 |
ReNothingg/Mind-4
Экспериментальная трансформерная LLM для локального обучения, инференса и... |
|
Experimental |
| 4904 |
StarLight1212/Story-Teller
This repo mainly encapsulates an LLM model + front-end + back-end for... |
|
Experimental |
| 4905 |
KOKOSde/sparse-clt
Cross-Layer Transcoder (CLT) library for extracting sparse interpretable... |
|
Experimental |
| 4906 |
Xhst/ml-record-linkage
Unstructured Record Linkage using Siamese Networks and Large Language Models... |
|
Experimental |
| 4907 |
ZEKE320/llm-dataset-generator
The LLM Dataset Generator is an open source tool for generating text data... |
|
Experimental |
| 4908 |
michelecafagna26/VinVL
Original VinVL (and Oscar) repo with API designed for an easy inference |
|
Experimental |
| 4909 |
sidharrth2002/text-scoring
Industrial Text Scoring using Multimodal Deep Natural Language Processing 🚀 ... |
|
Experimental |
| 4910 |
SauravMaheshkar/nanollm
JAX LLM playground |
|
Experimental |
| 4911 |
secret-ai-labs/awesome-local-llm
Your complete guide to running powerful AI models locally in 2025. Covers... |
|
Experimental |
| 4912 |
lijoraju/llm-news-aggregator
Personalized news summaries using LLMs, FAISS, and Telegram bot. |
|
Experimental |
| 4913 |
Simoso68/llama-lit
Streamlit frontend for Ollama. |
|
Experimental |
| 4914 |
ntphuc149/ViAG
ViAG: A Novel Framework for Fine-tuning Answer Generation models ultilizing... |
|
Experimental |
| 4915 |
Mecanik/Tiny-BPE-Trainer
Lightweight, header-only Byte Pair Encoding (BPE) trainer in modern C++17.... |
|
Experimental |
| 4916 |
chrisliu298/awesome-sparse-autoencoders
A resource repository of sparse autoencoders for large language models |
|
Experimental |
| 4917 |
thyt3618/instap-ai
an AI Agent project contributed by students from YingShan Middle... |
|
Experimental |
| 4918 |
di37/multiclass-image-classification-using-multimodal-llms
A comprehensive comparison of multimodal models - llama3.2-vision,... |
|
Experimental |
| 4919 |
Mahadasghar/Amazon-food-sentiment-analyzer
Fine-tuned RoBERTa transformer for Amazon food review sentiment analysis... |
|
Experimental |
| 4920 |
CameLLM/CameLLM
Run your favourite LLMs locally on macOS from Swift |
|
Experimental |
| 4921 |
brihijoshi/granular-similarity-COLING-2020
Code for the paper "The Devil is in the Details: Evaluating Limitations of... |
|
Experimental |
| 4922 |
NISL-MSU/MultiSetSR
Decomposable Neuro Symbolic Regression |
|
Experimental |
| 4923 |
PRITHIVSAKTHIUR/Doc-VLMs-exp
An experimental document-focused Vision-Language Model application that... |
|
Experimental |
| 4924 |
The-Swarm-Corporation/ClusterMoE
A novel neural network architecture that extends Mixture of Experts (MoE)... |
|
Experimental |
| 4925 |
efficientscaling/Z1
[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" |
|
Experimental |
| 4926 |
koudounasalkis/divergence-in-speech-systems
Code associated with the paper "Exploring Subgroup Performance in End-to-End... |
|
Experimental |
| 4927 |
jwmke/BiasCompass
Using LLMs to detect bias in news articles. |
|
Experimental |
| 4928 |
rajveer43/titan_transformer
Unofficial implementation of titans transformer |
|
Experimental |
| 4929 |
Saivineeth147/LLM-Compass
The ultimate collection of resources for building, evaluating, and... |
|
Experimental |
| 4930 |
Vaioskn/song-identification-fingerprints-and-embeddings
Song identification combining landmark audio fingerprinting with... |
|
Experimental |
| 4931 |
psunlpgroup/VerbosityLLM
This repository maintains dataset, predictions, and code for paper:... |
|
Experimental |
| 4932 |
RedTeamingforLLMs/RedTeamingforLLMs
A framework designed for executing positive red-teaming experiments on large... |
|
Experimental |
| 4933 |
kyegomez/Mixture-of-MQA
An implementation of a switch transformer like Multi-query attention model |
|
Experimental |
| 4934 |
obss/disgem
[EMNLP 2024] Official Implementation of DisGeM: Distractor Generation for... |
|
Experimental |
| 4935 |
jhuapl-fomo/ralf
A lightweight library to support the development of applications using LLMs |
|
Experimental |
| 4936 |
Aqib121201/BurgerBot
LLM dashboard for German policy documents—translation, summarization, visualization |
|
Experimental |
| 4937 |
Mrigank005/Rubric_Generator
This repository contains a machine learning model designed to generate... |
|
Experimental |
| 4938 |
adarsh-crafts/llama-llm-from-scratch
Educational, from-scratch implementation of a LLaMA-style LLM using PyTorch... |
|
Experimental |
| 4939 |
MauroLuzzatto/lyrics-translator
🎵 LyricsTranslator is a Python library for automated lyrics translation |
|
Experimental |
| 4940 |
zhudotexe/kani-vision
Kani extension for supporting vision-language models (VLMs). Comes with... |
|
Experimental |
| 4941 |
Sarvesh-Yadav-5201/Lyrics-Generation---NLP-Project
This is a project to demonstrate the capabilities of Transformer Models to... |
|
Experimental |
| 4942 |
sappho192/EDMTranslator
.NET Text translator library based on LLM models, especially... |
|
Experimental |
| 4943 |
marksgraham/transformer-ood
Official PyTorch code for "Transformer-based out-of-distribution detection... |
|
Experimental |
| 4944 |
itsDaiton/masters-thesis
Exploration and Comparison of Transformers for Image Classification. |
|
Experimental |
| 4945 |
mduffster/self-referent-test
Testing role-based pathways on small LLMs |
|
Experimental |
| 4946 |
HySonLab/HierAttention
Scalable Hierarchical Self-Attention with Learnable Hierarchy for Long-Range... |
|
Experimental |
| 4947 |
harrisonvshen/triton-accelerated-attention
Custom Triton GPU kernels for multi-head attention, including QK^T, softmax,... |
|
Experimental |
| 4948 |
minuva/llm-flow-classification
LLM conversation flow classification 💬 |
|
Experimental |
| 4949 |
ryan-air/Alpaca-3B-Fine-Tuned
In this project, I have provided code and a Colaboratory notebook that... |
|
Experimental |
| 4950 |
gmongaras/2Mamba2Furious
Code for the paper "2Mamba2Furious: Linear in complexity, competitive in accuracy" |
|
Experimental |
| 4951 |
Khushiyant/tether
Tether is a Triton-powered framework for training and deploying Spiking Transformers. |
|
Experimental |
| 4952 |
Shengwei-Peng/TOCFL-MultiBench
TOCFL-MultiBench: A multimodal benchmark for evaluating Chinese language... |
|
Experimental |
| 4953 |
mspronesti/llm.sycl
llm.c, but in SYCL/Intel oneAPI! |
|
Experimental |
| 4954 |
kapshaul/LLM-finetune-vuln-detection
Fine-tuning a Large Language Model (LLM) for code vulnerability detection... |
|
Experimental |
| 4955 |
edoost/pert
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging |
|
Experimental |
| 4956 |
wasim/scaling-specialization-dense-lms
Do dense LMs develop MoE-like specialization as they scale? Measure it,... |
|
Experimental |
| 4957 |
zabir-nabil/bangla-multilingual-llm-eval
Evaluation of Open and Closed-Source Multi-lingual LLMs for Low-Resource... |
|
Experimental |
| 4958 |
ngavu2004/text-to-knowledge-graph
Turn your Text into a mind map based on LLMs knowledge graph |
|
Experimental |
| 4959 |
H0NEYP0T-466/Isabella
⚙️ Isabella – a full-stack 🚀 conversational system built on FastAPI ✨... |
|
Experimental |
| 4960 |
FareedKhan-dev/best-introduction-to-transformer
transformer again in the same manner as I did in my previous blog (for both... |
|
Experimental |
| 4961 |
ebarkhordar/voter-behavior-prediction-LLM
This project explores the predictive power of large language models (LLMs)... |
|
Experimental |
| 4962 |
Mustapha-AJEGHRIR/arabic_calligraphy
This is a repo containing our code for Arabic calligraphy style detection... |
|
Experimental |
| 4963 |
yophis/decom-renorm-merge
Decom-Renorm-Merge: Merging deep learning models through shared representation space. |
|
Experimental |
| 4964 |
AlanC12138/summarizer-api
AI-powered document summarization API built with FastAPI, Hugging Face... |
|
Experimental |
| 4965 |
affjljoo3581/CommonLit-Readability-Prize
🥈42nd place in CommonLit Readability Prize competition🥈 |
|
Experimental |
| 4966 |
HES-XPLAIN/mlxplain
An open platform for accelerating the development of eXplainable AI systems |
|
Experimental |
| 4967 |
a-kostikova/LLLMs-Survey
The GitHub page for the survey paper "LLLMs: A Data-Driven Survey of... |
|
Experimental |
| 4968 |
zufeshan12/fine-tuning-and-reinforcement-learning-on-llms
supervised fine tuning and RLAIF on DeepSeek-math-7b-base using LoRA... |
|
Experimental |
| 4969 |
NoviceStone/Keqing
An interpretable KBQA system that operates at the natural language level... |
|
Experimental |
| 4970 |
rishikksh20/qwen3-playground
Readable implementation of Qwen3 0.6B model |
|
Experimental |
| 4971 |
EgosOwn/llama-linux-helper
Never Google for linux commands again with the help of LLaMA |
|
Experimental |
| 4972 |
alipay/fin_domain_llm
Implementation of the paper: WeaverBird: Empowering Financial... |
|
Experimental |
| 4973 |
SkAndMl/captiongpt
Image Captioning using ViT and GPT. Notebook version in the following link |
|
Experimental |
| 4974 |
LMLK-seal/LLModel
Private LLModel GUI Chat allows users to interact with a local large... |
|
Experimental |
| 4975 |
FardinHash/multilabel-classification-llm
Multi-label classification using LLMs, with additional enhancements using... |
|
Experimental |
| 4976 |
colinrizzman/Neural-Romance-v2
A neural network calculates your chance of finding love. |
|
Experimental |
| 4977 |
brej-29/analytics-copilot-text2sql
Analytics Copilot (Text-to-SQL) is an end-to-end LLM engineering project... |
|
Experimental |
| 4978 |
lapismyt/pyAIHorde
Simple library for interacting with AI Horde API. |
|
Experimental |
| 4979 |
waelantar/ATTS_Complete_Free_Package
ATTS: Adaptive Test-Time Scaling - A validated framework for optimizing LLM... |
|
Experimental |
| 4980 |
jha-lab/transcode
[TCAD'23] TransCODE: Co-design of Transformers and Accelerators for... |
|
Experimental |
| 4981 |
sergio-sanz-rodriguez/Vision-Transformers-Image-Classification
Development of Vision Transformer (ViT) networks for multi-class image... |
|
Experimental |
| 4982 |
kbulutozler/transformers-text-classification
using transformers to do text classification. |
|
Experimental |
| 4983 |
necrashter/transformers-learnable-memory
Fine-tuning Image Transformers using Learnable Memory |
|
Experimental |
| 4984 |
Wangmerlyn/KeyChain
KeyChain, UUID-driven data augmentation design behind LoongRL (ICLR 2026 oral) |
|
Experimental |
| 4985 |
nopperl/corporate_emission_reports
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports... |
|
Experimental |
| 4986 |
freddxvill/Proyecto_Traductor_de_la_LSB
Traductor de Lengua de Señas Boliviana (LSB) a texto utilizando redes... |
|
Experimental |
| 4987 |
himanshu231204/langchain-playground--for-llms-
A personal learning space for LangChain, featuring code snippets, notes, and... |
|
Experimental |
| 4988 |
Aqib121201/FairNLP-SHAP-Based-Bias-Detection-in-Multilingual-BERT-Models
Bias analysis in multilingual BERT using SHAP and fairness metrics (EN, DE, HI) |
|
Experimental |
| 4989 |
eminorhan/llm-memory
Memory experiments with LLMs |
|
Experimental |
| 4990 |
ColinWu0403/LLaMA-2-hf-Chatbot
Chatbot from pretrained LLaMA-2 LLM model, fine-tuned with medical research... |
|
Experimental |
| 4991 |
GusLovesMath/Llama3_MacSilicon
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3).... |
|
Experimental |
| 4992 |
gemlorg/Thesis-Trading-LLM
Bachelor's Thesis on Machine Learning for Stock Market Forecasting. Several... |
|
Experimental |
| 4993 |
chazciii/rd-net
Inference-time drift experiment demonstrating reduced repetition collapse in... |
|
Experimental |
| 4994 |
JangYeongSil/JettaRLLLM
Jetta-Reinforcement-Learning-Hybrid-LLM-Architecture |
|
Experimental |
| 4995 |
mlsw/partial-embedding-matrix-adaptation
Vocabulary-level memory efficiency for language model fine-tuning. |
|
Experimental |
| 4996 |
Traffic-Alpha/VLMLight
Official implementation of VLMLight |
|
Experimental |
| 4997 |
OmarBouhamed/T-DPnet
T-DPnet-Transformer-based-deep-Probabilistic-network-for-load-forecasting |
|
Experimental |
| 4998 |
bagh2178/GC-VLN
[CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free... |
|
Experimental |
| 4999 |
s-JoL/Llama3-extend-vocab
A demo of expanding the vocabulary of the Llama3 model, applicable to other... |
|
Experimental |
| 5000 |
JerryPan2718/flexgpt
Tradeoff between runtime and RAM usage for large language model inference. |
|
Experimental |