All Transformer Models
7,795 models ranked by quality score · Page 41 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 4001 |
eladwf/adaptive-multirate-transformers
DSP-inspired multirate wrappers for GPT with adaptive hyperparameters and... |
|
Experimental |
| 4002 |
naveenkumar123/llm-training-project
Different LLM model usage projects |
|
Experimental |
| 4003 |
Jorffy/NoteMR
[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with... |
|
Experimental |
| 4004 |
visresearch/SDMPrune
The official implementation of "SDMPrune: Self-Distillation MLP Pruning for... |
|
Experimental |
| 4005 |
inuwamobarak/stable-diffusion
Implementing a diffusion framework with Hugging Face. Stable diffusion... |
|
Experimental |
| 4006 |
AIDC-AI/Wings
The code repository for "Wings: Learning Multimodal LLMs without Text-only... |
|
Experimental |
| 4007 |
okefemi12/ecommerce-recommender-system
Production-grade Hybrid Recommender System combining Sequential Transformers... |
|
Experimental |
| 4008 |
Kyle1668/LLM-TTA
Code for the paper: Improving Black-box Robustness with In-Context Rewriting |
|
Experimental |
| 4009 |
koc-lab/graph-teacher
The repository for the "GraphTeacher: Transductive Fine-Tuning of Encoders... |
|
Experimental |
| 4010 |
Mattbusel/LLMTokenStreamQuantEngine
A low-latency, C++-based simulation engine that ingests token streams from... |
|
Experimental |
| 4011 |
baohuyvanba/Vision-Zephyr
Vision-Zephyr: a multimodal LLM for Visual Commonsense Reasoning—CLIP-ViT +... |
|
Experimental |
| 4012 |
insoochung/transformer_bcq
BCQ tutorial for transformers |
|
Experimental |
| 4013 |
U4RASD/dalla-model-training
Dalla training recipe using Huggingface SFT trainer |
|
Experimental |
| 4014 |
iiis-ai/IterativeQuestionComposing
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing... |
|
Experimental |
| 4015 |
The-Swarm-Corporation/MoF
This work introduces Flow Matching Mixture of Experts (FM-MoE), a framework... |
|
Experimental |
| 4016 |
SatvikPraveen/PyTroch-Mastery-Hub
A comprehensive PyTorch portfolio demonstrating advanced ML implementations... |
|
Experimental |
| 4017 |
rdemarqui/llm_complaint_management
Multi-label Classification of Complaints with LLM |
|
Experimental |
| 4018 |
albertan017/LLM4Binary
Collect thoughts, suggestions and resources to build an LLM model for binary |
|
Experimental |
| 4019 |
vizzies/NASA-Semantic-Search-Engine-for-Scientific-Literature
Intelligent scientific literature search and analysis application that uses... |
|
Experimental |
| 4020 |
llap4585/T5-Refiner-DomainFocus
Derived from Medical Literature Development: Injecting domain expertise into... |
|
Experimental |
| 4021 |
ictnlp/SiLLM
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a... |
|
Experimental |
| 4022 |
sukrazit/nlp-playground-notebooks
🚀 Explore NLP with hands-on notebooks for RNNs and Transformers, featuring... |
|
Experimental |
| 4023 |
AlinaMustaqeem/open-LLM
Kickstart with LLMs |
|
Experimental |
| 4024 |
tushar2704/LLM_ChatBot_streamlit
Streamlit application called T-BOT, using HugChat |
|
Experimental |
| 4025 |
sovit-123/lm_sft
Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised... |
|
Experimental |
| 4026 |
piratheon/LB-llm_training_scripts
A bunch of script to train your own offsec LLM |
|
Experimental |
| 4027 |
AKC23/Harnessing-LLMs-over-transformer-models-for-detecting-Bengali-depressive-text-A-comprehensive-study
Harnessing large language models over transformer models for detecting... |
|
Experimental |
| 4028 |
kyegomez/ai-reading-list
This collection brings together the highest-signal research papers in modern... |
|
Experimental |
| 4029 |
NLPForUA/ZNO
Structured test tasks and model tuning scripts for multiple subjects from... |
|
Experimental |
| 4030 |
Brokttv/Transformers-on-Majority
An empirical study that builds on Merrill et al. theoretical construction to... |
|
Experimental |
| 4031 |
MuhammadTahaNasir/llm-learning-hub
A hands-on collection of practical notebooks for learning and building with... |
|
Experimental |
| 4032 |
NamrataThakur/Large_Language_Model_From_Scratch_Implementation
Implementing an LLM from scratch block-by-block using PyTorch |
|
Experimental |
| 4033 |
eduardoleao052/Transformer-from-scratch
Educational Transformer from scratch (no autograd), with forward and backprop. |
|
Experimental |
| 4034 |
seonglae/llama2gptq
Chat to LLaMa 2 that also provides responses with reference documents over... |
|
Experimental |
| 4035 |
piratheon/LiquidBunny-llm
A bunch of script to train your own offsec LLM |
|
Experimental |
| 4036 |
SwethaMagesh/sankshepika-mlpro
NLP based Legal document summariser. Takes large documents with complex... |
|
Experimental |
| 4037 |
AswaniSahoo/weather-transformer-scratch
Physics-aware Vision Transformer for weather forecasting built from scratch... |
|
Experimental |
| 4038 |
Anmol25/NewsDigest
An AI-powered full-stack news platform delivering personalized... |
|
Experimental |
| 4039 |
ossirytk/llm_resources
Information and resources on everything related about running large language... |
|
Experimental |
| 4040 |
shreydan/VisionGPT2
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model... |
|
Experimental |
| 4041 |
vitality-vis/vitality-vis.github.io
Promoting Serendipitous Discovery of Academic Literature with Transformers &... |
|
Experimental |
| 4042 |
Dangocan/comfyui_glm_ocr
ComfyUI custom node to run GLM-OCR locally — text, formula, and table... |
|
Experimental |
| 4043 |
cmavro/PackLLM
Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization |
|
Experimental |
| 4044 |
Letian2003/MM_INF
An efficient multi-modal instruction-following data synthesis tool and the... |
|
Experimental |
| 4045 |
elixpo/emoji_transnetv1
A Machine Learning Initiative Taken to fine tune MT5_SMALL to contextually... |
|
Experimental |
| 4046 |
shreyansh26/Red-Teaming-Language-Models-with-Language-Models
A re-implementation of the "Red Teaming Language Models with Language... |
|
Experimental |
| 4047 |
Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction... |
|
Experimental |
| 4048 |
sarnikowski/danish_transformers
A collection of Danish Transformers |
|
Experimental |
| 4049 |
InternLM/Visual-ERM
Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence" |
|
Experimental |
| 4050 |
lenticularis39/llama2.inferno
Inference Llama 2 in one file of pure Limbo |
|
Experimental |
| 4051 |
HKUSTDial/megatran
[VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with... |
|
Experimental |
| 4052 |
davor10105/relative-absolute-magnitude-propagation
Explain the outputs of your Vision Transformers, Residual Networks and... |
|
Experimental |
| 4053 |
juancmacias/Small_Lenguage_Model
Píldora formativa sobre SLM (Small Lenguage Model) |
|
Experimental |
| 4054 |
LuDanielPoyu/Cowrie-Log-Helper
Multi-task NLP on Cowrie honeypot attacker-session logs—classification, QA,... |
|
Experimental |
| 4055 |
research-outcome/llm-langchain-examples
Examples of llm apps developed using langchain opensource framework |
|
Experimental |
| 4056 |
tmro98/financial-news-classification
binary topic and 3-class sentiment classification of full-length news articles |
|
Experimental |
| 4057 |
PRITHIVSAKTHIUR/Florence-2-Image-Caption
This application utilizes the powerful Florence-2 vision-language model from... |
|
Experimental |
| 4058 |
jsuryanm/text-summarization-system
End-to-End text summarization system built with bart-base using HuggingFace... |
|
Experimental |
| 4059 |
zizisaigao/ePQA-NLP-modeling
This project builds nlp models on ePQA dataset. It covers LM trained for... |
|
Experimental |
| 4060 |
RhythrosaLabs/game-maker-public-dev
Game Maker is a Streamlit app that uses AI to accelerate game development by... |
|
Experimental |
| 4061 |
Scr44gr/elelems
A very, very, very simple SLM. I'm just learning. |
|
Experimental |
| 4062 |
ranzeet013/LLM_Notebooks
Exploring LLMs with interactive notebooks. |
|
Experimental |
| 4063 |
AI-14/micar-vl-moe
[IJCNN 2025] [Official code] - MicarVLMoE: A modern gated cross-aligned... |
|
Experimental |
| 4064 |
benisalla/Tiny-ViT-Transformer-from-scratch
This repository offers a straightforward implementation of Vision... |
|
Experimental |
| 4065 |
Rahulkumar010/microDPO
microDPO: A minimalist, pure PyTorch implementation of Direct Preference... |
|
Experimental |
| 4066 |
MU-Enigma/BotForge
Welcome to BotForge, an open-source project dedicated to advancing NLP-based... |
|
Experimental |
| 4067 |
subhasis-ai/Hindi-ASR-Wav2Vec2
This repository demonstrates development of Hindi ASR model using transformers. |
|
Experimental |
| 4068 |
FusionSid/Rick-AI
An AI chatbot made using DialoGPT in python | Join why discord to try it out: |
|
Experimental |
| 4069 |
5663015/LLMs_train
一套代码指令微调大模型 |
|
Experimental |
| 4070 |
anar-rzayev/Empathetic-Dialogue-Generation
Open-Domain Dialogue model which produces empathetic responses when trained... |
|
Experimental |
| 4071 |
jacksonchen1998/LLaMA-Paper-List
Collection of papers using LLaMA as backbone model |
|
Experimental |
| 4072 |
elip06/covid19-fact-checking
A fact-checking system of short to medium-sized documents on the topic of COVID-19 |
|
Experimental |
| 4073 |
yihedeng9/rlhf-summary-notes
A brief and partial summary of RLHF algorithms. |
|
Experimental |
| 4074 |
asigalov61/Incredible-MahlerNet
Absolutely fantastic and fully working SOTA Transformer-XL Music AI... |
|
Experimental |
| 4075 |
abhisheksingh-7/cotrend
Extending Decoders with an Integrated Encoder, as Part of Llama-3 Hackathon |
|
Experimental |
| 4076 |
Silvestre17/ChatMeter_FinalProject.inDataScience
💬 Capstone project for the Data Science bachelor's at ISCTE. "ChatMeter" is... |
|
Experimental |
| 4077 |
NeuralCoder3/custom_infinite_craft
A custom implementation of Infinite Craft (https://neal.fun/infinite-craft/) |
|
Experimental |
| 4078 |
jaepil/geometric-adam
A Ray Tracing-Inspired Approach to Neural Network Optimization |
|
Experimental |
| 4079 |
wambugu71/SmartAgriImage_classification_ViT
Vision Transformer trained with thousands of agricultural diseases in... |
|
Experimental |
| 4080 |
mattialoszach/LoRA-Agentic-Output-Format
Fine-tuning LLMs for structured agent-style outputs (e.g. JSON), built for... |
|
Experimental |
| 4081 |
Emart29/phi4-finance-finetuning
Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA... |
|
Experimental |
| 4082 |
EdVince/whisper-trtllm
Whisper in TensorRT-LLM |
|
Experimental |
| 4083 |
MakazhanAlpamys/Soup
Soup turns the pain of LLM fine-tuning into a simple workflow. One config,... |
|
Experimental |
| 4084 |
sahilichake/Document-Summarization-App-using-LLM
Document Summarization App using large language model (LLM) and Langchain... |
|
Experimental |
| 4085 |
himanshuvnm/Foundation-Model-Large-Language-Model-FM-LLM
This repository was commited under the action of executing important tasks... |
|
Experimental |
| 4086 |
SharathHebbar/Model-Sharding
Sharding Large Language Models for loading them efficiently in lesser RAM |
|
Experimental |
| 4087 |
therrshan/image-captioning
Comparitive analysis of image captioning model using RNN, BiLSTM and... |
|
Experimental |
| 4088 |
rese1f/STEVE
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in... |
|
Experimental |
| 4089 |
ZhiningLiu1998/SelfElicit
[ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the... |
|
Experimental |
| 4090 |
waqasm86/Ubuntu-Cuda-Llama.cpp-Executable
Pre-built llama.cpp CUDA binary for Ubuntu 22.04. No compilation required -... |
|
Experimental |
| 4091 |
taranjeet/llmformatter
Get deterministic output in any format like json from any LLM. |
|
Experimental |
| 4092 |
Chirayu-Tripathi/Paper-Implementations
My implementation of Machine Learning and Deep Learning papers from scratch. |
|
Experimental |
| 4093 |
mbeps/llama3.1_fine-tuning_mult-it
Fine-tuning various Llama 3.1 family of models on the Mult-It dataset |
|
Experimental |
| 4094 |
Pro-GenAI/ShortLang
Compressed Text for efficient LLMs |
|
Experimental |
| 4095 |
Arneunalarming861/Laminae
Bridge raw large language models to production-ready AI with a lightweight... |
|
Experimental |
| 4096 |
uhasker/large-language-models
Files for the book "Large Language Models" |
|
Experimental |
| 4097 |
mcd-unison/llm
Material sobre Grandes Modelos de Lenguajes (LLM) realizado en forma... |
|
Experimental |
| 4098 |
K2-BoundaryArchitect/Reflex-Motor-DB
A deterministic reflex memory layer for stabilizing LLM execution. |
|
Experimental |
| 4099 |
akunba3970/llm-cost-calculator
Estimate token usage and API costs for large language models to help... |
|
Experimental |
| 4100 |
YY0649/ICE-PIXIU
ICE-PIXIU:A Cross-Language Financial Megamodeling Framework |
|
Experimental |