All Transformer Models
7,795 models ranked by quality score · Page 61 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 6001 | Sarhamam/ZetaFormer<br>Curriculum learning framework that uses geometrically structured datasets... | | Experimental |
| 6002 | AndreaLolli2912/SemEval2026-EmoVA<br>SemEval-2026 Task 2: EmoVA. A Transformer-LSTM architecture with Set... | | Experimental |
| 6003 | yass-ML/slm-few-shot-optimization<br>An empirical investigation into optimizing few-shot prompting strategies for... | | Experimental |
| 6004 | shrutikakapade/Building-LLM-Pipelines-with-Hugging-Face-LangChain<br>An end-to-end guide to building robust LLM pipelines with Hugging Face and... | | Experimental |
| 6005 | Ruiyang-061X/Awesome-MLLM-Uncertainty<br>✨A curated list of papers on the uncertainty in multi-modal large language... | | Experimental |
| 6006 | viktor-shcherb/qk-sniffer<br>Capture sampled Q/K attention vectors from HF transformers into per-branch... | | Experimental |
| 6007 | Scottcjn/pse-vcipher-collapse<br>Non-bijunctive attention collapse for LLM inference — POWER8 hardware AES... | | Experimental |
| 6008 | chen-hao-chao/mdm-prime-v2<br>MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal... | | Experimental |
| 6009 | kevincojean/llama-vim-adapter<br>Extends the llama.vim plugin to enable LLM autocompletion from third party... | | Experimental |
| 6010 | yos-r/go_emotions<br>Multi-Label Emotion Classification from Text Using Deep Learning :... | | Experimental |
| 6011 | HTLinh0604/invoice_ocr_craft_llama3<br>This CRAFT + Llama 3.1 pipeline automates invoice semantic extraction,... | | Experimental |
| 6012 | GoJo-Rika/Text-Summarizer-Using-HuggingFace-Transformers<br>An end-to-end MLOps project for text summarization using the HuggingFace... | | Experimental |
| 6013 | Franekskc/gemma3-qa-finetuning<br>Comparing Full Fine-Tuning, LoRA, and Layer Freezing for extractive QA on... | | Experimental |
| 6014 | k-siddhartha-ai/multilingual-sentiment-analysis<br>Multilingual Sentiment Analysis using Hugging Face Transformers and Gradio | | Experimental |
| 6015 | dineshsoudagar/llm-lab-from-scratch-to-fine-tuning<br>Comprehensive resources and scripts for training and fine-tuning Large... | | Experimental |
| 6016 | tsvlgd/gpt-from-scratch<br>Decoder-only Transformer (GPT) language model coded from scratch in PyTorch | | Experimental |
| 6017 | FreezB11/PsyDuck<br>A 60M-parameter LLM from scratch | | Experimental |
| 6018 | yhinsson/airllm<br>🚀 Optimize memory for large language models, enabling 70B models on a 4GB... | | Experimental |
| 6019 | stefanpietrusky/IEC<br>Repository for the article in the online magazine Level Up Coding | | Experimental |
| 6020 | liyucheng09/llm-compressive<br>Longitudinal Evaluation of LLMs via Data Compression | | Experimental |
| 6021 | Asimo-o/blipren_release<br>🚀 Train any LLM with BLIPren, a flexible architecture that adapts to your... | | Experimental |
| 6022 | zchoi/Multi-Modal-Large-Language-Learning<br>Awesome multi-modal large language paper/project, collections of popular... | | Experimental |
| 6023 | sastpg/CoVo<br>Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for... | | Experimental |
| 6024 | germain-hug/NeurHal<br>Visual Correspondence Hallucination: Towards Geometric Reasoning (Under Review) | | Experimental |
| 6025 | GodreignElgin/llm-comparision<br>Jupyter Notebook for LLM compression via quantization (INT8, INT4, FP16) and... | | Experimental |
| 6026 | Samarth2001/LLM-Fine-tuning<br>Parameter-efficient fine-tuning experiments for 7B LLMs on consumer... | | Experimental |
| 6027 | Vujavujavuja/Vsearcher<br>A sequential Large Language Model (LLM) agent system designed for automated... | | Experimental |
| 6028 | tsinghua-fib-lab/PIGEON<br>[ACL 2025 Findings] Open-Set Living Need Prediction with Large Language Models | | Experimental |
| 6029 | Ankur-krGarg/ChatBot<br>Transformer-based chatbot demo using Hugging Face's conversational models | | Experimental |
| 6030 | sparkup/medical-llm-finetuning-alignment<br>Medical LLM fine-tuning and preference alignment using SFT and DPO, with... | | Experimental |
| 6031 | Keytoyze/JumpCoder<br>Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via... | | Experimental |
| 6032 | king/transformer-pooling<br>Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models | | Experimental |
| 6033 | samkibe/Basics-of-model-development-with-Lightning-PyTorch-<br>One of a kind, hectic | | Experimental |
| 6034 | Sammy-Lastre/BigChat<br>BigChat is a WinUI 3 chat application built for chatting with large language... | | Experimental |
| 6035 | E1ims/math-vlm-finetune-pipeline<br>📐 Transcribe handwritten math into accurate LaTeX using a modular... | | Experimental |
| 6036 | nikelborm/amd-amdgpu-rocm-ollama-gfx90c-ati-radeon-vega-ryzen7-5800H-arch-linux<br>Run Ollama on AMD Ryzen 7 5800H CPU with integrated GPU AMD ATI Radeon Vega... | | Experimental |
| 6037 | murapadev/Phinetuning<br>A repository dedicated to finetuning phi2 models using advanced machine... | | Experimental |
| 6038 | ensarakbas77/LIFT-UP-Project-Similarity-Analysis<br>A system that compares newly submitted projects with previously completed... | | Experimental |
| 6039 | aarnetalman/nli-with-transformers<br>Fine-tune transformers with NLI data | | Experimental |
| 6040 | dhruvjverma/NanoLanguageModel<br>A minimalist, high-performance GPT implementation in PyTorch, optimized for... | | Experimental |
| 6041 | PurCL/muke<br>[COLM 2025] Official implementation of μKE - edit LLM knowledge while... | | Experimental |
| 6042 | su-mana-s/Semantic-Communication<br>Semantic Message Extraction for Text Based Data With Deep Neural Nets | | Experimental |
| 6043 | duoan/ReplicateAI<br>Recreating every milestone in Machine Learning and Artificial Intelligence | | Experimental |
| 6044 | namjoo2006/Langchain-fundamental-in-model-component-access-data-using-api-keys<br>LangChain fundamentals for model components: learn to access language and... | | Experimental |
| 6045 | quocnhut134/Finetuning-LLM-Model-for-Intent-Classification-in-Banking<br>Fine-tuning Large Language Models (LLMs) for precise customer intent... | | Experimental |
| 6046 | SunayHegde2006/Air.rs<br>Air.rs 70B+ inference on consumer GPU, LLM inference in Rust | | Experimental |
| 6047 | duongkstn/durationqa-vlsp-solution<br>VLSP 2025 Vietnamese temporalQA - DurationQA. First Rank Solution. | | Experimental |
| 6048 | Nazmul0005/Nazmul0005<br>AI/ML Engineer \| Published Researcher (MDPI 2024) \| Building intelligent... | | Experimental |
| 6049 | d-senyaka/letter-forge<br>From-scratch Transformer implementation for character-level understanding... | | Experimental |
| 6050 | North-Shore-AI/crucible_ensemble<br>Multi-model ensemble voting strategies for LLM reliability | | Experimental |
| 6051 | theSohamTUmbare/CLIP-model<br>Reimplementation of the CLIP model | | Experimental |
| 6052 | amoghj98/neuroLIFT<br>This repository contains code associated with Neuro-LIFT: A Neuromorphic,... | | Experimental |
| 6053 | leonhard-leung/IlokoFusionMT<br>Bidirectional Iloko ↔ English neural machine translation system using a T5... | | Experimental |
| 6054 | shalakapadalkar16/viral-genome-classifier<br>Production-ready ML pipeline for viral genome classification from NCBI... | | Experimental |
| 6055 | edersoncorbari/fine-tune-llm<br>Demonstrate how to fine-tune a pre-trained LLM | | Experimental |
| 6056 | GabMartino/TransformerForDummies<br>Annotated implementation of vanilla Transformers to guide through all the... | | Experimental |
| 6057 | david-xander/measuring-llm-knowledge<br>How much does an LLM know about my programming language? | | Experimental |
| 6058 | zjysteven/Awesome-Byte-LLM<br>A curated list of papers and resources on byte-based large language models... | | Experimental |
| 6059 | AlexeyMalafeev/ruformers<br>"Ruformers": a list of popular transformer-based base models for... | | Experimental |
| 6060 | JoyousJohn/deeply-researched<br>Open-source clone of OpenAI's Deep Research. Works with any transformer,... | | Experimental |
| 6061 | Ultron09/Numpy-Transformer<br>A pure NumPy implementation of GPT built from scratch for educational... | | Experimental |
| 6062 | VincenzoManto/llmtrim<br>A library for trimming tokens in encoding and decoding in LLM (Large... | | Experimental |
| 6063 | 1337hero/rx7900xtx-llama-bench-rocm<br>Benchmark script for llama.cpp & results for AMD RX 7900 XTX | | Experimental |
| 6064 | lennor-tan/openrouter-free-model<br>🌐 Explore and manage free models on OpenRouter effortlessly with our web... | | Experimental |
| 6065 | Thopterek/ChessBenchmark<br>Aleph Alpha and LEVEL3, LLM benchmark | | Experimental |
| 6066 | SyedAkramaIrshad/transformer-grokking-lab<br>Tiny Transformer grokking experiment with live notebook visualizations. | | Experimental |
| 6067 | caiomadeira/llama2-psp<br>Llama 2 inference in C on the PlayStation Portable (PSP). | | Experimental |
| 6068 | harpertoken/memoraxx<br>LLaMA-style models with memory persistence. | | Experimental |
| 6069 | spignelon/TrustLink_CyberHackathon<br>TrustLink: Detect and safeguard against deceptive URLs. Real-time threat... | | Experimental |
| 6070 | dineshkgn/deep-learning-lab<br>Reproducible deep learning experiments: tabular transformers, optimization,... | | Experimental |
| 6071 | DolbyUUU/DeepEnlighten<br>Pure RL to post-train base models for social reasoning capabilities.... | | Experimental |
| 6072 | ShraddhaSharma24/Natural-Language-Processing<br>A comprehensive NLP repository covering fundamentals, preprocessing,... | | Experimental |
| 6073 | 1337hero/rx7900xtx-llama-bench-vulcan<br>Benchmark script for llama.cpp & results for AMD RX 7900 XTX - using Vulkan | | Experimental |
| 6074 | Reason-Wang/NAT<br>[NAACL 2025] The official implementation of paper "Learning From Failure:... | | Experimental |
| 6075 | nsarrazin/chessformer<br>Experiments in chess & transformers | | Experimental |
| 6076 | SafeRL-Lab/TeaMs-RL<br>[TMLR] TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via... | | Experimental |
| 6077 | ainize-team/free-llama-api<br>Run Meta Llama 3.2 API without your GPU for free. We always support the latest model 🧡 | | Experimental |
| 6078 | Yousifus/rlhf_loop_humain<br>RLHF Loop System - Learning project with monitoring dashboard, drift... | | Experimental |
| 6079 | viktor-shcherb/qk-pca-analysis<br>PCA analysis of Q/K attention vectors to discover position-correlated... | | Experimental |
| 6080 | lakshayGoyal1188/text_to_sql<br>A schema-aware Text-to-SQL system using a locally hosted Mistral LLM... | | Experimental |
| 6081 | spatialft/spatialft.github.io<br>LoRA fine-tuning of LFM2.5-1.2B to improve spatial reasoning on StepGame —... | | Experimental |
| 6082 | krishnakoushik225/ecg-peft-benchmark<br>Benchmarking PEFT (LoRA vs adapters) for ECG segment classification using... | | Experimental |
| 6083 | yamanobora/Android-Offline-Meeting-Recorder<br>Android app for offline speech recognition and AI meeting summarization... | | Experimental |
| 6084 | Sachin-0001/ChatCut<br>ChatCut is a text summarizing tool built on Bidirectional Auto Regressive... | | Experimental |
| 6085 | xkiwilabs/llm-inference-hub<br>A reproducible LLM inference stack built on vLLM + LiteLLM, designed for... | | Experimental |
| 6086 | FromZeroToFanatic/LLM_Practical_Implementation_Demo1<br>Hands-on LLM learning roadmap, stage 1: overview of large-model technology (essential fundamentals) and practice | | Experimental |
| 6087 | MuthusaravananS/PINPOINT<br>Pipeline for discovering novel protease inhibitors at the plant-pathogen interface. | | Experimental |
| 6088 | Kaden-Schutt/hipfire<br>RDNA-native LLM inference engine in Rust. 59 tok/s Qwen3-8B on RX 5700 XT —... | | Experimental |
| 6089 | Shreya831/multimodal-ai-visual-analyzer<br>Multimodal AI system that detects objects in images and answers questions... | | Experimental |
| 6090 | gatorduck/Creating_Custom_Decoder_Transformer<br>Custom decoder Transformer that treats a patient's medical journey like a... | | Experimental |
| 6091 | NKU-MetautoAI/awesome-large-vision-language-models<br>Advances in recent large vision language models (LVLMs) | | Experimental |
| 6092 | fake-it0628/jailbreak-defense<br>Jailbreak Defense System based on Hidden State Causal Monitoring for LLMs | | Experimental |
| 6093 | fajieyuan/recommendation_transfer_learning_pretraining<br>Pre-training and Transfer learning papers for recommendation | | Experimental |
| 6094 | vishvaRam/Data-Prep-for-LLM-fine-tuning<br>This repository helps prepare datasets for fine-tuning Large Language Models... | | Experimental |
| 6095 | rajatady/Inference-Stack<br>Production-grade LLM inference API built from scratch. NestJS gateway +... | | Experimental |
| 6096 | Gaolingx/llama.cpp-Launcher<br>Run llama.cpp quickly and conveniently. | | Experimental |
| 6097 | YahiaGrdh/vibe-agents<br>Coordinate AI agents to break down tasks, plan workflows, and delegate... | | Experimental |
| 6098 | PratapShashwat/End-to-End-LLM-Fine-Tuning<br>Train Gemma to summarize documents. | | Experimental |
| 6099 | Baci-Ak/book-recommender<br>LLM - Book Recommendation system with LLM | | Experimental |
| 6100 | btboilerplate/sms-spam-classification-transformer<br>SMS spam classification using a Transformer-based model built with... | | Experimental |