All Transformer Models
7,795 models ranked by quality score · Page 43 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 4201 | NJUxlj/llm-hub: Popular Large Language Model's modeling file and finetune+pretrain scripts,... | | Experimental |
| 4202 | OMI-KALIX/Multi-Agent-AI-Workflow-for-Content-Creation: A fully automated multi-agent AI system that creates LinkedIn content end to... | | Experimental |
| 4203 | pkdubey/content_moderation: An AI-powered content moderation system using Python and Hugging Face... | | Experimental |
| 4204 | rodneylab/local-ai-llm-playground: Experiments running offline LLMs in Python and Rust locally using Ollama and... | | Experimental |
| 4205 | fattorib/Little-GPT: GPT* - Training faster small transformers using ALiBi, Parallel Residual... | | Experimental |
| 4206 | RichardHam-co-uk/ProjectLodestar: AI development environment with 90% cost savings. Routes between 8 LLM... | | Experimental |
| 4207 | Vincentiv/BERT_Finetuning_from_scratch: Notebook on finetuning BERT | | Experimental |
| 4208 | SAP-samples/acl2025-contrastive-perplexity: This repository contains the source code of the ACL'25 paper "Contrastive... | | Experimental |
| 4209 | wklee610/VLM-Model-fastapi: A reusable FastAPI module for serving and integrating Vision-Language Models (VLM) | | Experimental |
| 4210 | amazon-science/TSFM-Compression: Official Implementation of Understanding Transformers for Time Series: Rank... | | Experimental |
| 4211 | DanMeon/xlstruct: LLM-powered Excel parser — define a Pydantic schema, get structured data... | | Experimental |
| 4212 | theanasuddin/Advanced-Deep-Learning: Computer exercises for Advanced Deep Learning. Includes implementations of... | | Experimental |
| 4213 | RahulSChand/gpt2_squad: GPT2 training on squad dataset | | Experimental |
| 4214 | FuxiaoLiu/DocumentCLIP: [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents | | Experimental |
| 4215 | ihuzaifashoukat/ml-mastery-path: Advanced Machine Learning and LLM training implementations. A comprehensive... | | Experimental |
| 4216 | awsaf49/detect-fake-text: LLM - Detect AI Generated Text \|\| Identify which essay was written by a... | | Experimental |
| 4217 | wesleyscholl/squish: 🤖🗜️⚡️ Compress local LLMs once, run them forever at sub-second load times.... | | Experimental |
| 4218 | uakarsh/TiLT-Implementation: Implementation of the paper: Going Full-TILT Boogie on Document... | | Experimental |
| 4219 | mytechnotalent/mechanistic_interpretability: Mechanistic Interpretability (MI) is a subfield of AI alignment and safety... | | Experimental |
| 4220 | BlackRoad-AI/blackroad-llm-fine-tuner: blackroad llm fine tuner — Part of the BlackRoad OS ecosystem. Sovereign... | | Experimental |
| 4221 | NAME0x0/OMNI: PERSPECTIVE v2 — A 1.05 trillion parameter sparse Mixture-of-Experts... | | Experimental |
| 4222 | qora-protocol/QORA-LLM-3B: Pure Rust inference engine for the SmolLM3-3B language model. No Python... | | Experimental |
| 4223 | kyegomez/TinyGPTV: Simple Implementation of TinyGPTV in super simple Zeta lego blocks | | Experimental |
| 4224 | gurpejsingh13/punjabi-gpt-scratch-20m: Developed and pre-trained a 20.39M-parameter Punjabi GPT-style base model... | | Experimental |
| 4225 | devbm7/QGen: Question Generator System | | Experimental |
| 4226 | symanto-research/merge-tokenizers: Package to align tokens from different tokenizations. | | Experimental |
| 4227 | HySonLab/Protein_Pretrain: Multimodal Pretraining for Unsupervised Protein Representation Learning | | Experimental |
| 4228 | AswaniSahoo/llama-task-agent: Fine-tuned LLaMA-3.1-8B task agent with LoRA for reliable tool execution | | Experimental |
| 4229 | GJ98/Megatron-LM: Megatron-LM implemented by PyTorch | | Experimental |
| 4230 | kunjmehta/cross-modal-retrieval-food-ai: Course project for 198:536 at Rutgers University. The project is about... | | Experimental |
| 4231 | hazdzz/converter: The official PyTorch implementation of Converter. | | Experimental |
| 4232 | abhilashpuli98/Deep-Learning-Paper-Implementations: A collection of paper implementations using the PyTorch framework | | Experimental |
| 4233 | mgokulkrish/LENR.ai: Github Repo for Recommendation System using LLMs. | | Experimental |
| 4234 | mrtrizer/UnityLlamaCpp: Llama.cpp in Unity, straightforward and clean | | Experimental |
| 4235 | liyaooi/TAMO: TAMO: reimagine Table representation as an independent Modality for LLMs | | Experimental |
| 4236 | jawline/Synthic: Automatically generate gameboy music using machine learning | | Experimental |
| 4237 | sairam-s0/local_ai_automation: This project automates question solving using AI and OCR. Instead of... | | Experimental |
| 4238 | amajji/LLM-Quantization-Techniques-Absmax-Zeropoint-GPTQ-GGUF: LLM quantization techniques: absmax, zero-point, GPTQ and GGUF | | Experimental |
| 4239 | NamelyCorp/NamelyCorp-LLM-Studio: Local-first LoRA fine-tuning studio with web UI for document-grounded LLM training. | | Experimental |
| 4240 | BlackRoad-OS/Modelfile: BlackRoad OS Ollama model definitions and custom models | | Experimental |
| 4241 | PRITHIVSAKTHIUR/Molmo2-HF-Demo: A Gradio-based demonstration for the AllenAI Molmo2-8B multimodal model,... | | Experimental |
| 4242 | Nilanshrajput/Intent_classification: Intent Classification with Hugging Face, MLflow experiment tracking,... | | Experimental |
| 4243 | wanglne/DELMAN: [ACL 2025 Findings] DELMAN: Dynamic Defense Against Large Language Model... | | Experimental |
| 4244 | agentdr1/LA_MIL: Implementation of LA_MIL, Local Attention Graph-based Transformer for WSIs, PyTorch | | Experimental |
| 4245 | DNGros/lmwrapper_OLD: An object-oriented wrapper around language models. Moved to... | | Experimental |
| 4246 | D0men1c0/Benchmark-Gemma-Models: Highly customizable Python suite for LLM evaluation (Gemma, LLaMA+). Full... | | Experimental |
| 4247 | parham1998/Enhancing-High-Vocabulary-IA-with-a-Novel-Attention-Based-Pooling: Official Pytorch Implementation of: "Enhancing High-Vocabulary Image... | | Experimental |
| 4248 | IAAR-Shanghai/FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language... | | Experimental |
| 4249 | hjshah142/BERT-Fine-Tuning-Software-Requirements-Classification: Fine-tuning a pre-trained model using the Transformers library (Bert) on... | | Experimental |
| 4250 | ccs96307/fast-llm-inference: Accelerating LLM inference with techniques like speculative decoding,... | | Experimental |
| 4251 | mohamedshameem-dev/Review_Classification_Engine: Batch-optimized LLM-based automated customer review classification and... | | Experimental |
| 4252 | papachristoumarios/llm-network-formation: Supplementary Code and Data for "Network Formation and Dynamics among Multi-LLMs" | | Experimental |
| 4253 | pecharesjoselito/chuck.optimizer: Optimize neural network training by monitoring loss, gradients, and... | | Experimental |
| 4254 | llap4585/T5-Refiner-DomainFocus-TrainOnly: This project provides code for fine-tuning T5/mT5 models on data... | | Experimental |
| 4255 | aimagelab/JARVIS: Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large... | | Experimental |
| 4256 | jrajath94/jax-transformer-impl: JAX/XLA Transformer with MHA, MQA, GQA (Ainslie et al. 2023) — JIT, vmap, pmap | | Experimental |
| 4257 | aliuyar1234/proberoute: Research code for ProbeRoute, a probe-initialized sparse routing method for... | | Experimental |
| 4258 | aswinvinodd/emotion-detection-system: AI-based Emotion Detection and Sentiment Analysis System using NLP and Streamlit | | Experimental |
| 4259 | timteh/timteh-forge: ⚡ TIMTEH Model Forge — Uncensored, abliterated & reasoning-distilled GGUFs.... | | Experimental |
| 4260 | mamounyosef/commit-message-llm: Fine-tuning Qwen2.5-Coder-0.5B LLM using QLoRA (4-bit quantization + LoRA)... | | Experimental |
| 4261 | JacobJ215/Sentiment-Analysis-with-DistilBERT: Here we leverage a subset of the amazon_polarity dataset to train two... | | Experimental |
| 4262 | mcbal/afem: Implementation of approximate free-energy minimization in PyTorch | | Experimental |
| 4263 | machinelearningzuu/LLM-in-Production: Welcome to the "LLM in Production" repository! This project aims to provide... | | Experimental |
| 4264 | SharathHebbar/sft_mathgpt2: Supervised Fine tuning using TRL library | | Experimental |
| 4265 | utkukose/llm_persona_hallucination_study: Code for the study on persona vectors in controlling / understanding... | | Experimental |
| 4266 | Trustworthy-ML-Lab/Efficient-LLM-automated-interpretability: [NeurIPS'23 ATTRIB] An efficient framework to generate neuron explanations for LLMs | | Experimental |
| 4267 | Mukuta-Manit-D/AI-Mirror: AI Mirror is a smart, interactive web application that detects human... | | Experimental |
| 4268 | 0606zt/PanoLlama: [ICCV 2025 Highlight] Panorama Generation as a Next-Token Prediction Task. | | Experimental |
| 4269 | or4k2l/enhanced-audio-anomaly-detection: Hybrid ensemble (AST + classical) for industrial anomaly detection. Pump:... | | Experimental |
| 4270 | ArshockAbedan/Natural-Language-Processing-with-Attention-Models: Attention Models in NLP | | Experimental |
| 4271 | EN10/BabyLlama: Train and run a small Llama 2 model from scratch on the TinyStories dataset. | | Experimental |
| 4272 | dunktra/attention-binding-a11y: Code for tracking concept emergence via attention-head binding (EB*). Pythia... | | Experimental |
| 4273 | mbeps/qwen3-italic-benchmark: Benchmarking Qwen3 models of various sizes on the ITALIC benchmark to evaluate... | | Experimental |
| 4274 | termehtaheri/SAR-LM: Official implementation of “SAR-LM: Symbolic Audio Reasoning with Large... | | Experimental |
| 4275 | anyantudre/NLP-Course-Hugging-Face: This course will teach you about Natural Language Processing (NLP) using... | | Experimental |
| 4276 | Vext-Labs-Inc/vext-pentest-7b: Open-source 7B language model for autonomous penetration testing — parses... | | Experimental |
| 4277 | xHarshit/Self-Healing-Classification-DAG-with-Fine-Tuned-Model: A self-healing text classification pipeline built with LangGraph and a... | | Experimental |
| 4278 | HishamAlyahya/PyLLM: Leverage Large Language Models to generate and execute code dynamically... | | Experimental |
| 4279 | Keyvanhardani/kvcache-autotune: Automatic KV-Cache optimization for HuggingFace Transformers. Find the... | | Experimental |
| 4280 | LinukPerera/Physics-Constrained-Transformer-for-Cyclone-Trajectory-and-Damage-Prediction: This framework fuses satellite imagery, atmospheric data, and terrain... | | Experimental |
| 4281 | kyegomez/open_qwen: A non-official implementation of Qwen 3.5, as there doesn’t seem to be a... | | Experimental |
| 4282 | hereandnowai/transformers-simplified: Simplified, standalone Python scripts for transformer models, LLMs, TTS,... | | Experimental |
| 4283 | rohanmistry231/Transformers-Hugging-Face-Interview-Preparation: A curated resource for mastering Transformers and Hugging Face libraries,... | | Experimental |
| 4284 | M4T1SS3/DeltaLoop: Continuous fine-tuning layer that converts AI agent logs into LoRA adapters. | | Experimental |
| 4285 | deepagency/llm-resource-planner: A simple CLI tool to fetch Hugging Face model metadata and estimate required... | | Experimental |
| 4286 | omerfarooq223/AutoGrader-Agent: AI agent that grades student assignments from a ZIP file using LLMs —... | | Experimental |
| 4287 | tegridydev/mechamap: MechaMap - Toolkit for Mechanistic Interpretability (MI) Research | | Experimental |
| 4288 | orionw/MTLvsIFT: Code for the paper "When to Use Multi-Task Learning vs Intermediate... | | Experimental |
| 4289 | nphdang/Pred-LLM: Generating tabular data via Large Language Models (LLMs) | | Experimental |
| 4290 | himanshu231204/hk-devbrain: HK-DevBrain is a lightweight AI developer assistant built on Llama 3.2 (3B)... | | Experimental |
| 4291 | rmovva/LLM-publication-patterns-public: [NAACL 2024] Topics, Authors, and Institutions in Large Language Model... | | Experimental |
| 4292 | idiap/HMMGradients.jl: Enables computing the gradient of the parameters of Hidden Markov Models (HMMs) | | Experimental |
| 4293 | HectorPulido/discord-bot-LLama: It's a chatbot made with Python that simulates natural conversation with... | | Experimental |
| 4294 | AntonioVFranco/elamonica: Production-ready test-time compute optimization framework for LLM inference.... | | Experimental |
| 4295 | avijit-thawani/Augmented-LMs: Living Survey of Augmented LMs | | Experimental |
| 4296 | svn05/vietnamese-nmt: Vietnamese-English-Japanese NMT with fine-tuned NLLB-200, beam search, and... | | Experimental |
| 4297 | zalkklop/LVSM: Official code for "LVSM: A Large View Synthesis Model with Minimal 3D... | | Experimental |
| 4298 | SolomonB14D3/intelligent-svd: Knowledge-preserving SVD compression for large language models via... | | Experimental |
| 4299 | pinkbanty5707/GEO-AI-Woo: Optimize WooCommerce sites for AI search engines by generating llms.txt,... | | Experimental |
| 4300 | reyrove/Sparrow-Hawk-CodeArtGenerator: A sassy, neon-drenched AI copilot for chaotic creators—built with Groq +... | | Experimental |