All Transformer Models
7,795 models ranked by quality score · Page 62 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 6101 |
AmitAK1/Neural-Vulnerability-Scanner
CodeBERT + LoRA fine-tuning for C/C++ vulnerability detection | F1 = 74.3% |... |
|
Experimental |
| 6102 |
AddictivelyRecursive/lightweight-multimodal-transformer-pipeline
Lightweight multimodal transformer pipeline comparing MobileViT and... |
|
Experimental |
| 6103 |
mtichikawa/llm-bias-detection
Research project detecting and quantifying demographic bias in language models |
|
Experimental |
| 6104 |
igna-s/QLoRA-Experiments
A collection of SFT and distillation pipelines to train specialized medical... |
|
Experimental |
| 6105 |
adityonugrohoid/vllm-explorer
Probes and catalogs the full vLLM server API — endpoint reference, model... |
|
Experimental |
| 6106 |
DzmitryPihulski/Encoder-transformer-from-scratch
Fully functional encoder transformer from tokenizer to lm-head |
|
Experimental |
| 6107 |
anviit/llm-inference-serving
Production LLM inference stack — 28ms TTFT, 39 tok/s, 81% cache hit rate on a 6GB GPU |
|
Experimental |
| 6108 |
benjaminr/gendantic
Generate synthetic data using Pydantic Models and LLMs |
|
Experimental |
| 6109 |
Hidde-Heijnen/audioGPT
Cambridge Master Thesis: Audio Interpretable Transformers |
|
Experimental |
| 6110 |
abdulrahimzulfiqar/QueryMate
A portable, offline-first CLI AI assistant built with C++ and llama.cpp.... |
|
Experimental |
| 6111 |
philippnormann/malicious-payload-detection
🕵️♂️ ML project to identify malicious web payloads, aimed at boosting the... |
|
Experimental |
| 6112 |
taishan1994/Classification-Factory
以极简的代码实现文本分类,包括二分类、多分类、多标签分类。 |
|
Experimental |
| 6113 |
HubertKasperek/ai-companion-py
Python bindings for ai-companion (only backend, without WebUI) |
|
Experimental |
| 6114 |
andreaps96/prompt-injection-classifier
Fine-tuned mDeBERTa-v3 binary classifier for prompt injection detection on... |
|
Experimental |
| 6115 |
ztjhz/miniLM
Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project |
|
Experimental |
| 6116 |
jshwanth/image-captioning
Error-centric comparison of CNN-LSTM, attention-based, and transformer... |
|
Experimental |
| 6117 |
lciric/gptq-from-scratch
GPTQ post-training quantization from scratch — GPT-2, OPT, LLaMA support |
|
Experimental |
| 6118 |
EM7m4/Distill-R1
Combine reinforcement learning with online teacher-student distillation to... |
|
Experimental |
| 6119 |
RManLuo/llm-facteval
Source code of paper "Systematic Assessment of Factual Knowledge in Large... |
|
Experimental |
| 6120 |
rajkundalia/error-analyzer-with-baml
Analyze Java compilation and runtime errors using BAML with a local Ollama... |
|
Experimental |
| 6121 |
Gholamrezadar/finetuning_llm_on_letter_counting
Fine-tuning Gemma-3 4B on the letter-counting dataset |
|
Experimental |
| 6122 |
ghostperpper007/small_programming_model
A from-scratch Python code model with GNN-based structure encoding,... |
|
Experimental |
| 6123 |
AmirK-S/llm-to-production-python
Formation complete LangChain/LangGraph en Python - du LLM basique aux agents avances |
|
Experimental |
| 6124 |
soy-tuber/localllama-insights
Technical insights from r/LocalLLaMA — vLLM, FP8, NVFP4, Blackwell GPU... |
|
Experimental |
| 6125 |
egeozsoy/ORacle
Official code of the paper ORacle: Large Vision-Language Models for... |
|
Experimental |
| 6126 |
shrutikakapade/Designing-Structured-Outputs-for-LLMs-TypedDict-Pydantic-Output-Parsers-with-LangChain
Design robust structured outputs for LLM applications. Learn how to enforce... |
|
Experimental |
| 6127 |
anna-kay/Reddit-summarization
Abstractive summarization of Reddit datasets with Transformers. |
|
Experimental |
| 6128 |
bywords/directed_sentiment_analysis
Dataset and code for directed sentiment analysis in news text. |
|
Experimental |
| 6129 |
TJ-Solergibert/transformers-in-supercomputers
Transformers training in a supercomputer with the 🤗 Stack and Slurm |
|
Experimental |
| 6130 |
nv78/Benchmarking-Question-Answering
Research Code for Benchmarking Question Answering Models Paper |
|
Experimental |
| 6131 |
sacredvoid/alignrl
LLM post-training playbook: SFT, GRPO, DPO, eval, and inference. pip install alignrl |
|
Experimental |
| 6132 |
zpqiu/rl-infra-notes
Personal notes about RL Infra/Algorithm and LLMs |
|
Experimental |
| 6133 |
liziniu/cold_start_rl
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs? |
|
Experimental |
| 6134 |
its-not-rocket-science/mnemosyne
An autonomous, distributed knowledge discovery agent combining LLMs and... |
|
Experimental |
| 6135 |
martainntwiglike586/tinyforge
Train a 0.8B parameter model to improve itself on any task with tests, using... |
|
Experimental |
| 6136 |
ikun-llm/ikun-Distill
知识蒸馏 | Knowledge Distillation from teacher model 🎓 |
|
Experimental |
| 6137 |
huysuy05/LLM-Bias-Benchmark
An analysis of different LLMs' performance on balanced, imbalanced data with... |
|
Experimental |
| 6138 |
iakashpaul/Portal
Android app for the Hole in your Palm project, making LLMs accessible on-device! |
|
Experimental |
| 6139 |
YounesBensafia/Algeria-2-0-FineTuning-workshop
This repository contains resources and examples used in my workshop for... |
|
Experimental |
| 6140 |
Pects1949/LLM-Fine-tuning-Toolkit
A comprehensive toolkit for fine-tuning and deploying Large Language Models... |
|
Experimental |
| 6141 |
Witurpred64/LLM-FineTuning-Toolkit
A comprehensive toolkit for fine-tuning Large Language Models (LLMs) with... |
|
Experimental |
| 6142 |
buhsnn/eli5-gpt2-language-model
Decoder-only Transformer (GPT-2 style) trained from scratch on the ELI5... |
|
Experimental |
| 6143 |
ZZZ150751/cs336_spring2025_assignment1
Implementation of a Decoder-only Transformer language model from scratch for... |
|
Experimental |
| 6144 |
di37/full-fine-tuning-nvidia-question-and-answering
Flan-t5-base model was fine-tuned on Nvidia Question and Answer Pair Dataset... |
|
Experimental |
| 6145 |
miga1999/AirClaw
Run OpenClaw locally on any GPU or CPU without API costs, supporting large... |
|
Experimental |
| 6146 |
Atsusheeesh/vllm-daily
📊 Summarize merged PRs daily with vLLM, ensuring you stay updated on key... |
|
Experimental |
| 6147 |
YUECHE77/EE641-Final-Project
Deprecated |
|
Experimental |
| 6148 |
botosadam/matryoshka
🚀 Build Ruby gems that utilize Rust for enhanced performance through two... |
|
Experimental |
| 6149 |
gayathrikumari/Enterprise-Grade-LLM-Optimization-Alignment-PEFT-RLHF-
End-to-end LLM optimization pipeline covering PEFT (LoRA), RLHF (PPO),... |
|
Experimental |
| 6150 |
eren23/blipren_release
BLIP-2 implementation for training vision-language models. Q-Former + frozen... |
|
Experimental |
| 6151 |
dakshjain-1616/nemotron3-super-vs-gpt5.4-nano
Head-to-head benchmark comparing Nemotron and GPT-5.4-nano on code generation tasks |
|
Experimental |
| 6152 |
ScalingOpt/SGG
[ACL 2025 Main] Taming LLMs by Scaling Learning Rates with Gradient Grouping |
|
Experimental |
| 6153 |
Pyrolignic-paydirt84/pse-vcipher-collapse
Accelerate LLM inference by collapsing attention paths with... |
|
Experimental |
| 6154 |
serhiismetanskyi/llm-output-evaluation-with-deepeval
DeepEval LLM quality evaluation tests with LLM-as-a-judge |
|
Experimental |
| 6155 |
yilin-geng/llm-instruction-conflicts
This repository contains the data and the code for the paper "Control... |
|
Experimental |
| 6156 |
FawwazAhmd/msc-group-project
MSc group project evaluating instruction-tuned LLMs for legal clause... |
|
Experimental |
| 6157 |
Prajit-Rahul/Transformer-Based-Multi-Modal-Stock-Market-Prediction
Multi-modal stock forecasting with transformers, sentiment signals, and... |
|
Experimental |
| 6158 |
Thableaddles57/neural-nexus
A modular framework for distributed LLM training and fine-tuning across... |
|
Experimental |
| 6159 |
mpuodziukas-labs/llm-cobol-benchmark
Systematic benchmark: top LLMs produce broken COBOL. 5 programs, 3 models,... |
|
Experimental |
| 6160 |
SolomonB14D3/confidence-cartography
Teacher-forced confidence as a false-belief sensor for language models. |
|
Experimental |
| 6161 |
webml-community/TranslateGemma-WebGPU
TranslateGemma is a family of translation models from Google, built on top... |
|
Experimental |
| 6162 |
pranavgoel29/comparinator
Browser-local image region comparator using Transformers.js + pixel... |
|
Experimental |
| 6163 |
aims-foundations/reeval
Reliable and Efficient Model-based Generative Model Evaluation |
|
Experimental |
| 6164 |
rick97julho/do-i-have-the-vram
🔍 Estimate your VRAM needs for Hugging Face models in seconds without... |
|
Experimental |
| 6165 |
Almas-ansari/Transliteration-Hindi-To-English
A neural machine transliteration system that converts words written in... |
|
Experimental |
| 6166 |
chizkidd/bert-masked-attention-visualizer
Visualizing and analyzing BERT self-attention heads during masked language modeling. |
|
Experimental |
| 6167 |
Isha1600/LLM-Finetuning
Fine-tuning Large Language Models (LLMs) using custom datasets for improved... |
|
Experimental |
| 6168 |
driellecristine/BERT-Contrastive-LoRA
Enhance BERT fine-tuning for intent classification using supervised... |
|
Experimental |
| 6169 |
codershiyar/llama-google-colab-tutorial
Step-by-step tutorial on loading and using Llama 3.1 8B Instruct in Google... |
|
Experimental |
| 6170 |
Prelf1992/LLM-Orchestration-Framework
A robust framework for building, deploying, and managing large language... |
|
Experimental |
| 6171 |
Josh396s/BERT-Contrastive-LoRA
Optimizing BERT for intent classification on the Amazon Massive dataset... |
|
Experimental |
| 6172 |
Yahy5715/jailbreak-defense
Detect and prevent large language model jailbreaks using hidden state causal... |
|
Experimental |
| 6173 |
derrickrajkumar10/GameOfThrones_AI_Analayser
AI/NLP dashboard that analyses Game of Thrones episodes and predicts... |
|
Experimental |
| 6174 |
Allen0307/AdapterBias
Code for the Findings of NAACL 2022(Long Paper): AdapterBias:... |
|
Experimental |
| 6175 |
tk-yasuno/feature_tsfm_hybrid_gbdt
HVAC Range Deviation Forecast - v2.0 (Hybrid Model) Granite TS Embeddings +... |
|
Experimental |
| 6176 |
samx77710/brrr
🌬️ Streamline your coding workflow with brrr, a tool that enhances... |
|
Experimental |
| 6177 |
GhostGuyyt/llm-cpp
Provide lightweight, zero-dependency C++17 single-header libraries to... |
|
Experimental |
| 6178 |
aakarsh31/qlora-llm-finetuning
QLoRA fine-tuning of Llama 3.2 3B on MedQA with full LoRA rank ablation... |
|
Experimental |
| 6179 |
ajheshbasnet/model-reincarnated
a collection of re-implementations of renowned artificial intelligence... |
|
Experimental |
| 6180 |
macespinoza/mini-transformer-didactico
Implementación didáctica de un Transformer Encoder–Decoder basada en... |
|
Experimental |
| 6181 |
mohitdixit02/webpage-chatbot
A Chrome extension chatbot that answers user queries based on the content of... |
|
Experimental |
| 6182 |
anastber/youtube-sentiment-analysis
AI-powered YouTube comment sentiment analyzer built with Streamlit. ... |
|
Experimental |
| 6183 |
cnygaard/FractalHTransformer
Fractal Hierarchical Transformer: multi-resolution causal attention patterns... |
|
Experimental |
| 6184 |
lxntung95/lyricloop-llm
LyricLoop bridges the gap between semantic LLM text and professional musical... |
|
Experimental |
| 6185 |
olafurjohannsson/models
Transformer Models |
|
Experimental |
| 6186 |
ahmad-albasha/Frankenstein-LLM-Model-fine-tuning-code
Fine-tuning Mistral-7B-v0.1 on Mary Shelley's Frankenstein using LoRA/QLoRA... |
|
Experimental |
| 6187 |
trivox-io/llmx
Unified Python interface for working with multiple LLM providers across... |
|
Experimental |
| 6188 |
VicNa559/code-offline
Run a local coding agent with llama.cpp for private, containerized... |
|
Experimental |
| 6189 |
dustalov/llmfao
Large Language Model Feedback Analysis and Optimization (LLMFAO) |
|
Experimental |
| 6190 |
CogComp/SRL-Spanish
Semantic Role Labeling (SRL) for Spanish using transformers |
|
Experimental |
| 6191 |
rawatshaurya/LORA-vs-QLORA
Reasoning-style fine-tuning of an instruction LLM using LoRA vs QLoRA,... |
|
Experimental |
| 6192 |
bassrehab/steering-vectors-agents
Runtime control of LLM agent behaviors through activation steering vectors.... |
|
Experimental |
| 6193 |
Shreya831/AI-Medical-Symptom-Checker-SLM
AI-powered medical symptom checker using DistilBERT (Small Language Model)... |
|
Experimental |
| 6194 |
MukundaKatta/grammarprobe
GrammarProbe — Universal Grammar Detector. Test whether LLMs have... |
|
Experimental |
| 6195 |
luozichen/NeonBench
A systematic study of ultra-tiny language models |
|
Experimental |
| 6196 |
nachammai779/Large-Language-Models-from-Scratch
🧠 Building Large Language Models from the ground up — BPE tokenization,... |
|
Experimental |
| 6197 |
jinda-liu/R-LoRA
This repository contains the source code and related resources for R-LoRA. |
|
Experimental |
| 6198 |
arafat2020/cut_py
An R&D project to cut the best prat from a video using AI and ffmpeg. |
|
Experimental |
| 6199 |
Harsha-hue/visual-transformer-guide
I built a visual guide explaining how Transformers work. Tokenization... |
|
Experimental |
| 6200 |
dt-3t/TeGu
Official implementation of "Temporal Guidance for Large Language Models". A... |
|
Experimental |