All Transformer Models
7,795 models ranked by quality score · Page 40 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3901 |
musialski-lab/LayoutEnhancer
Source code for the Paper: Layout Enahancer |
|
Experimental |
| 3902 |
tech-srl/layer_norm_expressivity_role
Code for the paper "On the Expressivity Role of LayerNorm in Transformers'... |
|
Experimental |
| 3903 |
eddyhkchiu/V2V-LLM
[ICRA2026] Official code of the paper "V2V-LLM: Vehicle-to-Vehicle... |
|
Experimental |
| 3904 |
ayinedjimi/ModelBench
Automated LLM Benchmarking on GPU - tokens/sec, latency percentiles, VRAM... |
|
Experimental |
| 3905 |
danilodjor/image-retrieval-using-transformers
This repository contains code used to perform image retrieval using... |
|
Experimental |
| 3906 |
guglielmocamporese/visual-transformer-pytorch
An easy and minimal implementation of the Visual Transformer (ViT) in... |
|
Experimental |
| 3907 |
wesleyscholl/drex
🦀 The transformer is a brilliant hack scaled past its limits. DREX is what... |
|
Experimental |
| 3908 |
senxd/LLM-Interface
A Kotlin Library for interfacing with LLMs. |
|
Experimental |
| 3909 |
wshi83/MedAdapter
[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language... |
|
Experimental |
| 3910 |
sergio11/headline_generation_lstm_transformers
Explore advanced neural networks for crafting captivating headlines! Compare... |
|
Experimental |
| 3911 |
bottergpt/PaperCollection
Collection of ML/DL related papers and notes. |
|
Experimental |
| 3912 |
chaoluond/safetyllama
Finetune LLaMA-2-7b-chat to perform safety evaluation of user-bot conversation |
|
Experimental |
| 3913 |
raajmandale/mos-parameter-golf
CRS-LM: Structure-aware context reduction for tiny language models under... |
|
Experimental |
| 3914 |
RDrahul123/LLMs
A free, practical course on LLMs — Prompt Engineering, APIs, RAG, and Fine-Tuning. |
|
Experimental |
| 3915 |
TIGER-AI-Lab/TableCoT
The code and data for paper "Large Language Models are few(1)-shot Table... |
|
Experimental |
| 3916 |
danadascalescu00/ioai-transformer-workshop
A hands-on introduction to Transformer architecture, designed for... |
|
Experimental |
| 3917 |
steelonion/Monkeys-with-Novelwriters
use llm to write novel 使用大模型的小说写作框架 |
|
Experimental |
| 3918 |
heyisula/infosage-13b
LLM pretraining pipeline using the FineWeb-Edu Dataset |
|
Experimental |
| 3919 |
kgw-wilson/llm-routing
Evaluating different embedding spaces on their effectiveness for LLM routing |
|
Experimental |
| 3920 |
jinmang2/Awesome-Papers
:snowflake: All about my interest Papers and Review :) |
|
Experimental |
| 3921 |
jdleo/tinysafe-1
71M parameter safety classifier (DeBERTa-v3-xsmall). Dual-head: binary... |
|
Experimental |
| 3922 |
Korde-AI/Multi-User-LLM-Agent
Official code for the paper: "Multi-User Large Language Model Agents" |
|
Experimental |
| 3923 |
WindJammer6/37.-A-Hallucination-Mitigation-Scheme-in-Security-Policy-Generation-with-Large-Language-Models
Source code for the paper: A Hallucination Mitigation Scheme in Security... |
|
Experimental |
| 3924 |
andomeder/act-mujoco-manipulation
End-to-end implementation of Action Chunking Transformers (ACT) for... |
|
Experimental |
| 3925 |
AYUSH-ISHAN/MultiAgent-Traffic-Control-with-Transformers
Implementation of Universal Multi-Agent Reinforcement Learning via Policy... |
|
Experimental |
| 3926 |
yelabb/PhantomX
On the Limits of Discrete Representations for Neural Control. A systematic... |
|
Experimental |
| 3927 |
staverm/DARPwTransformers
Transformer network capable of cloning a supervision policy on Dial-a-Ride... |
|
Experimental |
| 3928 |
fannie1208/FactTest
[ICML2025] "FactTest: Factuality Testing in Large Language Models with... |
|
Experimental |
| 3929 |
git-disl/Lisa
This is the official code for the paper "Lazy Safety Alignment for Large... |
|
Experimental |
| 3930 |
jiayuww/SpatialEval
[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning... |
|
Experimental |
| 3931 |
Victorwz/VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023. |
|
Experimental |
| 3932 |
zhestyatsky/MCL-WiC
Research on Multilingual and Cross-lingual Word-in-Context Disambiguation |
|
Experimental |
| 3933 |
gulabpatel/LLMs
Alpaca, Bloom, DeciLM, Falcon, Vicuna, Llama2, Zephyr, Mistral(MoE), RAG,... |
|
Experimental |
| 3934 |
sahsaeedi/TPO
[TMLR] Triple Preference Optimization |
|
Experimental |
| 3935 |
HamedBabaei/CoLLM
CoLLM: Consistency of Large Language Models in Knowledge Engineering |
|
Experimental |
| 3936 |
Volscente/NexusLLM
NexusLLM is a GitHub repository dedicated to exploring various experiments... |
|
Experimental |
| 3937 |
anoopkdcs/NLPBias
Towards Comprehensive Understanding of Bias in Pre-trained Neural Language... |
|
Experimental |
| 3938 |
NaS-Research/knowledge-model
Our knowledge system systematically ingests, processes, and indexes... |
|
Experimental |
| 3939 |
PKU-YuanGroup/Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large... |
|
Experimental |
| 3940 |
thefcraft/torch-transformer-hinglish2hindi-translator
torch-transformer-hinglish2hindi-translator is a character-level translater... |
|
Experimental |
| 3941 |
Anne-Andresen/Multi-Modal-cuda-C-GAN
Raw C/cuda implementation of 3d GAN |
|
Experimental |
| 3942 |
onidahabitual85/llm-server
Launch and optimize llama.cpp servers automatically across Linux, macOS, and... |
|
Experimental |
| 3943 |
Ritaprava95/Custom_Entity_Extraction_Spacy3.5
Making a custom entity extraction model using spacy 3.5 using both... |
|
Experimental |
| 3944 |
thansen0/fastllm.cpp
A low latency, fault tolerant API for accessing LLM's written in C++ using llama.cpp. |
|
Experimental |
| 3945 |
joshstephenson/MorphemeSegmentation
This is a survey of morpheme segmentation techniques including 2 baselines... |
|
Experimental |
| 3946 |
AnkitNayak-eth/Llama-AI
Powered by the Llama 3.3 70B API, it delivers advanced, context-aware, and... |
|
Experimental |
| 3947 |
QuantLet/Encode-the-Qode
Towards Code Summarization for Scientific Domain Experts on Scarce Data... |
|
Experimental |
| 3948 |
IvanMao714/Transformers
Huggingface Transformers Tutorial |
|
Experimental |
| 3949 |
IsmaelMousa/playing-with-finetuning
Practice fine-tuning a Pretrained Transformers model from Hugging Face using... |
|
Experimental |
| 3950 |
simply-pouria/The-LMs-Book
My study notes, code implementations, etc. while reading The Hundred-Page... |
|
Experimental |
| 3951 |
Yahnnosh/Exploring-Model-Fusion-with-Optimal-Transport-on-Transformers
Project for the course "Deep Learning" 2022 at ETH Zurich |
|
Experimental |
| 3952 |
shyamcody/nlp-experiments
I will try small experiments on older state of the art models like bart, t5... |
|
Experimental |
| 3953 |
akash13singh/resilient_nlp
MockingBERT: Making Transformer Models Resilient to Adversarial Misspellings |
|
Experimental |
| 3954 |
Hexastack/hexabot-helper-ollama
The Ollama Helper Extension for Hexabot Chatbot / Agent Builder to enable... |
|
Experimental |
| 3955 |
mims-harvard/TimeX
Time series explainability via self-supervised model behavior consistency |
|
Experimental |
| 3956 |
GiovanniIacuzzo/Classification-instruments
Automatic classification of musical instruments from audio spectrograms... |
|
Experimental |
| 3957 |
AMDonati/SMC-T-v2
Code for the paper "The Monte Carlo Transformer: a stochastic self-attention... |
|
Experimental |
| 3958 |
Vadimbuildercxx/NumpyGPT
A lightweight educational implementation of GPT (Generative Pre-trained... |
|
Experimental |
| 3959 |
liuqidong07/Awesome-LLM-Enhanced-Recommender-Systems
[KDD'25] Large Language Model Enhanced Recommender Systems: Methods,... |
|
Experimental |
| 3960 |
nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle
🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom... |
|
Experimental |
| 3961 |
munnabhaiiii981/llm-attention-visualizer
🔍 Visualize attention patterns in transformer models to better understand... |
|
Experimental |
| 3962 |
TeamxUndefined/peer_hire_hackhazards_25
PeerHire solves the problem of trust and transparency in freelance... |
|
Experimental |
| 3963 |
Riccorl/transformers-ner
Simple NER model, showcasing Transformer Embedder library. |
|
Experimental |
| 3964 |
igorbenav/practical-language-models
An open book that teaches language models starting from the learning problem... |
|
Experimental |
| 3965 |
sugarandgugu/Simple-Trl-Training
基于DPO算法微调语言大模型,简单好上手。 |
|
Experimental |
| 3966 |
GAIR-NLP/scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators |
|
Experimental |
| 3967 |
xamry/llm-lab
Working sample implementations of several use cases involving Large Language Models. |
|
Experimental |
| 3968 |
j341nono/llemb
Unified embedding extraction for decoder-only LLMs with support for pooling... |
|
Experimental |
| 3969 |
chizkidd/microGPT
Minimal char-level GPT inspired by @karpathy's microGPT: multi-dataset... |
|
Experimental |
| 3970 |
codegram/calbert
Catalan ALBERT (A Lite BERT for self-supervised learning of language representations) |
|
Experimental |
| 3971 |
rohanmistry231/NLP-Interview-Preparation
A targeted resource for mastering NLP, featuring practice problems, code... |
|
Experimental |
| 3972 |
stefanpietrusky/FACTS
Repository for the article in the online magazine Data Science Collective. |
|
Experimental |
| 3973 |
declare-lab/della
DELLA-Merging: Reducing Interference in Model Merging through... |
|
Experimental |
| 3974 |
Md-Emon-Hasan/Fine-Tuning
End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA,... |
|
Experimental |
| 3975 |
SertraFurr/Discord-AI-Bot
A simple discord AI chatbot using my own package! |
|
Experimental |
| 3976 |
rafaelvp-db/hf-finetune
Fine tuning a GPT model using the Persuasion for Good dataset. |
|
Experimental |
| 3977 |
Brokttv/Transformer-from-scratch
elaborate transformer implementation + detailed explanation |
|
Experimental |
| 3978 |
eftekhar-hossain/CUET_NLP-EACL_2021
This repository contains the system description and the codes that we... |
|
Experimental |
| 3979 |
Argo-Robot/foundation_models
Overview about state-of-art imitation learning techniques for robotic... |
|
Experimental |
| 3980 |
Junwu0615/RAG-With-LangChain-And-FAISS
用 LangChain + FAISS 實作 RAG ( Gemini / ChatGPT / Breeze / LLama / Vector DB ) |
|
Experimental |
| 3981 |
m3hrdadfi/wiki-summary
A Bert2Bert model which able to summarize articles! |
|
Experimental |
| 3982 |
dragonnomada/ipn-cic-diplomado-ia-2025
Diplomado en Inteligencia Artificial del CIC / IPN |
|
Experimental |
| 3983 |
paxnea/LLM-multimodal-nudging
Zero-Shot Learning for Multimodal Nudging |
|
Experimental |
| 3984 |
caktus/llm-learning
A collection of notebooks and resources for learning about Large Language... |
|
Experimental |
| 3985 |
ashimmortallp/mHC-manifold-constrained-hyper-connections
🔍 Explore mHC for manifold-constrained hyper-connections in PyTorch,... |
|
Experimental |
| 3986 |
vishaln15/roco-image-captioning
Enhanced Image Captioning on ROCO Multimodal dataset using step-by-step distillation |
|
Experimental |
| 3987 |
chagmgang/dinov2-remote-sensing
Implementation dino v2 for remote sensing with huggingface transformers |
|
Experimental |
| 3988 |
viktor-shcherb/llm-tool-call-sft
LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven... |
|
Experimental |
| 3989 |
SpiritsYouthHarmony/awesome-llm-physics-benchmarks
A curated list of benchmarks for evaluating LLMs on physics reasoning and... |
|
Experimental |
| 3990 |
ethicalabs-ai/FlowerTune-Qwen2.5-Coder-0.5B-Instruct
FlowerTune LLM on Coding Dataset |
|
Experimental |
| 3991 |
Hexastack/hexabot-cli
CLI for Hexabot to create projects and run them. |
|
Experimental |
| 3992 |
8asic/mlpc2025-sound-event-detection
Competition-winning SED (Sound Event Detection) system that identifies audio... |
|
Experimental |
| 3993 |
joisino/zeh
Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons... |
|
Experimental |
| 3994 |
ariannamethod/yent.yo
diffusion AI with a bad character |
|
Experimental |
| 3995 |
sanjaydeploys/Netai-Social
Netai-Social is a social media application built with Flask, React, and... |
|
Experimental |
| 3996 |
shikhartuli/cnn_txf_bias
[CogSci'21] Study of human inductive biases in CNNs and Transformers. |
|
Experimental |
| 3997 |
sofieditmer/depression_detection
This repository contains the contents of a Master's degree in Cognitive... |
|
Experimental |
| 3998 |
tbohne/saliency_kd
Saliency map-guided knowledge discovery for subclass identification with... |
|
Experimental |
| 3999 |
koudounasalkis/UnSLU-BENCH
This repo contains the code for <<"Alexa, can you forget me?” Machine... |
|
Experimental |
| 4000 |
TonmoyTalukder/Rank-Your-Summaries-Enhancing-Bengali-Text-Summarization-via-Ranking-based-Approach
Enhancinng Bengali Text Summarization via Ranking based Approach |
|
Experimental |