All Transformer Models

7,795 models ranked by quality score · Page 40 of 78

Showing 3901–4000 of 7,795
# Model Score Tier
3901 musialski-lab/LayoutEnhancer

Source code for the paper: Layout Enhancer

23
Experimental
3902 tech-srl/layer_norm_expressivity_role

Code for the paper "On the Expressivity Role of LayerNorm in Transformers'...

23
Experimental
3903 eddyhkchiu/V2V-LLM

[ICRA2026] Official code of the paper "V2V-LLM: Vehicle-to-Vehicle...

23
Experimental
3904 ayinedjimi/ModelBench

Automated LLM Benchmarking on GPU - tokens/sec, latency percentiles, VRAM...

23
Experimental
3905 danilodjor/image-retrieval-using-transformers

This repository contains code used to perform image retrieval using...

23
Experimental
3906 guglielmocamporese/visual-transformer-pytorch

An easy and minimal implementation of the Visual Transformer (ViT) in...

23
Experimental
3907 wesleyscholl/drex

🦀 The transformer is a brilliant hack scaled past its limits. DREX is what...

23
Experimental
3908 senxd/LLM-Interface

A Kotlin Library for interfacing with LLMs.

23
Experimental
3909 wshi83/MedAdapter

[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language...

23
Experimental
3910 sergio11/headline_generation_lstm_transformers

Explore advanced neural networks for crafting captivating headlines! Compare...

23
Experimental
3911 bottergpt/PaperCollection

Collection of ML/DL related papers and notes.

23
Experimental
3912 chaoluond/safetyllama

Fine-tune LLaMA-2-7b-chat to perform safety evaluation of user-bot conversations

23
Experimental
3913 raajmandale/mos-parameter-golf

CRS-LM: Structure-aware context reduction for tiny language models under...

23
Experimental
3914 RDrahul123/LLMs

A free, practical course on LLMs — Prompt Engineering, APIs, RAG, and Fine-Tuning.

23
Experimental
3915 TIGER-AI-Lab/TableCoT

The code and data for paper "Large Language Models are few(1)-shot Table...

23
Experimental
3916 danadascalescu00/ioai-transformer-workshop

A hands-on introduction to Transformer architecture, designed for...

23
Experimental
3917 steelonion/Monkeys-with-Novelwriters

Use an LLM to write novels: a novel-writing framework built on large language models

23
Experimental
3918 heyisula/infosage-13b

LLM pretraining pipeline using the FineWeb-Edu Dataset

23
Experimental
3919 kgw-wilson/llm-routing

Evaluating different embedding spaces on their effectiveness for LLM routing

23
Experimental
3920 jinmang2/Awesome-Papers

❄️ All about papers I'm interested in, and reviews :)

23
Experimental
3921 jdleo/tinysafe-1

71M parameter safety classifier (DeBERTa-v3-xsmall). Dual-head: binary...

23
Experimental
3922 Korde-AI/Multi-User-LLM-Agent

Official code for the paper: "Multi-User Large Language Model Agents"

23
Experimental
3923 WindJammer6/37.-A-Hallucination-Mitigation-Scheme-in-Security-Policy-Generation-with-Large-Language-Models

Source code for the paper: A Hallucination Mitigation Scheme in Security...

23
Experimental
3924 andomeder/act-mujoco-manipulation

End-to-end implementation of Action Chunking Transformers (ACT) for...

23
Experimental
3925 AYUSH-ISHAN/MultiAgent-Traffic-Control-with-Transformers

Implementation of Universal Multi-Agent Reinforcement Learning via Policy...

23
Experimental
3926 yelabb/PhantomX

On the Limits of Discrete Representations for Neural Control. A systematic...

23
Experimental
3927 staverm/DARPwTransformers

Transformer network capable of cloning a supervision policy on Dial-a-Ride...

23
Experimental
3928 fannie1208/FactTest

[ICML2025] "FactTest: Factuality Testing in Large Language Models with...

23
Experimental
3929 git-disl/Lisa

This is the official code for the paper "Lazy Safety Alignment for Large...

23
Experimental
3930 jiayuww/SpatialEval

[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning...

23
Experimental
3931 Victorwz/VaLM

VaLM: Visually-augmented Language Modeling. ICLR 2023.

23
Experimental
3932 zhestyatsky/MCL-WiC

Research on Multilingual and Cross-lingual Word-in-Context Disambiguation

23
Experimental
3933 gulabpatel/LLMs

Alpaca, Bloom, DeciLM, Falcon, Vicuna, Llama2, Zephyr, Mistral(MoE), RAG,...

23
Experimental
3934 sahsaeedi/TPO

[TMLR] Triple Preference Optimization

23
Experimental
3935 HamedBabaei/CoLLM

CoLLM: Consistency of Large Language Models in Knowledge Engineering

23
Experimental
3936 Volscente/NexusLLM

NexusLLM is a GitHub repository dedicated to exploring various experiments...

23
Experimental
3937 anoopkdcs/NLPBias

Towards Comprehensive Understanding of Bias in Pre-trained Neural Language...

23
Experimental
3938 NaS-Research/knowledge-model

Our knowledge system systematically ingests, processes, and indexes...

23
Experimental
3939 PKU-YuanGroup/Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large...

23
Experimental
3940 thefcraft/torch-transformer-hinglish2hindi-translator

torch-transformer-hinglish2hindi-translator is a character-level translator...

23
Experimental
3941 Anne-Andresen/Multi-Modal-cuda-C-GAN

Raw C/CUDA implementation of 3D GAN

23
Experimental
3942 onidahabitual85/llm-server

Launch and optimize llama.cpp servers automatically across Linux, macOS, and...

23
Experimental
3943 Ritaprava95/Custom_Entity_Extraction_Spacy3.5

Making a custom entity extraction model with spaCy 3.5 using both...

23
Experimental
3944 thansen0/fastllm.cpp

A low-latency, fault-tolerant API for accessing LLMs, written in C++ using llama.cpp.

23
Experimental
3945 joshstephenson/MorphemeSegmentation

This is a survey of morpheme segmentation techniques including 2 baselines...

23
Experimental
3946 AnkitNayak-eth/Llama-AI

Powered by the Llama 3.3 70B API, it delivers advanced, context-aware, and...

23
Experimental
3947 QuantLet/Encode-the-Qode

Towards Code Summarization for Scientific Domain Experts on Scarce Data...

23
Experimental
3948 IvanMao714/Transformers

Hugging Face Transformers Tutorial

23
Experimental
3949 IsmaelMousa/playing-with-finetuning

Practice fine-tuning a Pretrained Transformers model from Hugging Face using...

23
Experimental
3950 simply-pouria/The-LMs-Book

My study notes, code implementations, etc. while reading The Hundred-Page...

23
Experimental
3951 Yahnnosh/Exploring-Model-Fusion-with-Optimal-Transport-on-Transformers

Project for the course "Deep Learning" 2022 at ETH Zurich

23
Experimental
3952 shyamcody/nlp-experiments

I will try small experiments on older state-of-the-art models like BART, T5...

23
Experimental
3953 akash13singh/resilient_nlp

MockingBERT: Making Transformer Models Resilient to Adversarial Misspellings

23
Experimental
3954 Hexastack/hexabot-helper-ollama

The Ollama Helper Extension for Hexabot Chatbot / Agent Builder to enable...

23
Experimental
3955 mims-harvard/TimeX

Time series explainability via self-supervised model behavior consistency

23
Experimental
3956 GiovanniIacuzzo/Classification-instruments

Automatic classification of musical instruments from audio spectrograms...

23
Experimental
3957 AMDonati/SMC-T-v2

Code for the paper "The Monte Carlo Transformer: a stochastic self-attention...

23
Experimental
3958 Vadimbuildercxx/NumpyGPT

A lightweight educational implementation of GPT (Generative Pre-trained...

23
Experimental
3959 liuqidong07/Awesome-LLM-Enhanced-Recommender-Systems

[KDD'25] Large Language Model Enhanced Recommender Systems: Methods,...

23
Experimental
3960 nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle

🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom...

23
Experimental
3961 munnabhaiiii981/llm-attention-visualizer

🔍 Visualize attention patterns in transformer models to better understand...

23
Experimental
3962 TeamxUndefined/peer_hire_hackhazards_25

PeerHire solves the problem of trust and transparency in freelance...

23
Experimental
3963 Riccorl/transformers-ner

Simple NER model, showcasing Transformer Embedder library.

23
Experimental
3964 igorbenav/practical-language-models

An open book that teaches language models starting from the learning problem...

23
Experimental
3965 sugarandgugu/Simple-Trl-Training

Fine-tune large language models with the DPO algorithm; simple and easy to get started.

23
Experimental
3966 GAIR-NLP/scaleeval

Scalable Meta-Evaluation of LLMs as Evaluators

23
Experimental
3967 xamry/llm-lab

Working sample implementations of several use cases involving Large Language Models.

23
Experimental
3968 j341nono/llemb

Unified embedding extraction for decoder-only LLMs with support for pooling...

23
Experimental
3969 chizkidd/microGPT

Minimal char-level GPT inspired by @karpathy's microGPT: multi-dataset...

23
Experimental
3970 codegram/calbert

Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)

23
Experimental
3971 rohanmistry231/NLP-Interview-Preparation

A targeted resource for mastering NLP, featuring practice problems, code...

23
Experimental
3972 stefanpietrusky/FACTS

Repository for the article in the online magazine Data Science Collective.

23
Experimental
3973 declare-lab/della

DELLA-Merging: Reducing Interference in Model Merging through...

23
Experimental
3974 Md-Emon-Hasan/Fine-Tuning

End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA,...

23
Experimental
3975 SertraFurr/Discord-AI-Bot

A simple discord AI chatbot using my own package!

23
Experimental
3976 rafaelvp-db/hf-finetune

Fine tuning a GPT model using the Persuasion for Good dataset.

23
Experimental
3977 Brokttv/Transformer-from-scratch

elaborate transformer implementation + detailed explanation

23
Experimental
3978 eftekhar-hossain/CUET_NLP-EACL_2021

This repository contains the system description and the codes that we...

23
Experimental
3979 Argo-Robot/foundation_models

Overview of state-of-the-art imitation learning techniques for robotic...

23
Experimental
3980 Junwu0615/RAG-With-LangChain-And-FAISS

Implementing RAG with LangChain + FAISS (Gemini / ChatGPT / Breeze / LLama / Vector DB)

23
Experimental
3981 m3hrdadfi/wiki-summary

A Bert2Bert model that is able to summarize articles!

23
Experimental
3982 dragonnomada/ipn-cic-diplomado-ia-2025

Diploma course in Artificial Intelligence at CIC / IPN

23
Experimental
3983 paxnea/LLM-multimodal-nudging

Zero-Shot Learning for Multimodal Nudging

23
Experimental
3984 caktus/llm-learning

A collection of notebooks and resources for learning about Large Language...

23
Experimental
3985 ashimmortallp/mHC-manifold-constrained-hyper-connections

🔍 Explore mHC for manifold-constrained hyper-connections in PyTorch,...

23
Experimental
3986 vishaln15/roco-image-captioning

Enhanced Image Captioning on ROCO Multimodal dataset using step-by-step distillation

23
Experimental
3987 chagmgang/dinov2-remote-sensing

Implementation of DINOv2 for remote sensing with Hugging Face Transformers

23
Experimental
3988 viktor-shcherb/llm-tool-call-sft

LoRA fine-tuning pipeline for tool-calling chat LLMs with config-driven...

23
Experimental
3989 SpiritsYouthHarmony/awesome-llm-physics-benchmarks

A curated list of benchmarks for evaluating LLMs on physics reasoning and...

23
Experimental
3990 ethicalabs-ai/FlowerTune-Qwen2.5-Coder-0.5B-Instruct

FlowerTune LLM on Coding Dataset

23
Experimental
3991 Hexastack/hexabot-cli

CLI for Hexabot to create projects and run them.

23
Experimental
3992 8asic/mlpc2025-sound-event-detection

Competition-winning SED (Sound Event Detection) system that identifies audio...

23
Experimental
3993 joisino/zeh

Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons...

23
Experimental
3994 ariannamethod/yent.yo

diffusion AI with a bad character

23
Experimental
3995 sanjaydeploys/Netai-Social

Netai-Social is a social media application built with Flask, React, and...

23
Experimental
3996 shikhartuli/cnn_txf_bias

[CogSci'21] Study of human inductive biases in CNNs and Transformers.

23
Experimental
3997 sofieditmer/depression_detection

This repository contains the contents of a Master's degree in Cognitive...

23
Experimental
3998 tbohne/saliency_kd

Saliency map-guided knowledge discovery for subclass identification with...

23
Experimental
3999 koudounasalkis/UnSLU-BENCH

This repo contains the code for "Alexa, can you forget me?" Machine...

23
Experimental
4000 TonmoyTalukder/Rank-Your-Summaries-Enhancing-Bengali-Text-Summarization-via-Ranking-based-Approach

Enhancing Bengali Text Summarization via Ranking-based Approach

23
Experimental