All Transformer Models

7,795 models ranked by quality score · Page 50 of 78

Showing 4901–5000 of 7,795
# Model Score Tier
4901 Eric-he-cn/Qwen3-QLoRA-News

This project enables the model to directly generate structured summaries...

20
Experimental
4902 frikishaan/pytorch-transformers

This repository contains the original transformers model implementation code.

20
Experimental
4903 ReNothingg/Mind-4

Experimental transformer LLM for local training, inference, and...

20
Experimental
4904 StarLight1212/Story-Teller

This repo mainly encapsulates an LLM model + front-end + back-end for...

20
Experimental
4905 KOKOSde/sparse-clt

Cross-Layer Transcoder (CLT) library for extracting sparse interpretable...

20
Experimental
4906 Xhst/ml-record-linkage

Unstructured Record Linkage using Siamese Networks and Large Language Models...

20
Experimental
4907 ZEKE320/llm-dataset-generator

The LLM Dataset Generator is an open source tool for generating text data...

20
Experimental
4908 michelecafagna26/VinVL

Original VinVL (and Oscar) repo with an API designed for easy inference

20
Experimental
4909 sidharrth2002/text-scoring

Industrial Text Scoring using Multimodal Deep Natural Language Processing 🚀 ...

20
Experimental
4910 SauravMaheshkar/nanollm

JAX LLM playground

20
Experimental
4911 secret-ai-labs/awesome-local-llm

Your complete guide to running powerful AI models locally in 2025. Covers...

20
Experimental
4912 lijoraju/llm-news-aggregator

Personalized news summaries using LLMs, FAISS, and Telegram bot.

20
Experimental
4913 Simoso68/llama-lit

Streamlit frontend for Ollama.

20
Experimental
4914 ntphuc149/ViAG

ViAG: A Novel Framework for Fine-tuning Answer Generation models utilizing...

20
Experimental
4915 Mecanik/Tiny-BPE-Trainer

Lightweight, header-only Byte Pair Encoding (BPE) trainer in modern C++17....

20
Experimental
4916 chrisliu298/awesome-sparse-autoencoders

A resource repository of sparse autoencoders for large language models

20
Experimental
4917 thyt3618/instap-ai

an AI Agent project contributed by students from YingShan Middle...

20
Experimental
4918 di37/multiclass-image-classification-using-multimodal-llms

A comprehensive comparison of multimodal models - llama3.2-vision,...

20
Experimental
4919 Mahadasghar/Amazon-food-sentiment-analyzer

Fine-tuned RoBERTa transformer for Amazon food review sentiment analysis...

20
Experimental
4920 CameLLM/CameLLM

Run your favourite LLMs locally on macOS from Swift

20
Experimental
4921 brihijoshi/granular-similarity-COLING-2020

Code for the paper "The Devil is in the Details: Evaluating Limitations of...

20
Experimental
4922 NISL-MSU/MultiSetSR

Decomposable Neuro Symbolic Regression

20
Experimental
4923 PRITHIVSAKTHIUR/Doc-VLMs-exp

An experimental document-focused Vision-Language Model application that...

20
Experimental
4924 The-Swarm-Corporation/ClusterMoE

A novel neural network architecture that extends Mixture of Experts (MoE)...

20
Experimental
4925 efficientscaling/Z1

[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"

20
Experimental
4926 koudounasalkis/divergence-in-speech-systems

Code associated with the paper "Exploring Subgroup Performance in End-to-End...

20
Experimental
4927 jwmke/BiasCompass

Using LLMs to detect bias in news articles.

20
Experimental
4928 rajveer43/titan_transformer

Unofficial implementation of titans transformer

20
Experimental
4929 Saivineeth147/LLM-Compass

The ultimate collection of resources for building, evaluating, and...

20
Experimental
4930 Vaioskn/song-identification-fingerprints-and-embeddings

Song identification combining landmark audio fingerprinting with...

20
Experimental
4931 psunlpgroup/VerbosityLLM

This repository maintains dataset, predictions, and code for paper:...

20
Experimental
4932 RedTeamingforLLMs/RedTeamingforLLMs

A framework designed for executing positive red-teaming experiments on large...

20
Experimental
4933 kyegomez/Mixture-of-MQA

An implementation of a Switch Transformer-like multi-query attention model

20
Experimental
4934 obss/disgem

[EMNLP 2024] Official Implementation of DisGeM: Distractor Generation for...

20
Experimental
4935 jhuapl-fomo/ralf

A lightweight library to support the development of applications using LLMs

20
Experimental
4936 Aqib121201/BurgerBot

LLM dashboard for German policy documents—translation, summarization, visualization

20
Experimental
4937 Mrigank005/Rubric_Generator

This repository contains a machine learning model designed to generate...

20
Experimental
4938 adarsh-crafts/llama-llm-from-scratch

Educational, from-scratch implementation of a LLaMA-style LLM using PyTorch...

20
Experimental
4939 MauroLuzzatto/lyrics-translator

🎵 LyricsTranslator is a Python library for automated lyrics translation

20
Experimental
4940 zhudotexe/kani-vision

Kani extension for supporting vision-language models (VLMs). Comes with...

20
Experimental
4941 Sarvesh-Yadav-5201/Lyrics-Generation---NLP-Project

This is a project to demonstrate the capabilities of Transformer Models to...

20
Experimental
4942 sappho192/EDMTranslator

.NET Text translator library based on LLM models, especially...

20
Experimental
4943 marksgraham/transformer-ood

Official PyTorch code for "Transformer-based out-of-distribution detection...

20
Experimental
4944 itsDaiton/masters-thesis

Exploration and Comparison of Transformers for Image Classification.

20
Experimental
4945 mduffster/self-referent-test

Testing role-based pathways on small LLMs

20
Experimental
4946 HySonLab/HierAttention

Scalable Hierarchical Self-Attention with Learnable Hierarchy for Long-Range...

20
Experimental
4947 harrisonvshen/triton-accelerated-attention

Custom Triton GPU kernels for multi-head attention, including QK^T, softmax,...

20
Experimental
4948 minuva/llm-flow-classification

LLM conversation flow classification 💬

20
Experimental
4949 ryan-air/Alpaca-3B-Fine-Tuned

In this project, I have provided code and a Colaboratory notebook that...

20
Experimental
4950 gmongaras/2Mamba2Furious

Code for the paper "2Mamba2Furious: Linear in complexity, competitive in accuracy"

20
Experimental
4951 Khushiyant/tether

Tether is a Triton-powered framework for training and deploying Spiking Transformers.

20
Experimental
4952 Shengwei-Peng/TOCFL-MultiBench

TOCFL-MultiBench: A multimodal benchmark for evaluating Chinese language...

20
Experimental
4953 mspronesti/llm.sycl

llm.c, but in SYCL/Intel oneAPI!

20
Experimental
4954 kapshaul/LLM-finetune-vuln-detection

Fine-tuning a Large Language Model (LLM) for code vulnerability detection...

20
Experimental
4955 edoost/pert

Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

20
Experimental
4956 wasim/scaling-specialization-dense-lms

Do dense LMs develop MoE-like specialization as they scale? Measure it,...

20
Experimental
4957 zabir-nabil/bangla-multilingual-llm-eval

Evaluation of Open and Closed-Source Multi-lingual LLMs for Low-Resource...

20
Experimental
4958 ngavu2004/text-to-knowledge-graph

Turn your Text into a mind map based on LLMs knowledge graph

20
Experimental
4959 H0NEYP0T-466/Isabella

⚙️ Isabella – a full-stack 🚀 conversational system built on FastAPI ✨...

20
Experimental
4960 FareedKhan-dev/best-introduction-to-transformer

transformer again in the same manner as I did in my previous blog (for both...

20
Experimental
4961 ebarkhordar/voter-behavior-prediction-LLM

This project explores the predictive power of large language models (LLMs)...

20
Experimental
4962 Mustapha-AJEGHRIR/arabic_calligraphy

This is a repo containing our code for Arabic calligraphy style detection...

20
Experimental
4963 yophis/decom-renorm-merge

Decom-Renorm-Merge: Merging deep learning models through shared representation space.

20
Experimental
4964 AlanC12138/summarizer-api

AI-powered document summarization API built with FastAPI, Hugging Face...

20
Experimental
4965 affjljoo3581/CommonLit-Readability-Prize

🥈42nd place in CommonLit Readability Prize competition🥈

20
Experimental
4966 HES-XPLAIN/mlxplain

An open platform for accelerating the development of eXplainable AI systems

20
Experimental
4967 a-kostikova/LLLMs-Survey

The GitHub page for the survey paper "LLLMs: A Data-Driven Survey of...

20
Experimental
4968 zufeshan12/fine-tuning-and-reinforcement-learning-on-llms

Supervised fine-tuning and RLAIF on DeepSeek-math-7b-base using LoRA...

20
Experimental
4969 NoviceStone/Keqing

An interpretable KBQA system that operates at the natural language level...

20
Experimental
4970 rishikksh20/qwen3-playground

Readable implementation of Qwen3 0.6B model

20
Experimental
4971 EgosOwn/llama-linux-helper

Never Google for linux commands again with the help of LLaMA

20
Experimental
4972 alipay/fin_domain_llm

Implementation of the paper: WeaverBird: Empowering Financial...

20
Experimental
4973 SkAndMl/captiongpt

Image Captioning using ViT and GPT. Notebook version in the following link

20
Experimental
4974 LMLK-seal/LLModel

Private LLModel GUI Chat allows users to interact with a local large...

20
Experimental
4975 FardinHash/multilabel-classification-llm

Multi-label classification using LLMs, with additional enhancements using...

20
Experimental
4976 colinrizzman/Neural-Romance-v2

A neural network calculates your chance of finding love.

20
Experimental
4977 brej-29/analytics-copilot-text2sql

Analytics Copilot (Text-to-SQL) is an end-to-end LLM engineering project...

20
Experimental
4978 lapismyt/pyAIHorde

Simple library for interacting with AI Horde API.

20
Experimental
4979 waelantar/ATTS_Complete_Free_Package

ATTS: Adaptive Test-Time Scaling - A validated framework for optimizing LLM...

20
Experimental
4980 jha-lab/transcode

[TCAD'23] TransCODE: Co-design of Transformers and Accelerators for...

20
Experimental
4981 sergio-sanz-rodriguez/Vision-Transformers-Image-Classification

Development of Vision Transformer (ViT) networks for multi-class image...

20
Experimental
4982 kbulutozler/transformers-text-classification

Using transformers for text classification.

20
Experimental
4983 necrashter/transformers-learnable-memory

Fine-tuning Image Transformers using Learnable Memory

20
Experimental
4984 Wangmerlyn/KeyChain

KeyChain, UUID-driven data augmentation design behind LoongRL (ICLR 2026 oral)

20
Experimental
4985 nopperl/corporate_emission_reports

Finetuning and evaluating LLMs to extract GHG emissions from PDF reports...

20
Experimental
4986 freddxvill/Proyecto_Traductor_de_la_LSB

Bolivian Sign Language (LSB) to text translator using networks...

20
Experimental
4987 himanshu231204/langchain-playground--for-llms-

A personal learning space for LangChain, featuring code snippets, notes, and...

20
Experimental
4988 Aqib121201/FairNLP-SHAP-Based-Bias-Detection-in-Multilingual-BERT-Models

Bias analysis in multilingual BERT using SHAP and fairness metrics (EN, DE, HI)

20
Experimental
4989 eminorhan/llm-memory

Memory experiments with LLMs

20
Experimental
4990 ColinWu0403/LLaMA-2-hf-Chatbot

Chatbot from pretrained LLaMA-2 LLM model, fine-tuned with medical research...

20
Experimental
4991 GusLovesMath/Llama3_MacSilicon

Repository for running LLMs efficiently on Mac silicon (M1, M2, M3)....

20
Experimental
4992 gemlorg/Thesis-Trading-LLM

Bachelor's Thesis on Machine Learning for Stock Market Forecasting. Several...

20
Experimental
4993 chazciii/rd-net

Inference-time drift experiment demonstrating reduced repetition collapse in...

20
Experimental
4994 JangYeongSil/JettaRLLLM

Jetta-Reinforcement-Learning-Hybrid-LLM-Architecture

20
Experimental
4995 mlsw/partial-embedding-matrix-adaptation

Vocabulary-level memory efficiency for language model fine-tuning.

20
Experimental
4996 Traffic-Alpha/VLMLight

Official implementation of VLMLight

20
Experimental
4997 OmarBouhamed/T-DPnet

T-DPnet-Transformer-based-deep-Probabilistic-network-for-load-forecasting

20
Experimental
4998 bagh2178/GC-VLN

[CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free...

20
Experimental
4999 s-JoL/Llama3-extend-vocab

A demo of expanding the vocabulary of the Llama3 model, applicable to other...

20
Experimental
5000 JerryPan2718/flexgpt

Tradeoff between runtime and RAM usage for large language model inference.

20
Experimental