All Transformer Models
7,795 models ranked by quality score · Page 24 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2301 | guoriyue/LangCommand<br>LangCommand is a local inference command-line tool that transforms natural... | | Emerging |
| 2302 | AlgonetLabs/Cable<br>Context-aware Biases for Length Extrapolation | | Emerging |
| 2303 | thinkall/featcopilot<br>Next-generation LLM-powered auto feature engineering framework | | Emerging |
| 2304 | xinyanghuang7/Basic-Visual-Language-Model<br>Build a simple basic multimodal large model from scratch. 🤖 | | Emerging |
| 2305 | Anjum48/commonlitreadabilityprize<br>4th Place solution for the Kaggle CommonLit Readability Prize | | Emerging |
| 2306 | Chunjiang-Intelligence/Credal-Transformer<br>Paper "Credal Transformer: A Principled Approach for Quantifying and Mitigating... | | Emerging |
| 2307 | cgjosephlee/ollama-save-load<br>Save and load ollama models just like operating docker images. | | Emerging |
| 2308 | Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer<br>This is a proof of concept of the aforementioned Meta paper, along with other... | | Emerging |
| 2309 | pat-jj/KG-FIT<br>[NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs | | Emerging |
| 2310 | LunjunZhang/ema-pg<br>Code for "EMA Policy Gradient: Taming Reinforcement Learning for LLMs with... | | Emerging |
| 2311 | andreped/vit-explainer<br>🔥 Demonstrating Explainable AI with Vision Transformer in web app | | Emerging |
| 2312 | rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice<br>AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and... | | Emerging |
| 2313 | kmaurinjones/AllMeans<br>Automatic topic modelling using minimal external input and computational resources | | Emerging |
| 2314 | VITA-Group/TAPE<br>[ICML'25] "Rethinking Addressing in Language Models via Contextualized... | | Emerging |
| 2315 | parameterlab/apricot<br>Source code of "Calibrating Large Language Models Using Their Generations... | | Emerging |
| 2316 | swainshashwat/Flock<br>Craft custom Large Language Models (LLMs) effortlessly using Flock. Build... | | Emerging |
| 2317 | cocacola-lab/Awesome-Transformer-in-Transportation<br>Papers & resources linked to Transformer-based research mainly for... | | Emerging |
| 2318 | siwei-li/NLP_summarization<br>Summarization of lecture video transcripts using BERT. | | Emerging |
| 2319 | franckalbinet/iomeval<br>Streamline evaluation evidence mapping at scale with LLMs | | Emerging |
| 2320 | martin-wey/cl-code-apis<br>Replication package of the paper "On the Usage of Continual Learning for... | | Emerging |
| 2321 | haesleinhuepf/vlm-pictionary<br>Play pictionary with Vision Language Models! | | Emerging |
| 2322 | InquestGeronimo/tllm<br>An LLM training library for instruction-tuning. | | Emerging |
| 2323 | AlenVelocity/langchain-llama<br>Run LLAMA LLMs in Node with Langchain | | Emerging |
| 2324 | nightdessert/Retrieval_Head<br>Open-source code for paper: Retrieval Head Mechanistically Explains... | | Emerging |
| 2325 | uiuctml/Localize-and-Stitch<br>Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | | Emerging |
| 2326 | markendo/downscaling_intelligence<br>Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in... | | Emerging |
| 2327 | yuecao0119/MMFuser<br>The official implementation of the paper "MMFuser: Multimodal Multi-Layer... | | Emerging |
| 2328 | PromptMixerDev/prompt-mixer-ollama-connector<br>Ollama Connector | | Emerging |
| 2329 | jianzhnie/LLMToolkit<br>LLMToolkit is a toolkit for NLP (Natural Language Processing) and LLM (Large... | | Emerging |
| 2330 | hollobit/GenAI_LLM_timeline<br>ChatGPT, GenerativeAI and LLMs Timeline | | Emerging |
| 2331 | GURPREETKAURJETHRA/LLaMA3-Quantization<br>LLaMA3-Quantization | | Emerging |
| 2332 | sanjaradylov/moleculegen-ml<br>Generate novel molecules using neural language models | | Emerging |
| 2333 | HariomJangra/project-lumen<br>A 128M parameter language model built from scratch for learning how large... | | Emerging |
| 2334 | yang-ai-lab/OSF-Open-Sleep-FM<br>OSF: On Pre-training and Scaling of Sleep Foundation Models | | Emerging |
| 2335 | actypedef/ARCQuant<br>Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented... | | Emerging |
| 2336 | josStorer/llama.cpp-unicode-windows<br>llama.cpp with unicode (windows) support | | Emerging |
| 2337 | AshishGautamX/K8s-LLM-Scheduler<br>An intelligent Kubernetes scheduler powered by Meta's Llama-3.3-70B model... | | Emerging |
| 2338 | stchakwdev/kan_transformer<br>Baantu Research: Hybrid KAN-Transformer for investigating learnable... | | Emerging |
| 2339 | yaojin17/Unlearning_LLM<br>[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large... | | Emerging |
| 2340 | UCDvision/NOLA<br>Code for NOLA, an implementation of "nola: Compressing LoRA using Linear... | | Emerging |
| 2341 | mkofinas/neural-graphs<br>Official source code for "Graph Neural Networks for Learning Equivariant... | | Emerging |
| 2342 | horseee/LLaMA-Pruning<br>Structural Pruning for LLaMA | | Emerging |
| 2343 | sail-sg/dice<br>Official implementation of Bootstrapping Language Models via DPO Implicit Rewards | | Emerging |
| 2344 | Beomi/KcBERT-Finetune<br>KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from... | | Emerging |
| 2345 | tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes<br>Language model that makes you laugh. | | Emerging |
| 2346 | duyhominhnguyen/Exgra-Med<br>[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment | | Emerging |
| 2347 | tanishqgautam/Image-Captioning<br>Implemented 3 different architectures to tackle the Image Caption problem,... | | Emerging |
| 2348 | psmarter/mini-infer<br>A high-performance LLM inference engine with PagedAttention \|... | | Emerging |
| 2349 | rkinas/reasoning_models_how_to<br>This repository serves as a collection of research notes and resources on... | | Emerging |
| 2350 | microsoft/InteractiveTextGeneration<br>An implementation of the paper "Interactive Text Generation" | | Emerging |
| 2351 | kyegomez/DifferentialTransformer<br>An open source community implementation of the model from "DIFFERENTIAL... | | Emerging |
| 2352 | omron-sinicx/crystalformer<br>The official code repository for "Crystalformer: Infinitely Connected... | | Emerging |
| 2353 | UCSB-NLP-Chang/ULD<br>Implementation of paper 'Reversing the Forget-Retain Objectives: An... | | Emerging |
| 2354 | codewithdark-git/QuantLLM<br>QuantLLM is a Python library designed for developers, researchers, and teams... | | Emerging |
| 2355 | Gapi505/Sparky-2<br>This is a discord bot running on llama cpp with the llama 3 model and image... | | Emerging |
| 2356 | ananttripathi/Resume-Analyzer-MLOps<br>Resume Analyzer is an AI-powered MLOps platform that optimizes your resume... | | Emerging |
| 2357 | bloomberg/minilmv2.bb<br>Our open source implementation of MiniLMv2... | | Emerging |
| 2358 | smitkiri/news-qa<br>Reading comprehension based question-answering model for news articles. | | Emerging |
| 2359 | Esmail-ibraheem/Tinyllamas-pytorch<br>Tinyllamas🦙 is an extensible advanced language model framework, inspired by... | | Emerging |
| 2360 | SAP-samples/btp-running-language-models<br>This repository contains different code examples around the topic of... | | Emerging |
| 2361 | poloclub/tsr-convstem<br>High-Performance Transformers for Table Structure Recognition Need Early Convolutions | | Emerging |
| 2362 | nicolay-r/Reasoning-for-Sentiment-Analysis-Framework<br>The official code for CoT / ZSL reasoning framework 🧠, utilized in paper:... | | Emerging |
| 2363 | MLD3/steerability<br>An open-source evaluation framework for measuring LLM steerability. | | Emerging |
| 2364 | andreped/INF1600-ai-workshop<br>🔥 Workshop in AI Deployment (INF-1600, UiT) | | Emerging |
| 2365 | jseeio/gpt2-tfjs<br>GPT2 with Tensorflow.js | | Emerging |
| 2366 | songxiaoshuai/progco<br>Official Implementation of "ProgCo: Program Helps Self-Correction of Large... | | Emerging |
| 2367 | bipinKrishnan/ml-recipe-book<br>A book containing step by step instructions to train deep learning models... | | Emerging |
| 2368 | ApplyU-ai/ColorBlindnessEval<br>ColorBlindnessEval: Can Vision Language Models Pass Color Blindness Tests? | | Emerging |
| 2369 | Wang-ML-Lab/multimodal-needle-in-a-haystack<br>[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking... | | Emerging |
| 2370 | richouzo/hate-speech-detection-survey<br>Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers,... | | Emerging |
| 2371 | adapter-hub/efficient-task-transfer<br>Research code for "What to Pre-Train on? Efficient Intermediate Task... | | Emerging |
| 2372 | UBC-MDS/fixml<br>LLM Tool for effective test evaluation of ML projects with curated... | | Emerging |
| 2373 | GURPREETKAURJETHRA/LLMs-Evaluation<br>LLMs Evaluation | | Emerging |
| 2374 | cosmoquester/transformers-tf-finetune<br>Scripts to finetune huggingface transformers models with Tensorflow 2 | | Emerging |
| 2375 | asigalov61/Lars-Ulrich-Transformer<br>[DEPRECATED] [339M] [88% acc] Fast full-featured drums inpainting... | | Emerging |
| 2376 | ROIM1998/APT<br>[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models... | | Emerging |
| 2377 | Lanerra/reasoning-bank-slm<br>An experiment that applies Google Research's `ReasoningBank` technique to... | | Emerging |
| 2378 | submarat/removing-layer-norm<br>Transformers Don’t Need LayerNorm at Inference Time | | Emerging |
| 2379 | chrisjob1021/transformer-examples<br>A collection of educational toy implementations and examples of key... | | Emerging |
| 2380 | anyscale/llm-router<br>Tutorial for building an LLM router | | Emerging |
| 2381 | zjunlp/LightThinker<br>[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression | | Emerging |
| 2382 | avsrma/LLM-based-AI-Assistant<br>A general purpose AI voice assistant built using GPT-4. | | Emerging |
| 2383 | yotamnahum/DNA-Data-Storage<br>Single Read Reconstruction for DNA Data Storage Using Transformers (official... | | Emerging |
| 2384 | declare-lab/TEAM<br>Our EMNLP 2022 paper on MCQA | | Emerging |
| 2385 | xuanlinli17/large_vlm_distillation_ood<br>Distilling Large Vision-Language Model with Out-of-Distribution... | | Emerging |
| 2386 | WooooDyy/BAPO<br>Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for... | | Emerging |
| 2387 | ZigeW/data_management_LLM<br>Collection of training data management explorations for large language models | | Emerging |
| 2388 | mcbal/deep-implicit-attention<br>Implementation of deep implicit attention in PyTorch | | Emerging |
| 2389 | BIDS-Xu-Lab/Me-LLaMA<br>A novel medical large language model family with 13/70B parameters, which... | | Emerging |
| 2390 | telekom/transformer-tools<br>Transformers Training Tools | | Emerging |
| 2391 | YunzeMan/Lexicon3D<br>[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D... | | Emerging |
| 2392 | Nondzu/LlamaTor<br>LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,... | | Emerging |
| 2393 | crux82/u-deppllama<br>Dependency parsing with Large Language Models | | Emerging |
| 2394 | monk1337/NanoPeft<br>The simplest repository & neat implementation of different LoRA methods for... | | Emerging |
| 2395 | Vitgracer/DinoV3-Object-Tracking<br>Object tracking using the DINOv3 model. | | Emerging |
| 2396 | elephantmipt/compressors<br>A small library with distillation, quantization and pruning pipelines | | Emerging |
| 2397 | Marvin-VW/python-ollama-local<br>This Python script enables hands-free interaction with a local Llama2... | | Emerging |
| 2398 | Orlando-CS/Awesome-VLA<br>✨✨ Latest advancements in VLA models (Vision Language Action) | | Emerging |
| 2399 | srsawant34/efficient_instruction_learning<br>Code base for the paper "Instruction Tuned Models are Quick Learners". | | Emerging |
| 2400 | ES7/LLaMA-from-Scratch<br>In this repository, I have explained the working of the LLaMA Model,... | | Emerging |