All Transformer Models
7,795 models ranked by quality score · Page 15 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 1401 |
loong64/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other... |
|
Emerging |
| 1402 |
cloudmercato/ollama-benchmark
Handy tool to measure the performance and efficiency of LLMs workloads. |
|
Emerging |
| 1403 |
laelhalawani/gguf_llama
Wrapper for simplified use of Llama2 GGUF quantized models. |
|
Emerging |
| 1404 |
WangJingyao07/Awesome-GRPO
Codebase of GRPO: Implementations and Resources of GRPO and Its Variants |
|
Emerging |
| 1405 |
ArchAIve-Project/Backend
A complex Flask API system empowered by custom ML models, LLMs and... |
|
Emerging |
| 1406 |
UKPLab/5pils
Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"... |
|
Emerging |
| 1407 |
takara-ai/SwarmFormer
A pytorch implementation of SwarmFormer for text classification. |
|
Emerging |
| 1408 |
deep-div/Custom-Transformer-Pytorch
A clean, ground-up implementation of the Transformer architecture in... |
|
Emerging |
| 1409 |
MrYxJ/calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all... |
|
Emerging |
| 1410 |
KishanBagaria/OCLB
🦙 One Click Llama Button for DeviantArt.com |
|
Emerging |
| 1411 |
praj2408/Text-Summarizer-Project
The text summarizer project is an innovative tool designed to condense... |
|
Emerging |
| 1412 |
StyrbjornKall/TRIDENT
A collection of transformer-based models and developmental scripts presented... |
|
Emerging |
| 1413 |
mshenoda/roberta-spam
RoBERTa based Spam Message Detection |
|
Emerging |
| 1414 |
amoffat/HeimdaLLM
Constrain LLM output |
|
Emerging |
| 1415 |
franjgs/llm-rl-finance-trader
Hybrid project integrating Large Language Models (LLM) for financial news... |
|
Emerging |
| 1416 |
czg1225/dParallel
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs |
|
Emerging |
| 1417 |
yifanzhang-pro/AutoMathText
[ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative... |
|
Emerging |
| 1418 |
FareedKhan-dev/create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. |
|
Emerging |
| 1419 |
complex-reasoning/RPG
[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) |
|
Emerging |
| 1420 |
THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs |
|
Emerging |
| 1421 |
virevolai/logos-shift-client
Replace expensive LLM calls with finetunes automatically |
|
Emerging |
| 1422 |
xiuqhou/Relation-DETR
[ECCV2024 Oral] Official implementation of the paper "Relation DETR:... |
|
Emerging |
| 1423 |
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models |
|
Emerging |
| 1424 |
AyushExel/trolo
An SDK for Transformers + YOLO and other SSD family models |
|
Emerging |
| 1425 |
ramonclaudio/perplexity-ai-toolkit
A lightweight Python API wrapper and CLI for Perplexity’s Sonar language models. |
|
Emerging |
| 1426 |
padeler/PE-former
2D Human Pose estimation using transformers. Implementation in Pytorch |
|
Emerging |
| 1427 |
TatevKaren/BabyGPT-Build_GPT_From_Scratch
BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training... |
|
Emerging |
| 1428 |
nicola-decao/KnowledgeEditor
Code for Editing Factual Knowledge in Language Models |
|
Emerging |
| 1429 |
FareedKhan-dev/qwen3-MoE-from-scratch
A Step-by-Step Implementation of Qwen 3 MoE Architecture from Scratch |
|
Emerging |
| 1430 |
HKUDS/GraphEdit
"GraphEdit: Large Language Models for Graph Structure Learning" |
|
Emerging |
| 1431 |
zhenyi4/codi
Official repository for "CODI: Compressing Chain-of-Thought into Continuous... |
|
Emerging |
| 1432 |
akshitac8/OW-DETR
[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer |
|
Emerging |
| 1433 |
Sea-Snell/CALM-Dialogue
Official code for the paper "Context-Aware Language Modeling for... |
|
Emerging |
| 1434 |
jackaduma/Vicuna-LoRA-RLHF-PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer... |
|
Emerging |
| 1435 |
NSLab-CUK/Unified-Graph-Transformer
Unified Graph Transformer (UGT) is a novel Graph Transformer model... |
|
Emerging |
| 1436 |
Mj23978/sam-assistant
🤖 Sam-assistant is a personal assistant that is designed to understand your... |
|
Emerging |
| 1437 |
PCfVW/hf-fetch-model
Fast HuggingFace model downloads for Rust — an embeddable library for... |
|
Emerging |
| 1438 |
Md-Emon-Hasan/InformaTruth
Fine-tuned roberta-base classifier on the LIAR dataset. Aaccepts multiple... |
|
Emerging |
| 1439 |
INWLY/LWTformer
LWTformer: A Detail-Aware, Learnable Wavelet-Transformer for Ancient Chinese... |
|
Emerging |
| 1440 |
Lamorati92/LLMs-from-scratch
📚 Build and train your own GPT-like Large Language Model from scratch with... |
|
Emerging |
| 1441 |
leaderj1001/CLIP
CLIP: Connecting Text and Image (Learning Transferable Visual Models From... |
|
Emerging |
| 1442 |
declare-lab/red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using... |
|
Emerging |
| 1443 |
slSeanWU/Compose_and_Embellish
Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:... |
|
Emerging |
| 1444 |
knotgrass/attention
several types of attention modules written in PyTorch for learning purposes |
|
Emerging |
| 1445 |
GAIR-NLP/OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling |
|
Emerging |
| 1446 |
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on... |
|
Emerging |
| 1447 |
18907305772/FuseAI
FuseAI Project |
|
Emerging |
| 1448 |
ariG23498/gemma3-object-detection
Fine tune Gemma 3 on an object detection task |
|
Emerging |
| 1449 |
StevenRice99/LLM-IK
LLM-IK: Solving Inverse Kinematics using Large Language Models |
|
Emerging |
| 1450 |
nihalsangeeth/behaviour-seq-transformer
Pytorch implementation of "Behaviour Sequence Transformer for E-commerce... |
|
Emerging |
| 1451 |
rishub-tamirisa/tamper-resistance
[ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for... |
|
Emerging |
| 1452 |
microsoft/COCO-LM
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for... |
|
Emerging |
| 1453 |
xiuqhou/Salience-DETR
[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing... |
|
Emerging |
| 1454 |
waroad/losver
Source Code for LOSVER: Line-Level Modifiability Signal-Guided Vulnerability... |
|
Emerging |
| 1455 |
asprenger/ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable... |
|
Emerging |
| 1456 |
kssteven418/BigLittleDecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder |
|
Emerging |
| 1457 |
hitz-zentroa/GoLLIE
Guideline following Large Language Model for Information Extraction |
|
Emerging |
| 1458 |
SPUTNIKAI/LeechTransformer
Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech... |
|
Emerging |
| 1459 |
armbues/SiLLM-examples
Examples for using the SiLLM framework for training and running Large... |
|
Emerging |
| 1460 |
g8a9/ferret
A python package for benchmarking interpretability techniques on Transformers. |
|
Emerging |
| 1461 |
truefoundry/models
Community-maintained registry of AI/LLM model configurations - pricing,... |
|
Emerging |
| 1462 |
hpcaitech/SwiftInfer
Efficient AI Inference & Serving |
|
Emerging |
| 1463 |
Nithin-Holla/meme_challenge
Repository containing code from team Kingsterdam for the Hateful Memes Challenge |
|
Emerging |
| 1464 |
monarch-initiative/pheval.llm
Analysis of LLMs for Clinical Observations |
|
Emerging |
| 1465 |
wpeebles/G.pt
Official PyTorch Implementation of "Learning to Learn with Generative Models... |
|
Emerging |
| 1466 |
nlpaueb/greek-bert
A Greek edition of BERT pre-trained language model |
|
Emerging |
| 1467 |
NohTow/PPL-MCTS
Repository for the code of the "PPL-MCTS: Constrained Textual Generation... |
|
Emerging |
| 1468 |
VectorInstitute/atomgen
Library for handling atomistic graph datasets focusing on transformer-based... |
|
Emerging |
| 1469 |
chef-transformer/chef-transformer
Chef Transformer 🍲 . |
|
Emerging |
| 1470 |
HxCodeWarrior/StellarByte
从零实现基础的Transformer的Decoerder-Only模型,并进行模型升级,构建专属于自己的LLM模型 |
|
Emerging |
| 1471 |
zarzouram/image_captioning_with_transformers
Pytorch implementation of image captioning using transformer-based model. |
|
Emerging |
| 1472 |
madibabaiasl/MobileRobotGPT4LLaMA2024
Deployment of Large Language Models to Control Mobile Robots at the Edge |
|
Emerging |
| 1473 |
BaiTheBest/SparseLLM
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024) |
|
Emerging |
| 1474 |
IAmPara0x/Yuno
Yuno is context based search engine for anime. |
|
Emerging |
| 1475 |
Koratahiu/Advanced_Optimizers
A family of highly efficient, lightweight yet powerful optimizers. |
|
Emerging |
| 1476 |
TrelisResearch/install-guides
Various installation guides for Large Language Models |
|
Emerging |
| 1477 |
baaivision/EVE
EVE Series: Encoder-Free Vision-Language Models from BAAI |
|
Emerging |
| 1478 |
BodhiSearch/BodhiApp
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs |
|
Emerging |
| 1479 |
argonne-lcf/LLM-Inference-Bench
LLM-Inference-Bench |
|
Emerging |
| 1480 |
BhabhaAI/dataformer
Solving data for LLMs - Create quality synthetic datasets! |
|
Emerging |
| 1481 |
lukashermann/hulc
Hierarchical Universal Language Conditioned Policies |
|
Emerging |
| 1482 |
Traffic-Alpha/iLLM-TSC
This repository contains the code for the paper“iLLM-TSC: Integration... |
|
Emerging |
| 1483 |
ByteDance-Seed/FlexPrefill
Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse... |
|
Emerging |
| 1484 |
robert-mcdermott/LLM-Image-Classification
Image Classification Testing with LLMs |
|
Emerging |
| 1485 |
intersun/LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT |
|
Emerging |
| 1486 |
deep-div/Fine-Tuning-LLMs-and-VisionModels
Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to... |
|
Emerging |
| 1487 |
toyaix/TritonLLM
LLM Inference via Triton (Flexible & Modular): Focused on Kernel... |
|
Emerging |
| 1488 |
Nkluge-correa/Tucano
Natively pre-trained open-source Portuguese language models. |
|
Emerging |
| 1489 |
AlexIoannides/transformers-gen-ai
Developing generative language models using transformers. |
|
Emerging |
| 1490 |
openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation |
|
Emerging |
| 1491 |
sshh12/multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large... |
|
Emerging |
| 1492 |
jdaln/dgx-spark-inference-stack
Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace... |
|
Emerging |
| 1493 |
nercone-dev/zeta-llm-tool
Fully Open-source LLM Tool |
|
Emerging |
| 1494 |
icon-lab/BolT
Fused Window Transformers for fMRI Time Series Analysis... |
|
Emerging |
| 1495 |
styfeng/TinyDialogues
Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective... |
|
Emerging |
| 1496 |
KishanBagaria/dAbot
🤖 CLI tool to automate stuff on DeviantArt.com |
|
Emerging |
| 1497 |
Whiax/BERT-Transformer-Pytorch
Basic implementation of BERT and Transformer in Pytorch in one short python... |
|
Emerging |
| 1498 |
xingyizhou/GTR
Global Tracking Transformers, CVPR 2022 |
|
Emerging |
| 1499 |
NTU-SQUAD/transformers-coqa
Albert for Conversational Question Answering Challenge |
|
Emerging |
| 1500 |
singhsidhukuldeep/Text-Summarizer
Comparing state of the art models for text summary generation |
|
Emerging |