All Transformer Models

7,795 models ranked by quality score · Page 15 of 78

Showing 1401–1500 of 7,795
# Model Score Tier
1401 loong64/ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other...

39
Emerging
1402 cloudmercato/ollama-benchmark

Handy tool to measure the performance and efficiency of LLMs workloads.

39
Emerging
1403 laelhalawani/gguf_llama

Wrapper for simplified use of Llama2 GGUF quantized models.

39
Emerging
1404 WangJingyao07/Awesome-GRPO

Codebase of GRPO: Implementations and Resources of GRPO and Its Variants

39
Emerging
1405 ArchAIve-Project/Backend

A complex Flask API system empowered by custom ML models, LLMs and...

39
Emerging
1406 UKPLab/5pils

Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"...

39
Emerging
1407 takara-ai/SwarmFormer

A pytorch implementation of SwarmFormer for text classification.

39
Emerging
1408 deep-div/Custom-Transformer-Pytorch

A clean, ground-up implementation of the Transformer architecture in...

39
Emerging
1409 MrYxJ/calculate-flops.pytorch

The calflops is designed to calculate FLOPs、MACs and Parameters in all...

39
Emerging
1410 KishanBagaria/OCLB

🦙 One Click Llama Button for DeviantArt.com

39
Emerging
1411 praj2408/Text-Summarizer-Project

The text summarizer project is an innovative tool designed to condense...

39
Emerging
1412 StyrbjornKall/TRIDENT

A collection of transformer-based models and developmental scripts presented...

39
Emerging
1413 mshenoda/roberta-spam

RoBERTa based Spam Message Detection

39
Emerging
1414 amoffat/HeimdaLLM

Constrain LLM output

39
Emerging
1415 franjgs/llm-rl-finance-trader

Hybrid project integrating Large Language Models (LLM) for financial news...

39
Emerging
1416 czg1225/dParallel

[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs

39
Emerging
1417 yifanzhang-pro/AutoMathText

[ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative...

39
Emerging
1418 FareedKhan-dev/create-million-parameter-llm-from-scratch

Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.

39
Emerging
1419 complex-reasoning/RPG

[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)

39
Emerging
1420 THUDM/LongAlign

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

39
Emerging
1421 virevolai/logos-shift-client

Replace expensive LLM calls with finetunes automatically

39
Emerging
1422 xiuqhou/Relation-DETR

[ECCV2024 Oral] Official implementation of the paper "Relation DETR:...

39
Emerging
1423 horseee/Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

39
Emerging
1424 AyushExel/trolo

An SDK for Transformers + YOLO and other SSD family models

39
Emerging
1425 ramonclaudio/perplexity-ai-toolkit

A lightweight Python API wrapper and CLI for Perplexity’s Sonar language models.

39
Emerging
1426 padeler/PE-former

2D Human Pose estimation using transformers. Implementation in Pytorch

39
Emerging
1427 TatevKaren/BabyGPT-Build_GPT_From_Scratch

BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training...

39
Emerging
1428 nicola-decao/KnowledgeEditor

Code for Editing Factual Knowledge in Language Models

39
Emerging
1429 FareedKhan-dev/qwen3-MoE-from-scratch

A Step-by-Step Implementation of Qwen 3 MoE Architecture from Scratch

39
Emerging
1430 HKUDS/GraphEdit

"GraphEdit: Large Language Models for Graph Structure Learning"

39
Emerging
1431 zhenyi4/codi

Official repository for "CODI: Compressing Chain-of-Thought into Continuous...

39
Emerging
1432 akshitac8/OW-DETR

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

39
Emerging
1433 Sea-Snell/CALM-Dialogue

Official code for the paper "Context-Aware Language Modeling for...

39
Emerging
1434 jackaduma/Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer...

39
Emerging
1435 NSLab-CUK/Unified-Graph-Transformer

Unified Graph Transformer (UGT) is a novel Graph Transformer model...

39
Emerging
1436 Mj23978/sam-assistant

🤖 Sam-assistant is a personal assistant that is designed to understand your...

39
Emerging
1437 PCfVW/hf-fetch-model

Fast HuggingFace model downloads for Rust — an embeddable library for...

39
Emerging
1438 Md-Emon-Hasan/InformaTruth

Fine-tuned roberta-base classifier on the LIAR dataset. Aaccepts multiple...

39
Emerging
1439 INWLY/LWTformer

LWTformer: A Detail-Aware, Learnable Wavelet-Transformer for Ancient Chinese...

39
Emerging
1440 Lamorati92/LLMs-from-scratch

📚 Build and train your own GPT-like Large Language Model from scratch with...

39
Emerging
1441 leaderj1001/CLIP

CLIP: Connecting Text and Image (Learning Transferable Visual Models From...

39
Emerging
1442 declare-lab/red-instruct

Codes and datasets of the paper Red-Teaming Large Language Models using...

39
Emerging
1443 slSeanWU/Compose_and_Embellish

Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:...

39
Emerging
1444 knotgrass/attention

several types of attention modules written in PyTorch for learning purposes

39
Emerging
1445 GAIR-NLP/OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

39
Emerging
1446 Lupin1998/Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on...

39
Emerging
1447 18907305772/FuseAI

FuseAI Project

39
Emerging
1448 ariG23498/gemma3-object-detection

Fine tune Gemma 3 on an object detection task

39
Emerging
1449 StevenRice99/LLM-IK

LLM-IK: Solving Inverse Kinematics using Large Language Models

39
Emerging
1450 nihalsangeeth/behaviour-seq-transformer

Pytorch implementation of "Behaviour Sequence Transformer for E-commerce...

39
Emerging
1451 rishub-tamirisa/tamper-resistance

[ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for...

39
Emerging
1452 microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for...

39
Emerging
1453 xiuqhou/Salience-DETR

[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing...

39
Emerging
1454 waroad/losver

Source Code for LOSVER: Line-Level Modifiability Signal-Guided Vulnerability...

39
Emerging
1455 asprenger/ray_vllm_inference

A simple service that integrates vLLM with Ray Serve for fast and scalable...

39
Emerging
1456 kssteven418/BigLittleDecoder

[NeurIPS'23] Speculative Decoding with Big Little Decoder

39
Emerging
1457 hitz-zentroa/GoLLIE

Guideline following Large Language Model for Information Extraction

39
Emerging
1458 SPUTNIKAI/LeechTransformer

Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech...

39
Emerging
1459 armbues/SiLLM-examples

Examples for using the SiLLM framework for training and running Large...

39
Emerging
1460 g8a9/ferret

A python package for benchmarking interpretability techniques on Transformers.

39
Emerging
1461 truefoundry/models

Community-maintained registry of AI/LLM model configurations - pricing,...

39
Emerging
1462 hpcaitech/SwiftInfer

Efficient AI Inference & Serving

39
Emerging
1463 Nithin-Holla/meme_challenge

Repository containing code from team Kingsterdam for the Hateful Memes Challenge

39
Emerging
1464 monarch-initiative/pheval.llm

Analysis of LLMs for Clinical Observations

39
Emerging
1465 wpeebles/G.pt

Official PyTorch Implementation of "Learning to Learn with Generative Models...

39
Emerging
1466 nlpaueb/greek-bert

A Greek edition of BERT pre-trained language model

38
Emerging
1467 NohTow/PPL-MCTS

Repository for the code of the "PPL-MCTS: Constrained Textual Generation...

38
Emerging
1468 VectorInstitute/atomgen

Library for handling atomistic graph datasets focusing on transformer-based...

38
Emerging
1469 chef-transformer/chef-transformer

Chef Transformer 🍲 .

38
Emerging
1470 HxCodeWarrior/StellarByte

从零实现基础的Transformer的Decoerder-Only模型,并进行模型升级,构建专属于自己的LLM模型

38
Emerging
1471 zarzouram/image_captioning_with_transformers

Pytorch implementation of image captioning using transformer-based model.

38
Emerging
1472 madibabaiasl/MobileRobotGPT4LLaMA2024

Deployment of Large Language Models to Control Mobile Robots at the Edge

38
Emerging
1473 BaiTheBest/SparseLLM

Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)

38
Emerging
1474 IAmPara0x/Yuno

Yuno is context based search engine for anime.

38
Emerging
1475 Koratahiu/Advanced_Optimizers

A family of highly efficient, lightweight yet powerful optimizers.

38
Emerging
1476 TrelisResearch/install-guides

Various installation guides for Large Language Models

38
Emerging
1477 baaivision/EVE

EVE Series: Encoder-Free Vision-Language Models from BAAI

38
Emerging
1478 BodhiSearch/BodhiApp

Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs

38
Emerging
1479 argonne-lcf/LLM-Inference-Bench

LLM-Inference-Bench

38
Emerging
1480 BhabhaAI/dataformer

Solving data for LLMs - Create quality synthetic datasets!

38
Emerging
1481 lukashermann/hulc

Hierarchical Universal Language Conditioned Policies

38
Emerging
1482 Traffic-Alpha/iLLM-TSC

This repository contains the code for the paper“iLLM-TSC: Integration...

38
Emerging
1483 ByteDance-Seed/FlexPrefill

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse...

38
Emerging
1484 robert-mcdermott/LLM-Image-Classification

Image Classification Testing with LLMs

38
Emerging
1485 intersun/LightningDOT

source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT

38
Emerging
1486 deep-div/Fine-Tuning-LLMs-and-VisionModels

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to...

38
Emerging
1487 toyaix/TritonLLM

LLM Inference via Triton (Flexible & Modular): Focused on Kernel...

38
Emerging
1488 Nkluge-correa/Tucano

Natively pre-trained open-source Portuguese language models.

38
Emerging
1489 AlexIoannides/transformers-gen-ai

Developing generative language models using transformers.

38
Emerging
1490 openpsi-project/ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

38
Emerging
1491 sshh12/multi_token

Embed arbitrary modalities (images, audio, documents, etc) into large...

38
Emerging
1492 jdaln/dgx-spark-inference-stack

Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace...

38
Emerging
1493 nercone-dev/zeta-llm-tool

Fully Open-source LLM Tool

38
Emerging
1494 icon-lab/BolT

Fused Window Transformers for fMRI Time Series Analysis...

38
Emerging
1495 styfeng/TinyDialogues

Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective...

38
Emerging
1496 KishanBagaria/dAbot

🤖 CLI tool to automate stuff on DeviantArt.com

38
Emerging
1497 Whiax/BERT-Transformer-Pytorch

Basic implementation of BERT and Transformer in Pytorch in one short python...

38
Emerging
1498 xingyizhou/GTR

Global Tracking Transformers, CVPR 2022

38
Emerging
1499 NTU-SQUAD/transformers-coqa

Albert for Conversational Question Answering Challenge

38
Emerging
1500 singhsidhukuldeep/Text-Summarizer

Comparing state of the art models for text summary generation

38
Emerging
« Prev 1 2 3 13 14 15 16 17 76 77 78 Next »