All Transformer Models

7,795 models ranked by quality score · Page 6 of 78

Showing 501–600 of 7,795
# Model Score Tier
501 dali92002/DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

47
Emerging
502 zackshen/gguf

a GGUF file parser

47
Emerging
503 noahho/CAAFE

Semi-automatic feature engineering process using Language Models and your...

47
Emerging
504 conceptofmind/LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding...

47
Emerging
505 Tzohar/PassLLM

World's most accurate password guessing AI tool. A PyTorch implementation of...

47
Emerging
506 kenhktsui/anyclassifier

One Line To Build Zero-Data Classifiers in Minutes

47
Emerging
507 EleutherAI/gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the...

47
Emerging
508 awslabs/mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

47
Emerging
509 mim-solutions/bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. Text...

47
Emerging
510 rxn4chemistry/rxn-onmt-models

Training of OpenNMT-based RXN models

47
Emerging
511 x-tabdeveloping/turftopic

Robust and fast topic models with sentence-transformers.

47
Emerging
512 Gleghorn-Lab/Protify

Low code molecular property prediction

47
Emerging
513 jobergum/browser-ml-inference

Edge Inference in Browser with Transformer NLP model

47
Emerging
514 predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

47
Emerging
515 lorenzorovida/FHE-BERT-Tiny

Source code for the paper "Transformer-based Language Models and Homomorphic...

47
Emerging
516 dorarad/gansformer

Generative Adversarial Transformers

47
Emerging
517 dusty-nv/NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for...

47
Emerging
518 kyegomez/SimplifiedTransformers

SimplifiedTransformer simplifies transformer block without affecting...

47
Emerging
519 jackaduma/Recurrent-LLM

The open-source LLM implementation of paper: RecurrentGPT: Interactive...

47
Emerging
520 chuanyangjin/MMToM-QA

[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind...

47
Emerging
521 geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using gpt and other language...

47
Emerging
522 monologg/KoBERT-Transformers

KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)

47
Emerging
523 qcri/LLMeBench

Benchmarking Large Language Models

47
Emerging
524 vinjn/llm-metahuman

An open solution for AI-powered photorealistic digital humans.

47
Emerging
525 The-AI-Summer/self-attention-cv

Implementation of various self-attention mechanisms focused on computer...

47
Emerging
526 The-Swarm-Corporation/MedGuard

MedGuard is a robust, production-grade Python library that ensures HIPAA...

47
Emerging
527 back2matching/turboquant

First open-source TurboQuant KV cache compression for LLM inference. Drop-in...

47
Emerging
528 ycq091044/BIOT

BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models.

47
Emerging
529 ssbuild/chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

47
Emerging
530 soulteary/docker-llama2-chat

Play LLaMA2 (official / Chinese version / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...

47
Emerging
531 xusenlinzy/api-for-open-llm

Openai style api for open large language models, using LLMs just as chatgpt!...

47
Emerging
532 cedrickchee/awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention...

47
Emerging
533 svdrecbd/mhc-mlx

MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by...

47
Emerging
534 ARM-software/keyword-transformer

Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769

47
Emerging
535 r2d4/rellm

Exact structure out of any language model completion.

47
Emerging
536 mlabonne/llm-datasets

Curated list of datasets and tools for post-training.

47
Emerging
537 Zefan-Cai/KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

47
Emerging
538 bobazooba/xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

47
Emerging
539 deepseek-ai/Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

47
Emerging
540 kyegomez/Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

47
Emerging
541 jhkchan/translategemma-cli

Local CLI for Google's TranslateGemma translation models with multi-platform...

47
Emerging
542 davidpirogov/toon-llm

Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization...

47
Emerging
543 LM-Kit/lm-kit-net-samples

.NET samples for LM-Kit.NET

47
Emerging
544 showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...

47
Emerging
545 jeya-maria-jose/TransWeather

Pytorch Code for the paper TransWeather - CVPR 2022

47
Emerging
546 cztomsik/ava

All-in-one desktop app for running LLMs locally.

47
Emerging
547 AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow

Implementation for "Improving Language Understanding by Generative...

47
Emerging
548 sinanuozdemir/oreilly-llm-rl-alignment

This training offers an intensive exploration into the frontier of...

47
Emerging
549 leaderj1001/BottleneckTransformers

Bottleneck Transformers for Visual Recognition

47
Emerging
550 Uminosachi/open-llm-webui

This repository contains a web application designed to execute relatively...

47
Emerging
551 prrao87/tweet-stance-prediction

Applying NLP transfer learning techniques to predict Tweet stance toward a topic

47
Emerging
552 mirpo/fastapi-gen

Build LLM-enabled FastAPI applications without build configuration.

47
Emerging
553 horseee/LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...

47
Emerging
554 haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...

47
Emerging
555 ictnlp/LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction...

47
Emerging
556 vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for...

47
Emerging
557 The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the...

47
Emerging
558 Cardinal-Operations/ORLM

ORLM: Training Large Language Models for Optimization Modeling

47
Emerging
559 willyfh/graph-transformer

An unofficial implementation of Graph Transformer (Masked Label Prediction:...

47
Emerging
560 NVlabs/Eagle

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

47
Emerging
561 kyegomez/MHMoE

Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch

47
Emerging
562 tigerchen52/query_level_uncertainty

query-level uncertainty in LLMs

47
Emerging
563 DaoD/INTERS

This is the repository for our paper "INTERS: Unlocking the Power of Large...

47
Emerging
564 jiwidi/Behavior-Sequence-Transformer-Pytorch

This is a pytorch implementation for the BST model from Alibaba...

47
Emerging
565 HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using...

47
Emerging
566 locuslab/wanda

A simple and effective LLM pruning approach.

47
Emerging
567 VinAIResearch/PhoBERT

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

47
Emerging
568 codewithdark-git/Building-LLMs-from-scratch

This repository guides you through the process of building a GPT-style Large...

47
Emerging
569 sagorbrur/bangla-bert

Bangla-Bert is a pretrained bert model for Bengali language

47
Emerging
570 Event-AHU/Medical_Image_Analysis

Foundation models based medical image analysis

47
Emerging
571 kyegomez/SingLoRA

This repository provides a minimal, single-file implementation of SingLoRA...

47
Emerging
572 DmitryNekrasov/ai-code-completion-idea-plugin

Implementation of IntelliJ IDEA code completion plugin using a local LLM.

47
Emerging
573 hiyouga/ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

47
Emerging
574 kayoyin/transformer-slt

Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)

47
Emerging
575 alpa-projects/alpa

Training and serving large-scale neural networks with auto parallelization.

47
Emerging
576 ymcui/Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project + 64K ultra-long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs...

47
Emerging
577 raymin0223/mixture_of_recursions

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive...

47
Emerging
578 fashn-AI/fashn-human-parser

Human parsing model for fashion and virtual try-on applications

47
Emerging
579 AviSoori1x/makeMoE

From scratch implementation of a sparse mixture of experts language model...

46
Emerging
580 xNul/chat-llama-discord-bot

A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other...

46
Emerging
581 chaitjo/learning-tsp

Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021)

46
Emerging
582 davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON

Segmentation deep learning ALgorithm based on MONai toolbox: single and...

46
Emerging
583 intel/intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA...

46
Emerging
584 JIA-Lab-research/LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

46
Emerging
585 FoundationVision/Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified...

46
Emerging
586 mit-han-lab/lite-transformer

[ICLR 2020] Lite Transformer with Long-Short Range Attention

46
Emerging
587 FudanDISC/DISC-LawLLM

[Chinese legal LLM] DISC-LawLLM: an intelligent legal system powered by large language...

46
Emerging
588 kmeng01/memit

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

46
Emerging
589 voidful/TFkit

🤖📇 Handling multiple NLP tasks in one pipeline

46
Emerging
590 quantium-ai/research

Research experiments exploring uncommon quant techniques.

46
Emerging
591 j-min/VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

46
Emerging
592 MagedSaeed/generate-sequences

A python package made to generate sequences (greedy and beam-search) from...

46
Emerging
593 KRR-Oxford/HierarchyTransformers

Language Models as Hierarchy Encoders

46
Emerging
594 THU-SI/Spatial-MLLM

[NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM...

46
Emerging
595 KristiyanVachev/Leaf-Question-Generation

Easy to use and understand multiple-choice question generation algorithm...

46
Emerging
596 Paranioar/Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation,...

46
Emerging
597 verifai/multiLLM

🚀 Invoke multiple large language models concurrently and rank the results...

46
Emerging
598 bytedance/byteir

A model compilation solution for various hardware

46
Emerging
599 thu-nics/MoA

[CoLM'25] The official implementation of the paper

46
Emerging
600 palewire/first-llm-classifier

Learn how journalists use large-language models to organize and analyze...

46
Emerging