All Transformer Models

7,795 models ranked by quality score · Page 11 of 78

Showing 1001–1100 of 7,795
# Model Score Tier
1001 asigalov61/SuperPiano

Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for...

42
Emerging
1002 johnmai-dev/ChatMLX

🤖✨ChatMLX is a modern, open-source, high-performance chat application for...

42
Emerging
1003 softmax1/Flash-Attention-Softmax-N

CUDA and Triton implementations of Flash Attention with SoftmaxN.

42
Emerging
1004 ashleykleynhans/text-generation-docker

Docker image for the Text Generation Web UI: A Gradio web UI for Large...

42
Emerging
1005 RobertCsordas/transformer_generalization

The official repository for our paper "The Devil is in the Detail: Simple...

42
Emerging
1006 harleyszhang/llm_note

LLM notes, including model inference, transformer model structure, and llm...

42
Emerging
1007 TextGeneratorio/text-generator.io

Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io

42
Emerging
1008 shushanxingzhe/transformers_ner

Add CRF or LSTM+CRF for huggingface transformers bert to perform better on...

42
Emerging
1009 Gunale0926/SORSA

SORSA: Singular Values and Orthonormal Regularized Singular Vectors...

42
Emerging
1010 a-tokyo/ai-zero-shot-classifier

🧠 leverage advanced AI embeddings to perform multilingual zero-shot text...

42
Emerging
1011 ahmetkumass/yolo-gen

Train YOLO + VLM with one command. Auto-generate vision-language training...

42
Emerging
1012 ariannamethod/nanollama

Train Llama 3 models from scratch. Any scale, any personality. By Arianna Method.

42
Emerging
1013 xmindflow/Awesome-Transformer-in-Medical-Imaging

[MedIA Journal] An ultimately comprehensive paper list of Vision...

42
Emerging
1014 sinanuozdemir/oreilly-ai-pipelines

Designing and Deploying LLM Pipelines

42
Emerging
1015 bilibili/Index-1.9B

A lightweight multilingual LLM

42
Emerging
1016 monologg/GoEmotions-Korean

Korean version of GoEmotions Dataset 😍😢😱

42
Emerging
1017 SensAI-PT/LLaMa2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

42
Emerging
1018 xyjigsaw/LLM-Pretrain-SFT

Scripts for LLM pre-training and fine-tuning (with/without LoRA, DeepSpeed)

42
Emerging
1019 monologg/DistilKoBERT

Distillation of KoBERT from SKTBrain (Lightweight KoBERT)

42
Emerging
1020 HKUDS/OpenGraph

[EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models"

42
Emerging
1021 HyperCluster-Tech/manimator

Transform research papers and mathematical concepts into stunning visual...

42
Emerging
1022 josStorer/selfhostedAI

A collection of one-click self-hosted AI

42
Emerging
1023 Rishit-dagli/Conformer

An implementation of Conformer: Convolution-augmented Transformer for Speech...

42
Emerging
1024 tatsu-lab/alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method...

42
Emerging
1025 gitctrlx/llama.go

Llama from scratch in Go.

42
Emerging
1026 menon92/BangalASR

Transformer-based Bangla Speech Recognition | Encoder-Decoder Architecture

42
Emerging
1027 ai-forever/mgpt

Multilingual Generative Pretrained Model

42
Emerging
1028 amirfeder/CausaLM

CausaLM: Causal Model Explanation Through Counterfactual Language Models

42
Emerging
1029 NVlabs/GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges...

42
Emerging
1030 mojivalipour/symbolicgpt

Symbolic regression is the task of identifying a mathematical expression...

42
Emerging
1031 oxpig/CaLM

Protein language model trained on coding DNA

42
Emerging
1032 grctest/FastAPI-BitNet

Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.

42
Emerging
1033 aimclub/FEDOT.LLM

LLM-based prototype for nexgen AutoML

42
Emerging
1034 LLukas22/llm-rs-python

Unofficial Python bindings for the Rust llm library. 🐍❤️🦀

42
Emerging
1035 mmaaz60/EdgeNeXt

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently...

42
Emerging
1036 CAMeL-Lab/CAMeLBERT

Code and models for "The Interplay of Variant, Size, and Task Type in Arabic...

42
Emerging
1037 garyb9/twitter-llm-bot

Fully automatic asynchronous AI operated Twitter bot using Large Language...

42
Emerging
1038 therealoliver/Deepdive-llama3-from-scratch

Achieve the llama3 inference step-by-step, grasp the core concepts, master...

42
Emerging
1039 thuml/Flowformer

Code release for "Flowformer: Linearizing Transformers with...

42
Emerging
1040 LowinLi/fastgpt

⚡ boost inference speed of GPT models in transformers by onnxruntime

42
Emerging
1041 sedthh/BeatLearning

Open Source Generative AI Models for Automatic Rhythm Game Beatmap...

42
Emerging
1042 ayaka14732/llama-2-jax

JAX implementation of the Llama 2 model

42
Emerging
1043 biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...

42
Emerging
1044 dmanuel64/codablellm

A framework for creating and curating high-quality code datasets tailored...

42
Emerging
1045 hyperonym/basaran

Basaran is an open-source alternative to the OpenAI text completion API. It...

42
Emerging
1046 jankais3r/LLaMA_MPS

Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.

42
Emerging
1047 woodRock/fishy-business

Machine Learning for Rapid Evaporative Ionization Mass Spectrometry for...

42
Emerging
1048 NVIDIA/Cosmos-Tokenizer

A suite of image and video neural tokenizers

42
Emerging
1049 sangmichaelxie/doremi

PyTorch implementation of DoReMi, a method for optimizing the data mixture...

42
Emerging
1050 gotzmann/llama.go

llama.go is like llama.cpp in pure Golang!

42
Emerging
1051 declare-lab/flan-alpaca

This repository contains code for extending the Stanford Alpaca synthetic...

42
Emerging
1052 LISA-ITMO/LLM-resume-moderator

Automates moderation of Russian-language résumés using an LLM. For...

42
Emerging
1053 ZinYY/Online_RLHF

A PyTorch implementation of the paper "Provably Efficient Online RLHF with...

42
Emerging
1054 sayakpaul/robustness-vit

Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).

42
Emerging
1055 waltonfuture/Diabetica

[SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks

42
Emerging
1056 Dicklesworthstone/llm_introspective_compression_and_metacognition

A novel approach for transformer model introspection that enables saving,...

42
Emerging
1057 ZO-Bench/ZO-LLM

[ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization...

42
Emerging
1058 zjunlp/KnowledgeCircuits

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

42
Emerging
1059 liuqidong07/LLM-ESR

[NeurIPS'24 Spotlight] The official implementation code of LLM-ESR.

42
Emerging
1060 efeslab/fiddler

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

42
Emerging
1061 harleyszhang/lite_llama

A lightweight llama-like LLM inference framework based on Triton kernels.

42
Emerging
1062 shivendrra/SmallLanguageModel

An LLM cookbook, for building your own from scratch, all the way from...

42
Emerging
1063 sail-sg/understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

42
Emerging
1064 A-baoYang/alpaca-7b-chinese

Finetune LLaMA-7B with Chinese instruction datasets

42
Emerging
1065 taufeeque9/codebook-features

Sparse and discrete interpretability tool for neural networks

42
Emerging
1066 omron-sinicx/crystalframer

The official code repository for "Rethinking the role of frames for...

42
Emerging
1067 nickduran/align2-linguistic-alignment

ALIGN 2.0: Modern Python package for multi-level linguistic alignment...

42
Emerging
1068 RobbenRibery/TuoTuo

TuoTuo is a Topic Modeling library for Researchers and Engineers

42
Emerging
1069 golsun/DialogRPT

EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

42
Emerging
1070 westlake-repl/IDvs.MoRec

End-to-end Training for Multimodal Recommendation Systems

42
Emerging
1071 nuhmanpk/quick-llama

Run Ollama models on Google Colab

42
Emerging
1072 leehanchung/lora-instruct

Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA

42
Emerging
1073 macabdul9/AnyGen

A Unified and Minimalist Pipeline for Generating Outputs with LLMs...

42
Emerging
1074 thunlp/InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for...

42
Emerging
1075 sinanuozdemir/foundations-of-gen-ai

Transformer Architectures for Generative AI

42
Emerging
1076 Azure99/BlossomData

A fluent, scalable, and easy-to-use LLM data processing framework.

42
Emerging
1077 Knuckles-Team/genius-chatbot

Chatbot that uses any desired hugging face model or allows for scalable...

42
Emerging
1078 RobertCsordas/modules

The official repository for our paper "Are Neural Nets Modular? Inspecting...

41
Emerging
1079 Beomi/InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No...

41
Emerging
1080 AdrianBZG/llama-multimodal-vqa

Multimodal Instruction Tuning for Llama 3

41
Emerging
1081 jshilong/GPT4RoI

(ECCVW 2025) GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

41
Emerging
1082 adaptivetokensampling/ATS

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral...

41
Emerging
1083 jxiw/MambaInLlama

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and...

41
Emerging
1084 ictnlp/LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the...

41
Emerging
1085 Pyenb/Ollama-models

A collection of zipped Ollama models for offline use. Simply download,...

41
Emerging
1086 declare-lab/instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned...

41
Emerging
1087 poloclub/LLM-Attributor

LLM Attributor: Attribute LLM's Generated Text to Training Data

41
Emerging
1088 audioku/meta-transfer-learning

Implementation of meta-transfer-learning for ASR and LM (ACL 2020)

41
Emerging
1089 eugenehp/bitnet-cpp-rs

Rust bindings for bitnet.cpp based on llama-cpp-4

41
Emerging
1090 zjysteven/mink-plus-plus

[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training...

41
Emerging
1091 JohnMachado11/Build-a-Large-Language-Model-from-Scratch

Building a GPT-like LLM from scratch with PyTorch.

41
Emerging
1092 mlpc-ucsd/BLIVA

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich...

41
Emerging
1093 Eamon2009/Transformer-language-model

An educational implementation of a GPT-style language model built from...

41
Emerging
1094 okuvshynov/slowllama

Finetune llama2-70b and codellama on MacBook Air without quantization

41
Emerging
1095 Beomi/KcELECTRA

🤗 Korean Comments ELECTRA: an ELECTRA model trained on Korean comments

41
Emerging
1096 zai-org/GLM-Edge

GLM Series Edge Models

41
Emerging
1097 louisbrulenaudet/tsdae

Transformer-based Denoising AutoEncoder for Sentence Transformers...

41
Emerging
1098 datadreamer-dev/DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

41
Emerging
1099 HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,...

41
Emerging
1100 Kakz/prometheus-llm

PrometheusLLM is a unique transformer architecture inspired by dignity and...

41
Emerging