All Transformer Models
7,795 models ranked by quality score · Page 5 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 401 | Deep-Spark/DeepSparkInference<br>DeepSparkInference has selected 216 inference models of both small and large... | | Emerging |
| 402 | sapientinc/HRM<br>Hierarchical Reasoning Model Official Release | | Emerging |
| 403 | MediaBrain-SJTU/MING<br>明医 (MING): A Chinese large language model for medical consultation | | Emerging |
| 404 | higgsfield-ai/higgsfield<br>Fault-tolerant, highly scalable GPU orchestration, and a machine learning... | | Emerging |
| 405 | muxi-ai/onellm<br>Unified interface for interacting with various LLMs hundreds of models,... | | Emerging |
| 406 | Leeroo-AI/mergoo<br>A library for easily merging multiple LLM experts, and efficiently train the... | | Emerging |
| 407 | rese1f/MovieChat<br>[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding | | Emerging |
| 408 | kyegomez/MultiModalMamba<br>A novel implementation of fusing ViT with Mamba into a fast, agile, and high... | | Emerging |
| 409 | EvelynFan/FaceFormer<br>[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers | | Emerging |
| 410 | wxhcore/bumblecore<br>An LLM training framework built from the ground up, featuring a custom... | | Emerging |
| 411 | shell-nlp/gpt_server<br>gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models. | | Emerging |
| 412 | riyanshibohra/TuneKit<br>Upload your data → Get a fine-tuned SLM. Free. | | Emerging |
| 413 | VectorInstitute/vector-inference<br>Efficient LLM inference on Slurm clusters. | | Emerging |
| 414 | tjake/Jlama<br>Jlama is a modern LLM inference engine for Java | | Emerging |
| 415 | wuwangzhang1216/abliterix<br>Fully automatic censorship removal for language models. LoRA abliteration +... | | Emerging |
| 416 | floriankark/cs224n-win2223<br>Code and written solutions of the assignments of the Stanford CS224N:... | | Emerging |
| 417 | time-series-foundation-models/lag-llama<br>Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting | | Emerging |
| 418 | vtuber-plan/langport<br>Langport is a language model inference service | | Emerging |
| 419 | tomaarsen/attention_sinks<br>Extend existing LLMs way beyond the original training length with constant... | | Emerging |
| 420 | ngxson/wllama<br>WebAssembly binding for llama.cpp - Enabling on-browser LLM inference | | Emerging |
| 421 | dell-research-harvard/linktransformer<br>A convenient way to link, deduplicate, aggregate and cluster data(frames) in... | | Emerging |
| 422 | maxischuh/TwinBooster<br>Package for TwinBooster. Enables fast and powerful zero-shot molecular... | | Emerging |
| 423 | jy-yuan/KIVI<br>[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache | | Emerging |
| 424 | balisujohn/localwriter<br>A LibreOffice Writer extension that adds local-inference generative AI features. | | Emerging |
| 425 | EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications<br>Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications... | | Emerging |
| 426 | Shivanandroy/KeyPhraseTransformer<br>KeyPhraseTransformer lets you quickly extract key phrases, topics, themes... | | Emerging |
| 427 | huggingface/tflite-android-transformers<br>DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with... | | Emerging |
| 428 | IntelLabs/nlp-architect<br>A model library for exploring state-of-the-art deep learning topologies and... | | Emerging |
| 429 | hscspring/hcgf<br>Humanable Chat Generative-model Fine-tuning / LLM fine-tuning | | Emerging |
| 430 | yoshoku/llama_cpp.rb<br>llama_cpp.rb provides Ruby bindings for llama.cpp | | Emerging |
| 431 | alephpi/Texo<br>A minimalist SOTA LaTeX OCR model with only 20M parameters, running in... | | Emerging |
| 432 | OpenGVLab/OmniQuant<br>[ICLR 2024 spotlight] OmniQuant is a simple and powerful quantization... | | Emerging |
| 433 | HUSTAI/uie_pytorch<br>PyTorch implementation of the PaddleNLP UIE model | | Emerging |
| 434 | MadryLab/context-cite<br>Attribute (or cite) statements generated by LLMs back to in-context information. | | Emerging |
| 435 | AMontgomerie/question_generator<br>An NLP system for generating reading comprehension questions | | Emerging |
| 436 | intel/ipex-llm<br>Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,... | | Emerging |
| 437 | oripress/AlgoTune<br>AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and... | | Emerging |
| 438 | multimodal-art-projection/YuE<br>YuE: Open Full-song Music Generation Foundation Model, something similar to... | | Emerging |
| 439 | larslorch/avici<br>Amortized Inference for Causal Structure Learning, NeurIPS 2022 | | Emerging |
| 440 | helpmefindaname/transformer-smaller-training-vocab<br>Temporarily remove unused tokens during training to save RAM and improve speed. | | Emerging |
| 441 | graphdeeplearning/graphtransformer<br>Graph Transformer Architecture. Source code for "A Generalization of... | | Emerging |
| 442 | WangRongsheng/XrayGLM<br>🩺 The first Chinese multimodal medical large model that can read chest X-rays | | Emerging |
| 443 | curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI<br>Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers... | | Emerging |
| 444 | jmont-dev/ollama-hpp<br>Modern, Header-only C++ bindings for the Ollama API. | | Emerging |
| 445 | fcakyon/video-transformers<br>Easiest way of fine-tuning HuggingFace video classification models | | Emerging |
| 446 | OFA-Sys/Chinese-CLIP<br>Chinese version of CLIP which achieves Chinese cross-modal retrieval and... | | Emerging |
| 447 | Beomi/KoAlpaca<br>KoAlpaca: An open-source language model that understands Korean instructions | | Emerging |
| 448 | ChanithaAbey/AI-Agent-for-Stock-Prediction<br>An AI Agent for stock data analysis, news retrieval, and prediction; powered... | | Emerging |
| 449 | xrsrke/toolformer<br>Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools | | Emerging |
| 450 | hila-chefer/Transformer-Explainability<br>[CVPR 2021] Official PyTorch implementation for Transformer Interpretability... | | Emerging |
| 451 | X-D-Lab/LangChain-ChatGLM-Webui<br>Automatic question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series | | Emerging |
| 452 | steering-vectors/steering-vectors<br>Steering vectors for transformer language models in Pytorch / Huggingface | | Emerging |
| 453 | kyegomez/GPT4o<br>Community Open Source Implementation of GPT4o in PyTorch | | Emerging |
| 454 | VHellendoorn/Code-LMs<br>Guide to using pre-trained large language models of source code | | Emerging |
| 455 | blegat/LINMA2472<br>Course material for the course LINMA2472 at UCLouvain | | Emerging |
| 456 | ymcui/Chinese-LLaMA-Alpaca<br>Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment | | Emerging |
| 457 | fboulnois/llama-cpp-docker<br>Run llama.cpp in a GPU accelerated Docker container | | Emerging |
| 458 | cheahjs/free-llm-api-resources<br>A list of free LLM inference resources accessible via API. | | Emerging |
| 459 | TUDB-Labs/mLoRA<br>An Efficient "Factory" to Build Multiple LoRA Adapters | | Emerging |
| 460 | NVIDIA-AI-IOT/nanoowl<br>A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT. | | Emerging |
| 461 | Tencent/TencentPretrain<br>Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo | | Emerging |
| 462 | kyegomez/LIMoE<br>Implementation of the "the first large-scale multimodal mixture of experts... | | Emerging |
| 463 | snowby666/poe-api-wrapper<br>👾 A Python API wrapper for Poe.com. With this, you will have free access to... | | Emerging |
| 464 | yuriwa/crewai-sheets-ui<br>Use Google Sheets as a GUI for crewAI | | Emerging |
| 465 | deepset-ai/FARM<br>🏡 Fast & easy transfer learning for NLP. Harvesting... | | Emerging |
| 466 | microsoft/augmented-interpretable-models<br>Interpretable and efficient predictors using pre-trained language models.... | | Emerging |
| 467 | mallorbc/Finetune_LLMs<br>Repo for fine-tuning Causal LLMs | | Emerging |
| 468 | FoundationVision/Infinity<br>[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for... | | Emerging |
| 469 | nuance1979/llama-server<br>LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. | | Emerging |
| 470 | gjbex/Deploying-LLMs-locally<br>Material for a training on AI tools | | Emerging |
| 471 | AndrewZhe/lawyer-llama<br>Chinese legal LLaMA (LLaMA for the Chinese legal domain) | | Emerging |
| 472 | local-ai-zone/local-ai-zone.github.io<br>Discover the Best AI Models for Your PC | | Emerging |
| 473 | affjljoo3581/GPT2<br>PyTorch Implementation of OpenAI GPT-2 | | Emerging |
| 474 | MiniMax-AI/MiniMax-01<br>The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model... | | Emerging |
| 475 | Esmail-ibraheem/Axon<br>AI research lab🔬: implementations of AI papers and theoretical research:... | | Emerging |
| 476 | chengchingwen/Transformers.jl<br>Julia Implementation of Transformer models | | Emerging |
| 477 | datawhalechina/llms-from-scratch-cn<br>Build a large language model from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work | | Emerging |
| 478 | rojagtap/transformer-abstractive-summarization<br>Abstractive Text Summarization using Transformer | | Emerging |
| 479 | bryanlimy/tf2-transformer-chatbot<br>Transformer Chatbot in TensorFlow 2 with TPU support. | | Emerging |
| 480 | monologg/GoEmotions-pytorch<br>Pytorch Implementation of GoEmotions 😍😢😱 | | Emerging |
| 481 | kyegomez/HLT<br>Implementation of the transformer from the paper: "Real-World Humanoid... | | Emerging |
| 482 | explosion/curated-transformers<br>🤖 A PyTorch library of curated Transformer models and their composable components | | Emerging |
| 483 | ruanchaves/hashformers<br>Accurate word segmentation for hashtags and text, powered by Transformers... | | Emerging |
| 484 | Thinklab-SJTU/Crossformer<br>Official implementation of our ICLR 2023 paper "Crossformer: Transformer... | | Emerging |
| 485 | NVIDIA/FasterTransformer<br>Transformer related optimization, including BERT, GPT | | Emerging |
| 486 | worldbank/REaLTabFormer<br>A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer... | | Emerging |
| 487 | slwang-ustc/nano-vllm-v1<br>Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill | | Emerging |
| 488 | THUDM/LongWriter<br>[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | | Emerging |
| 489 | IbrahimSobh/llms<br>Large Language Models: In this repository Language models are introduced... | | Emerging |
| 490 | OscarKjell/text<br>Using Transformers from HuggingFace in R | | Emerging |
| 491 | microsoft/DialoGPT<br>Large-scale pretraining for dialogue | | Emerging |
| 492 | SakanaAI/doc-to-lora<br>Hypernetworks that update LLMs to remember factual information | | Emerging |
| 493 | tensorops/TransformerX<br>Flexible Python library providing building blocks (layers) for reproducible... | | Emerging |
| 494 | SearchSavior/OpenArc<br>Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS,... | | Emerging |
| 495 | thammegowda/nllb-serve<br>Meta's "No Language Left Behind" models served as web app and REST API | | Emerging |
| 496 | spencerbraun/anomaly_transformer_pytorch<br>PyTorch implementation of Anomaly Transformer: Time Series Anomaly Detection... | | Emerging |
| 497 | jianghoucheng/AlphaEdit<br>AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,... | | Emerging |
| 498 | kakaobrain/kogpt<br>KakaoBrain KoGPT (Korean Generative Pre-trained Transformer) | | Emerging |
| 499 | ALucek/ppt2desc<br>Convert PowerPoint files into semantically rich text using vision language models | | Emerging |
| 500 | Facico/Chinese-Vicuna<br>Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model... | | Emerging |