All Transformer Models

7,795 models ranked by quality score · Page 5 of 78

Showing 401–500 of 7,795
# Model Score Tier
401 Deep-Spark/DeepSparkInference

DeepSparkInference has selected 216 inference models of both small and large...

49
Emerging
402 sapientinc/HRM

Hierarchical Reasoning Model Official Release

49
Emerging
403 MediaBrain-SJTU/MING

MING (明医): a Chinese large language model for medical consultation

49
Emerging
404 higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning...

49
Emerging
405 muxi-ai/onellm

Unified interface for interacting with various LLMs: hundreds of models,...

49
Emerging
406 Leeroo-AI/mergoo

A library for easily merging multiple LLM experts and efficiently training the...

49
Emerging
407 rese1f/MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

49
Emerging
408 kyegomez/MultiModalMamba

A novel implementation of fusing ViT with Mamba into a fast, agile, and high...

49
Emerging
409 EvelynFan/FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

49
Emerging
410 wxhcore/bumblecore

An LLM training framework built from the ground up, featuring a custom...

49
Emerging
411 shell-nlp/gpt_server

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image-editing, and text-to-video models.

49
Emerging
412 riyanshibohra/TuneKit

Upload your data → Get a fine-tuned SLM. Free.

49
Emerging
413 VectorInstitute/vector-inference

Efficient LLM inference on Slurm clusters.

49
Emerging
414 tjake/Jlama

Jlama is a modern LLM inference engine for Java

49
Emerging
415 wuwangzhang1216/abliterix

Fully automatic censorship removal for language models. LoRA abliteration +...

49
Emerging
416 floriankark/cs224n-win2223

Code and written solutions of the assignments of the Stanford CS224N:...

49
Emerging
417 time-series-foundation-models/lag-llama

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

49
Emerging
418 vtuber-plan/langport

Langport is a language model inference service

49
Emerging
419 tomaarsen/attention_sinks

Extend existing LLMs way beyond the original training length with constant...

49
Emerging
420 ngxson/wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

49
Emerging
421 dell-research-harvard/linktransformer

A convenient way to link, deduplicate, aggregate and cluster data(frames) in...

49
Emerging
422 maxischuh/TwinBooster

Package for TwinBooster. Enables fast and powerful zero-shot molecular...

49
Emerging
423 jy-yuan/KIVI

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

49
Emerging
424 balisujohn/localwriter

A LibreOffice Writer extension that adds local-inference generative AI features.

49
Emerging
425 EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications...

49
Emerging
426 Shivanandroy/KeyPhraseTransformer

KeyPhraseTransformer lets you quickly extract key phrases, topics, themes...

49
Emerging
427 huggingface/tflite-android-transformers

DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with...

49
Emerging
428 IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and...

49
Emerging
429 hscspring/hcgf

Humanable Chat Generative-model Fine-tuning | LLM fine-tuning

49
Emerging
430 yoshoku/llama_cpp.rb

llama_cpp.rb provides Ruby bindings for llama.cpp

49
Emerging
431 alephpi/Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters, running in...

49
Emerging
432 OpenGVLab/OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization...

49
Emerging
433 HUSTAI/uie_pytorch

PyTorch implementation of the PaddleNLP UIE model

49
Emerging
434 MadryLab/context-cite

Attribute (or cite) statements generated by LLMs back to in-context information.

49
Emerging
435 AMontgomerie/question_generator

An NLP system for generating reading comprehension questions

49
Emerging
436 intel/ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,...

49
Emerging
437 oripress/AlgoTune

AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and...

49
Emerging
438 multimodal-art-projection/YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to...

49
Emerging
439 larslorch/avici

Amortized Inference for Causal Structure Learning, NeurIPS 2022

49
Emerging
440 helpmefindaname/transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and improve speed.

48
Emerging
441 graphdeeplearning/graphtransformer

Graph Transformer Architecture. Source code for "A Generalization of...

48
Emerging
442 WangRongsheng/XrayGLM

🩺 The first Chinese multimodal medical large model that can read chest X-rays

48
Emerging
443 curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI

Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers...

48
Emerging
444 jmont-dev/ollama-hpp

Modern, Header-only C++ bindings for the Ollama API.

48
Emerging
445 fcakyon/video-transformers

Easiest way of fine-tuning HuggingFace video classification models

48
Emerging
446 OFA-Sys/Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and...

48
Emerging
447 Beomi/KoAlpaca

KoAlpaca: An open-source language model that understands Korean instructions

48
Emerging
448 ChanithaAbey/AI-Agent-for-Stock-Prediction

An AI Agent for stock data analysis, news retrieval, and prediction; powered...

48
Emerging
449 xrsrke/toolformer

Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

48
Emerging
450 hila-chefer/Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability...

48
Emerging
451 X-D-Lab/LangChain-ChatGLM-Webui

Automatic question answering over local knowledge bases, built on LangChain and LLMs such as the ChatGLM-6B series

48
Emerging
452 steering-vectors/steering-vectors

Steering vectors for transformer language models in PyTorch / Hugging Face

48
Emerging
453 kyegomez/GPT4o

Community Open Source Implementation of GPT4o in PyTorch

48
Emerging
454 VHellendoorn/Code-LMs

Guide to using pre-trained large language models of source code

48
Emerging
455 blegat/LINMA2472

Course material for the course LINMA2472 at UCLouvain

48
Emerging
456 ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment

48
Emerging
457 fboulnois/llama-cpp-docker

Run llama.cpp in a GPU accelerated Docker container

48
Emerging
458 cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

48
Emerging
459 TUDB-Labs/mLoRA

An Efficient "Factory" to Build Multiple LoRA Adapters

48
Emerging
460 NVIDIA-AI-IOT/nanoowl

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

48
Emerging
461 Tencent/TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

48
Emerging
462 kyegomez/LIMoE

Implementation of "the first large-scale multimodal mixture of experts...

48
Emerging
463 snowby666/poe-api-wrapper

👾 A Python API wrapper for Poe.com. With this, you will have free access to...

48
Emerging
464 yuriwa/crewai-sheets-ui

Use Google Sheets as a GUI for crewAI

48
Emerging
465 deepset-ai/FARM

🏡 Fast & easy transfer learning for NLP. Harvesting...

48
Emerging
466 microsoft/augmented-interpretable-models

Interpretable and efficient predictors using pre-trained language models....

48
Emerging
467 mallorbc/Finetune_LLMs

Repo for fine-tuning Causal LLMs

48
Emerging
468 FoundationVision/Infinity

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for...

48
Emerging
469 nuance1979/llama-server

LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.

48
Emerging
470 gjbex/Deploying-LLMs-locally

Material for a training on AI tools

48
Emerging
471 AndrewZhe/lawyer-llama

Chinese legal LLaMA (LLaMA for the Chinese legal domain)

48
Emerging
472 local-ai-zone/local-ai-zone.github.io

Discover the Best AI Models for Your PC

48
Emerging
473 affjljoo3581/GPT2

PyTorch Implementation of OpenAI GPT-2

48
Emerging
474 MiniMax-AI/MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model...

48
Emerging
475 Esmail-ibraheem/Axon

AI research lab🔬: implementations of AI papers and theoretical research:...

48
Emerging
476 chengchingwen/Transformers.jl

Julia Implementation of Transformer models

48
Emerging
477 datawhalechina/llms-from-scratch-cn

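Build large language models from scratch with only basic Python; construct GLM4, Llama3, and RWKV6 step by step to understand in depth how large models work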

48
Emerging
478 rojagtap/transformer-abstractive-summarization

Abstractive Text Summarization using Transformer

48
Emerging
479 bryanlimy/tf2-transformer-chatbot

Transformer Chatbot in TensorFlow 2 with TPU support.

48
Emerging
480 monologg/GoEmotions-pytorch

PyTorch Implementation of GoEmotions 😍😢😱

48
Emerging
481 kyegomez/HLT

Implementation of the transformer from the paper: "Real-World Humanoid...

48
Emerging
482 explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

48
Emerging
483 ruanchaves/hashformers

Accurate word segmentation for hashtags and text, powered by Transformers...

48
Emerging
484 Thinklab-SJTU/Crossformer

Official implementation of our ICLR 2023 paper "Crossformer: Transformer...

48
Emerging
485 NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

48
Emerging
486 worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer...

48
Emerging
487 slwang-ustc/nano-vllm-v1

Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill

48
Emerging
488 THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

48
Emerging
489 IbrahimSobh/llms

Large Language Models: In this repository Language models are introduced...

48
Emerging
490 OscarKjell/text

Using Transformers from HuggingFace in R

48
Emerging
491 microsoft/DialoGPT

Large-scale pretraining for dialogue

48
Emerging
492 SakanaAI/doc-to-lora

Hypernetworks that update LLMs to remember factual information

48
Emerging
493 tensorops/TransformerX

Flexible Python library providing building blocks (layers) for reproducible...

48
Emerging
494 SearchSavior/OpenArc

Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS,...

48
Emerging
495 thammegowda/nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API

48
Emerging
496 spencerbraun/anomaly_transformer_pytorch

PyTorch implementation of Anomaly Transformer: Time Series Anomaly Detection...

48
Emerging
497 jianghoucheng/AlphaEdit

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,...

48
Emerging
498 kakaobrain/kogpt

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

48
Emerging
499 ALucek/ppt2desc

Convert PowerPoint files into semantically rich text using vision language models

48
Emerging
500 Facico/Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——...

48
Emerging