All Transformer Models

7,795 models ranked by quality score · Page 4 of 78

Showing 301–400 of 7,795
# Model Score Tier
301 AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on...

51
Established
302 fixie-ai/ultravox

A fast multimodal LLM for real-time voice

51
Established
303 OpenVoiceOS/ovos-audio-transformer-plugin-ggwave

data over sound plugin

51
Established
304 monologg/JointBERT

Pytorch implementation of JointBERT: "BERT for Joint Intent Classification...

51
Established
305 jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

51
Established
306 ai-forever/ru-gpts

Russian GPT3 models.

51
Established
307 daviddaytw/react-native-transformers

Run local LLM from Huggingface in React-Native or Expo using onnxruntime.

51
Established
308 NiuTrans/LaTeXTrans

A tool for translating the content of LaTeX documents into various other...

51
Established
309 zjunlp/EasyInstruct

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

51
Established
310 abhimishra91/transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP tasks

51
Established
311 bshao001/ChatLearner

A chatbot implemented in TensorFlow based on the seq2seq model, with certain...

51
Established
312 vitoplantamura/OnnxStream

Lightweight inference library for ONNX files, written in C++. It can run...

51
Established
313 grammarly/gector

Official implementation of the papers "GECToR โ€“ Grammatical Error...

51
Established
314 LoicGrobol/zeldarose

Train transformer-based models.

51
Established
315 ikergarcia1996/Easy-Translate

Easy-Translate is a script for translating large text files with a SINGLE...

51
Established
316 lone-cloud/gerbil

A desktop app for running Large Language Models locally.

51
Established
317 rllm-team/rllm

Pytorch Library for Relational Table Learning with LLMs.

51
Established
318 tylerelyt/LLM-Workshop

๐ŸŒŸ Learn Large Language Model development through hands-on projects and...

51
Established
319 kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Documentย Q&A

51
Established
320 tensorgi/TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...

51
Established
321 Nicolepcx/transformers-the-definitive-guide

This is the official repository for the book Transformers - The Definitive Guide

51
Established
322 telekom/mltb2

Machine Learning Toolbox 2

51
Established
323 DashyDashOrg/pandas-llm

Pandas-LLM

51
Established
324 EricFillion/happy-transformer

Happy Transformer makes it easy to fine-tune and perform inference with NLP...

51
Established
325 rasbt/LLM-workshop-2024

A 4-hour coding workshop to understand how LLMs are implemented and used

51
Established
326 Rishit-dagli/Fast-Transformer

An implementation of Additive Attention

51
Established
327 CVHub520/X-AnyLabeling-Server

A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling

51
Established
328 shreyansh26/Annotated-ML-Papers

Annotations of the interesting ML papers I read

51
Established
329 kyegomez/LongNet

Implementation of plug in and play Attention from "LongNet: Scaling...

51
Established
330 kyegomez/PALI3

Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:...

51
Established
331 beehive-lab/GPULlama3.java

GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

51
Established
332 PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from...

51
Established
333 chanind/frame-semantic-transformer

Frame Semantic Parser based on T5 and FrameNet

51
Established
334 AdityaNG/kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs)...

51
Established
335 tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models....

51
Established
336 symfony/ai-platform

PHP library for interacting with AI platform provider.

51
Established
337 abelriboulot/onnxt5

Summarization, translation, sentiment-analysis, text-generation and more at...

51
Established
338 GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS

End to End Generative AI Industry Projects on LLM Models with...

51
Established
339 monologg/KoELECTRA

Pretrained ELECTRA Model for Korean

51
Established
340 pszemraj/textsum

CLI & Python API to easily summarize text-based files with transformers

51
Established
341 opendilab/LightRFT

LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement...

51
Established
342 alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which...

51
Established
343 lonePatient/Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model for...

51
Established
344 EleutherAI/knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

50
Established
345 avilum/minrlm

Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla...

50
Established
346 Strvm/meta-ai-api

Llama 3 API 70B & 405B (MetaAI Reverse Engineered)

50
Established
347 kyegomez/SwitchTransformers

Implementation of Switch Transformers from the paper: "Switch Transformers:...

50
Established
348 microsoft/sarathi-serve

A low-latency & high-throughput serving engine for LLMs

50
Established
349 zemlyansky/gpt-tfjs

GPT in TensorFlow.js

50
Established
350 pengzhangzhi/Open-dLLM

Open diffusion language model for code generation โ€” releasing pretraining,...

50
Established
351 tensorchord/modelz-llm

OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and...

50
Established
352 huggingface/transformers-bloom-inference

Fast Inference Solutions for BLOOM

50
Established
353 salesforce/TransmogrifAI

TransmogrifAI (pronounced trฤƒns-mลgหˆrษ™-fฤซ) is an AutoML library for building...

50
Established
354 Troyanovsky/Local-LLM-Comparison-Colab-UI

Compare the performance of different LLM that can be deployed locally on...

50
Established
355 gordicaleksa/pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've...

50
Established
356 bytefer/ollama-ocr

Implementing OCR with a local visual model run by ollama.

50
Established
357 ridgerchu/matmulfreellm

Implementation for MatMul-free LM.

50
Established
358 serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully...

50
Established
359 gitkaz/mlx_gguf_server

This is a FastAPI based LLM server. Load multiple LLM models (MLX or...

50
Established
360 jina-ai/rungpt

An open-source cloud-native of large multi-modal models (LMMs) serving framework.

50
Established
361 camenduru/text-generation-webui-colab

A colab gradio web UI for running Large Language Models

50
Established
362 Imalwayshere/Open-Detector

BERT-based AI-generated academic text detection model

50
Established
363 appvision-ai/fast-bert

Super easy library for BERT based NLP models

50
Established
364 EfficientMoE/MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.

50
Established
365 adrienpetralia/NILMFormer

[KDD 2025] NILMFormer: A Sequence-To-Sequence Non-Stationarity Aware...

50
Established
366 bytedance/video-SALMONN-2

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that...

50
Established
367 Gen-Verse/dLLM-RL

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for...

50
Established
368 cruiseresearchgroup/SensorLLM

[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language...

50
Established
369 modelscope/easydistill

a toolkit on knowledge distillation for large language models

50
Established
370 BioinfoMachineLearning/DeepInteract

A geometric deep learning framework (Geometric Transformers) for predicting...

50
Established
371 polakowo/gpt2bot

Your new Telegram buddy powered by transformers

50
Established
372 MDGrey33/pyvisionai

The PyVisionAI Official Repo

50
Established
373 ml4fp/2025-lbnl

ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics...

50
Established
374 keith2018/TinyGPT

Tiny C++ LLM inference implementation from scratch

50
Established
375 microsoft/rat-sql

A relation-aware semantic parsing model from English to SQL

50
Established
376 Tencent/TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert,...

50
Established
377 bbruceyuan/LLMs-Zero-to-Hero

ไปŽๆ— ๅๅฐๅ’ๅˆฐๅคงๆจกๅž‹๏ผˆLLM๏ผ‰ๅคง่‹ฑ้›„~ ๆฌข่ฟŽๅ…ณๆณจๅŽ็ปญ๏ผ๏ผ๏ผ

50
Established
378 HPAI-BSC/TuRTLe

TuRTLe: A Unified Evaluation of LLMs for RTL Generation ๐Ÿข (MLCAD 2025)

50
Established
379 uber-research/PPLM

Plug and Play Language Model implementation. Allows to steer topic and...

50
Established
380 sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for...

50
Established
381 SamsungSAILMontreal/nino

Code for "Accelerating Training with Neuron Interaction and Nowcasting...

50
Established
382 yotambraun/APDTFlow

APDTFlow is a modern and extensible forecasting framework for time series...

50
Established
383 olivkoch/nano-trm

An implementation of Tiny Recursive Models (TRM)

50
Established
384 BeRo1985/pasllm

PasLLM - LLM inference engine in Object Pascal (synced from my private work...

50
Established
385 dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

49
Emerging
386 CPJKU/wechsel

Code for WECHSEL: Effective initialization of subword embeddings for...

49
Emerging
387 JIA-Lab-research/MGM-Omni

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

49
Emerging
388 iusztinpaul/hands-on-llms

๐Ÿฆ– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป about ๐—Ÿ๐—Ÿ๐— ๐˜€, ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€, and ๐˜ƒ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ ๐——๐—•๐˜€ for free by designing, training,...

49
Emerging
389 stair-lab/mlhp

Machine Learning from Human Preferences

49
Emerging
390 yuanzhoulvpi2017/zero_nlp

ไธญๆ–‡nlp่งฃๅ†ณๆ–นๆกˆ(ๅคงๆจกๅž‹ใ€ๆ•ฐๆฎใ€ๆจกๅž‹ใ€่ฎญ็ปƒใ€ๆŽจ็†)

49
Emerging
391 deep-symbolic-mathematics/LLM-SR

[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on...

49
Emerging
392 matlab-deep-learning/transformer-models

Deep Learning Transformer models in MATLAB

49
Emerging
393 asahi417/lmppl

Calculate perplexity on a text with pre-trained language models. Support MLM...

49
Emerging
394 hkproj/pytorch-llama

LLaMA 2 implemented from scratch in PyTorch

49
Emerging
395 AmpereComputingAI/ampere_model_library

AML's goal is to make benchmarking of various AI architectures on Ampere...

49
Emerging
396 mead-ml/mead-baseline

Deep-Learning Model Exploration and Development for NLP

49
Emerging
397 minggnim/nlp-models

A repository for training transformer based models

49
Emerging
398 pbloem/former

Simple transformer implementation from scratch in pytorch. (archival, latest...

49
Emerging
399 CLUEbenchmark/CLUE

ไธญๆ–‡่ฏญ่จ€็†่งฃๆต‹่ฏ„ๅŸบๅ‡† Chinese Language Understanding Evaluation Benchmark: datasets,...

49
Emerging
400 google-research/bigbird

Transformers for Longer Sequences

49
Emerging
« Prev 1 2 3 4 5 6 76 77 78 Next »