All Transformer Models
7,795 models ranked by quality score · Page 7 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 601 |
inboxpraveen/LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text... |
|
Emerging |
| 602 |
10Nates/ollama-autocoder
A simple to use Ollama autocompletion engine with options exposed and... |
|
Emerging |
| 603 |
ai4co/routefinder
[TMLR 2025 + ICML 2024 FM-Wild Oral] RouteFinder: Towards Foundation Models... |
|
Emerging |
| 604 |
belladoreai/llama3-tokenizer-js
JS tokenizer for LLaMA 3 and LLaMA 3.1 |
|
Emerging |
| 605 |
megagonlabs/ginza-transformers
Use custom tokenizers in spacy-transformers |
|
Emerging |
| 606 |
Czi24/Awesome-MLLM-LLM-Colab
Happy experimenting with MLLM and LLM models! |
|
Emerging |
| 607 |
patil-suraj/onnx_transformers
Accelerated NLP pipelines for fast inference on CPU. Built with Transformers... |
|
Emerging |
| 608 |
kyegomez/TeraGPT
Train a production grade GPT in less than 400 lines of code. Better than... |
|
Emerging |
| 609 |
naru-project/naru
Neural Relation Understanding: neural cardinality estimators for tabular data |
|
Emerging |
| 610 |
vfeofanov/mantis
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time... |
|
Emerging |
| 611 |
THUDM/ProteinLM
Protein Language Model |
|
Emerging |
| 612 |
jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2. |
|
Emerging |
| 613 |
MegEngine/InferLLM
a lightweight LLM model inference framework |
|
Emerging |
| 614 |
ScrapeGraphAI/toonify
Toonify: Compact data format reducing LLM token usage by 30-60% |
|
Emerging |
| 615 |
mfoud444/ollamafreeapi
OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our... |
|
Emerging |
| 616 |
huggingface/optimum-graphcore
Blazing fast training of 🤗 Transformers on Graphcore IPUs |
|
Emerging |
| 617 |
Rishit-dagli/Perceiver
Implementation of Perceiver, General Perception with Iterative Attention |
|
Emerging |
| 618 |
microsoft/GODEL
Large-scale pretrained models for goal-directed dialog |
|
Emerging |
| 619 |
georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. |
|
Emerging |
| 620 |
sinanuozdemir/oreilly-optimizing-llms
Optimizing LLMs with Fine-Tuning and Prompt Engineering |
|
Emerging |
| 621 |
janelu9/EasyLLM
Running Large Language Model easily. |
|
Emerging |
| 622 |
NVlabs/RLP
[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a... |
|
Emerging |
| 623 |
Cognitive-AI-Systems/MAPF-GPT-DDG
[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding... |
|
Emerging |
| 624 |
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。 |
|
Emerging |
| 625 |
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language... |
|
Emerging |
| 626 |
qingsongedu/time-series-transformers-review
A professionally curated list of awesome resources (paper, code, data, etc.)... |
|
Emerging |
| 627 |
mbzuai-oryx/LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM |
|
Emerging |
| 628 |
replit/ReplitLM
Inference code and configs for the ReplitLM model family |
|
Emerging |
| 629 |
dddzg/up-detr
[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object... |
|
Emerging |
| 630 |
tintn/vision-transformer-from-scratch
A Simplified PyTorch Implementation of Vision Transformer (ViT) |
|
Emerging |
| 631 |
MahmoudWahdan/dialog-nlu
Tensorflow and Keras implementation of the state of the art researches in... |
|
Emerging |
| 632 |
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for... |
|
Emerging |
| 633 |
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops |
|
Emerging |
| 634 |
Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI |
|
Emerging |
| 635 |
KB-AI-Research/KB-ALBERT
KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델 |
|
Emerging |
| 636 |
OctoberChang/X-Transformer
X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text... |
|
Emerging |
| 637 |
zai-org/CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:... |
|
Emerging |
| 638 |
DUTIR-BioNLP/Taiyi-LLM
Taiyi 2, Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large... |
|
Emerging |
| 639 |
ItsPi3141/alpaca-electron
The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your... |
|
Emerging |
| 640 |
qubvel/transformers-notebooks
Inference and fine-tuning examples for vision models from 🤗 Transformers |
|
Emerging |
| 641 |
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF. |
|
Emerging |
| 642 |
yuanzhoulvpi2017/quick_sentence_transformers
sentence-transformers to onnx 让sbert模型推理效率更快 |
|
Emerging |
| 643 |
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型) |
|
Emerging |
| 644 |
SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large... |
|
Emerging |
| 645 |
Chongjie-Si/Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient... |
|
Emerging |
| 646 |
bradyz/cross_view_transformers
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral) |
|
Emerging |
| 647 |
lucidrains/deep-cross-attention
Implementation of the proposed DeepCrossAttention by Heddes et al at Google... |
|
Emerging |
| 648 |
jaisidhsingh/pytorch-mixtures
One-stop solutions for Mixture of Expert modules in PyTorch. |
|
Emerging |
| 649 |
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic... |
|
Emerging |
| 650 |
zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework. |
|
Emerging |
| 651 |
MiniMax-AI/MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention... |
|
Emerging |
| 652 |
allenai/smashed
SMASHED is a toolkit designed to apply transformations to samples in... |
|
Emerging |
| 653 |
icon-lab/ResViT
Official Implementation of ResViT: Residual Vision Transformers for... |
|
Emerging |
| 654 |
AIoT-MLSys-Lab/SVD-LLM
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2 |
|
Emerging |
| 655 |
nova-land/gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp |
|
Emerging |
| 656 |
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on... |
|
Emerging |
| 657 |
deveix/react-native-apple-llm
React Native Apple LLM plugin using Foundation Models |
|
Emerging |
| 658 |
Esmail-ibraheem/nanograd
nanograd🧠 ML/DL and neural net ecosystem, run models like GPT, llama, stable... |
|
Emerging |
| 659 |
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference |
|
Emerging |
| 660 |
HumanSignal/label-studio-transformers
Label data using HuggingFace's transformers and automatically get a... |
|
Emerging |
| 661 |
WangRongsheng/CareGPT
🌞 CareGPT... |
|
Emerging |
| 662 |
cahya-wirawan/indonesian-language-models
Indonesian Language Models and its Usage |
|
Emerging |
| 663 |
laelhalawani/gguf_modeldb
A quick and optimized solution to manage llama based gguf quantized models,... |
|
Emerging |
| 664 |
JIA-Lab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model" |
|
Emerging |
| 665 |
ymcui/Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3 |
|
Emerging |
| 666 |
waikato-llm/llm-dataset-converter
For converting LLM datasets from one format into another. |
|
Emerging |
| 667 |
InternLM/SIM-CoT
[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit... |
|
Emerging |
| 668 |
0x7o/RETRO-transformer
Easy-to-use Retrieval-Enhanced Transformer implementation |
|
Emerging |
| 669 |
domschl/HuggingFaceGuidedTourForMac
A guided tour on how to use HuggingFace large language models on Macs with... |
|
Emerging |
| 670 |
sobelio/llm-chain
`llm-chain` is a powerful rust crate for building chains in large language... |
|
Emerging |
| 671 |
romsto/Speculative-Decoding
Implementation of the paper Fast Inference from Transformers via Speculative... |
|
Emerging |
| 672 |
RWKV/rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model |
|
Emerging |
| 673 |
raimondilab/precogx
A predictor of GPCR couplings with G-proteins/B-arrs using Transformers |
|
Emerging |
| 674 |
kyegomez/USM
Implementation of Google's USM speech model in Pytorch |
|
Emerging |
| 675 |
tae898/erc
The official implementation of "EmoBERTa: Speaker-Aware Emotion Recognition... |
|
Emerging |
| 676 |
bhavnicksm/vanilla-transformer-jax
JAX/Flax implimentation of 'Attention Is All You Need' by Vaswani et al.... |
|
Emerging |
| 677 |
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft |
|
Emerging |
| 678 |
kyegomez/Finetuning-Suite
Finetune any model on HF in less than 30 seconds |
|
Emerging |
| 679 |
livingbio/fuzzy-json
Fuzzy-JSON is a compact Python package with no dependencies, designed to... |
|
Emerging |
| 680 |
gabeur/mmt
Multi-Modal Transformer for Video Retrieval |
|
Emerging |
| 681 |
AlekseyKorshuk/optimum-transformers
Accelerated NLP pipelines for fast inference on CPU and GPU. Built with... |
|
Emerging |
| 682 |
jarobyte91/pytorch_beam_search
A lightweight implementation of Beam Search for sequence models in PyTorch. |
|
Emerging |
| 683 |
mit-han-lab/hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing |
|
Emerging |
| 684 |
Lightning-Universe/lightning-transformers
Flexible components pairing 🤗 Transformers with :zap: Pytorch Lightning |
|
Emerging |
| 685 |
snwfdhmp/llm
Use any LLM from the command line. |
|
Emerging |
| 686 |
GURPREETKAURJETHRA/Generative-AI-LLM-Projects
Gen AI Large Language Model Projects |
|
Emerging |
| 687 |
ariannamethod/molequla
molequla.ai. live ecology of GPT organisms |
|
Emerging |
| 688 |
hugofloresgarcia/vampnet
music generation with masked transformers! |
|
Emerging |
| 689 |
jakubburkiewicz/node-red-contrib-ollama
A Node-RED module that wraps the ollama.js library, offering its... |
|
Emerging |
| 690 |
moeru-ai/inventory
🧠🃏 Your universal model catalog, everything, everywhere, all at once. |
|
Emerging |
| 691 |
ChangwenXu98/TransPolymer
Implementation of "TransPolymer: a Transformer-based language model for... |
|
Emerging |
| 692 |
marella/ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library. |
|
Emerging |
| 693 |
THUDM/LongBench
LongBench v2 and LongBench (ACL 25'&24') |
|
Emerging |
| 694 |
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback |
|
Emerging |
| 695 |
neurocard/neurocard
State-of-the-art neural cardinality estimators for join queries |
|
Emerging |
| 696 |
arm-education/Advanced-AI-Hardware-Software-Co-Design
Hands-on course materials for ML engineers to master extreme model... |
|
Emerging |
| 697 |
VarunGumma/IndicTransToolkit
A simple, consistent and extendable toolkit for IndicTrans2. (Pypi:... |
|
Emerging |
| 698 |
MaximeRobeyns/bayesian_lora
Bayesian Low-Rank Adaptation for Large Language Models |
|
Emerging |
| 699 |
malteos/llm-datasets
A collection of datasets for language model pretraining including scripts... |
|
Emerging |
| 700 |
QData/C-Tran
General Multi-label Image Classification with Transformers |
|
Emerging |