All Transformer Models

7,795 models ranked by quality score · Page 7 of 78

Showing 601–700 of 7,795
# Model Score Tier
601 inboxpraveen/LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text...

46
Emerging
602 10Nates/ollama-autocoder

A simple to use Ollama autocompletion engine with options exposed and...

46
Emerging
603 ai4co/routefinder

[TMLR 2025 + ICML 2024 FM-Wild Oral] RouteFinder: Towards Foundation Models...

46
Emerging
604 belladoreai/llama3-tokenizer-js

JS tokenizer for LLaMA 3 and LLaMA 3.1

46
Emerging
605 megagonlabs/ginza-transformers

Use custom tokenizers in spacy-transformers

46
Emerging
606 Czi24/Awesome-MLLM-LLM-Colab

Happy experimenting with MLLM and LLM models!

46
Emerging
607 patil-suraj/onnx_transformers

Accelerated NLP pipelines for fast inference on CPU. Built with Transformers...

46
Emerging
608 kyegomez/TeraGPT

Train a production grade GPT in less than 400 lines of code. Better than...

46
Emerging
609 naru-project/naru

Neural Relation Understanding: neural cardinality estimators for tabular data

46
Emerging
610 vfeofanov/mantis

Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time...

46
Emerging
611 THUDM/ProteinLM

Protein Language Model

46
Emerging
612 jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is an Image Captioning Model based on OpenAI's CLIP and GPT-2.

46
Emerging
613 MegEngine/InferLLM

A lightweight LLM inference framework

46
Emerging
614 ScrapeGraphAI/toonify

Toonify: Compact data format reducing LLM token usage by 30-60%

46
Emerging
615 mfoud444/ollamafreeapi

OllamaFreeAPI: Free Distributed API for Ollama LLMs. Public gateway to our...

46
Emerging
616 huggingface/optimum-graphcore

Blazing fast training of 🤗 Transformers on Graphcore IPUs

46
Emerging
617 Rishit-dagli/Perceiver

Implementation of Perceiver, General Perception with Iterative Attention

46
Emerging
618 microsoft/GODEL

Large-scale pretrained models for goal-directed dialog

46
Emerging
619 georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

46
Emerging
620 sinanuozdemir/oreilly-optimizing-llms

Optimizing LLMs with Fine-Tuning and Prompt Engineering

46
Emerging
621 janelu9/EasyLLM

Run large language models easily.

46
Emerging
622 NVlabs/RLP

[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a...

46
Emerging
623 Cognitive-AI-Systems/MAPF-GPT-DDG

[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding...

46
Emerging
624 IDEA-CCNL/Fengshenbang-LM

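Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.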

46
Emerging
625 DAMO-NLP-SG/Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language...

46
Emerging
626 qingsongedu/time-series-transformers-review

A professionally curated list of awesome resources (paper, code, data, etc.)...

46
Emerging
627 mbzuai-oryx/LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

46
Emerging
628 replit/ReplitLM

Inference code and configs for the ReplitLM model family

46
Emerging
629 dddzg/up-detr

[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object...

46
Emerging
630 tintn/vision-transformer-from-scratch

A Simplified PyTorch Implementation of Vision Transformer (ViT)

46
Emerging
631 MahmoudWahdan/dialog-nlu

TensorFlow and Keras implementation of state-of-the-art research in...

46
Emerging
632 young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for...

46
Emerging
633 dvmazur/mixtral-offloading

Run Mixtral-8x7B models in Colab or consumer desktops

46
Emerging
634 Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

46
Emerging
635 KB-AI-Research/KB-ALBERT

A Korean ALBERT model specialized for the economics/finance domain, provided by KB Kookmin Bank

46
Emerging
636 OctoberChang/X-Transformer

X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text...

46
Emerging
637 zai-org/CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:...

46
Emerging
638 DUTIR-BioNLP/Taiyi-LLM

Taiyi 2, Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large...

46
Emerging
639 ItsPi3141/alpaca-electron

The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your...

46
Emerging
640 qubvel/transformers-notebooks

Inference and fine-tuning examples for vision models from 🤗 Transformers

46
Emerging
641 RLHFlow/RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

46
Emerging
642 yuanzhoulvpi2017/quick_sentence_transformers

sentence-transformers to ONNX: faster inference for SBERT models

46
Emerging
643 LianjiaTech/BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational LLM)

46
Emerging
644 SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large...

46
Emerging
645 Chongjie-Si/Subspace-Tuning

A generalized framework for subspace tuning methods in parameter efficient...

46
Emerging
646 bradyz/cross_view_transformers

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

46
Emerging
647 lucidrains/deep-cross-attention

Implementation of the proposed DeepCrossAttention by Heddes et al at Google...

46
Emerging
648 jaisidhsingh/pytorch-mixtures

One-stop solutions for Mixture of Expert modules in PyTorch.

46
Emerging
649 hila-chefer/Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic...

46
Emerging
650 zjunlp/KnowLM

An Open-sourced Knowledgeable Large Language Model Framework.

46
Emerging
651 MiniMax-AI/MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention...

46
Emerging
652 allenai/smashed

SMASHED is a toolkit designed to apply transformations to samples in...

46
Emerging
653 icon-lab/ResViT

Official Implementation of ResViT: Residual Vision Transformers for...

46
Emerging
654 AIoT-MLSys-Lab/SVD-LLM

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

46
Emerging
655 nova-land/gbnf-compiler

Plug n Play GBNF Compiler for llama.cpp

46
Emerging
656 AutoGPTQ/AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on...

46
Emerging
657 deveix/react-native-apple-llm

React Native Apple LLM plugin using Foundation Models

46
Emerging
658 Esmail-ibraheem/nanograd

nanograd🧠 ML/DL and neural net ecosystem, run models like GPT, llama, stable...

46
Emerging
659 cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

46
Emerging
660 HumanSignal/label-studio-transformers

Label data using HuggingFace's transformers and automatically get a...

46
Emerging
661 WangRongsheng/CareGPT

🌞 CareGPT...

46
Emerging
662 cahya-wirawan/indonesian-language-models

Indonesian Language Models and Their Usage

45
Emerging
663 laelhalawani/gguf_modeldb

A quick and optimized solution to manage llama based gguf quantized models,...

45
Emerging
664 JIA-Lab-research/LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

45
Emerging
665 ymcui/Chinese-LLaMA-Alpaca-3

Chinese LLaMA-Alpaca phase 3 project (Chinese Llama-3 LLMs) developed from Meta Llama 3

45
Emerging
666 waikato-llm/llm-dataset-converter

For converting LLM datasets from one format into another.

45
Emerging
667 InternLM/SIM-CoT

[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit...

45
Emerging
668 0x7o/RETRO-transformer

Easy-to-use Retrieval-Enhanced Transformer implementation

45
Emerging
669 domschl/HuggingFaceGuidedTourForMac

A guided tour on how to use HuggingFace large language models on Macs with...

45
Emerging
670 sobelio/llm-chain

`llm-chain` is a powerful rust crate for building chains in large language...

45
Emerging
671 romsto/Speculative-Decoding

Implementation of the paper Fast Inference from Transformers via Speculative...

45
Emerging
672 RWKV/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

45
Emerging
673 raimondilab/precogx

A predictor of GPCR couplings with G-proteins/B-arrs using Transformers

45
Emerging
674 kyegomez/USM

Implementation of Google's USM speech model in PyTorch

45
Emerging
675 tae898/erc

The official implementation of "EmoBERTa: Speaker-Aware Emotion Recognition...

45
Emerging
676 bhavnicksm/vanilla-transformer-jax

JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al....

45
Emerging
677 ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

45
Emerging
678 kyegomez/Finetuning-Suite

Finetune any model on HF in less than 30 seconds

45
Emerging
679 livingbio/fuzzy-json

Fuzzy-JSON is a compact Python package with no dependencies, designed to...

45
Emerging
680 gabeur/mmt

Multi-Modal Transformer for Video Retrieval

45
Emerging
681 AlekseyKorshuk/optimum-transformers

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with...

45
Emerging
682 jarobyte91/pytorch_beam_search

A lightweight implementation of Beam Search for sequence models in PyTorch.

45
Emerging
683 mit-han-lab/hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

45
Emerging
684 Lightning-Universe/lightning-transformers

Flexible components pairing 🤗 Transformers with ⚡ PyTorch Lightning

45
Emerging
685 snwfdhmp/llm

Use any LLM from the command line.

45
Emerging
686 GURPREETKAURJETHRA/Generative-AI-LLM-Projects

Gen AI Large Language Model Projects

45
Emerging
687 ariannamethod/molequla

molequla.ai. live ecology of GPT organisms

45
Emerging
688 hugofloresgarcia/vampnet

music generation with masked transformers!

45
Emerging
689 jakubburkiewicz/node-red-contrib-ollama

A Node-RED module that wraps the ollama.js library, offering its...

45
Emerging
690 moeru-ai/inventory

🧠🃏 Your universal model catalog, everything, everywhere, all at once.

45
Emerging
691 ChangwenXu98/TransPolymer

Implementation of "TransPolymer: a Transformer-based language model for...

45
Emerging
692 marella/ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

45
Emerging
693 THUDM/LongBench

LongBench v2 and LongBench (ACL '25 & '24)

45
Emerging
694 microsoft/LLF-Bench

A benchmark for evaluating learning agents based on just language feedback

45
Emerging
695 neurocard/neurocard

State-of-the-art neural cardinality estimators for join queries

45
Emerging
696 arm-education/Advanced-AI-Hardware-Software-Co-Design

Hands-on course materials for ML engineers to master extreme model...

45
Emerging
697 VarunGumma/IndicTransToolkit

A simple, consistent and extendable toolkit for IndicTrans2. (Pypi:...

45
Emerging
698 MaximeRobeyns/bayesian_lora

Bayesian Low-Rank Adaptation for Large Language Models

45
Emerging
699 malteos/llm-datasets

A collection of datasets for language model pretraining including scripts...

45
Emerging
700 QData/C-Tran

General Multi-label Image Classification with Transformers

45
Emerging