All Transformer Models

7,795 models ranked by quality score · Page 8 of 78

Showing 701–800 of 7,795
# Model Score Tier
701 huggingface/awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated with...

45
Emerging
702 lambdavi/SatDrive-SegFL

MLDL '23 Project: Federated Learning and Semantic Segmentation for...

45
Emerging
703 AGI-Edgerunners/LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for...

45
Emerging
704 DarshanDeshpande/jax-models

Unofficial JAX implementations of deep learning research papers

45
Emerging
705 lyuchenyang/Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text...

45
Emerging
706 Kaleidophon/token2index

A lightweight but powerful library to build token indices for NLP tasks,...

45
Emerging
707 analyticalrohit/llms-from-scratch

Build a ChatGPT like LLM from scratch in PyTorch, explained step by step.

45
Emerging
708 Hugging-Face-Supporter/tftokenizers

Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels

45
Emerging
709 ddh0/easy-llama

Python package wrapping llama.cpp for on-device LLM inference

45
Emerging
710 Mann1988/awesome-claude-skills

📊 Explore high-quality Claude skills focused on business analysis and...

45
Emerging
711 eloialonso/iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

45
Emerging
712 FareedKhan-dev/train-llama4

Building LLaMA 4 MoE from Scratch

45
Emerging
713 RUCAIBox/TextBox

TextBox 2.0 is a text generation library with pre-trained language models

45
Emerging
714 Muennighoff/vilio

🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle

45
Emerging
715 allenai/RL4LMs

A modular RL library to fine-tune language models to human preferences

45
Emerging
716 ALucek/NeedleInAVidStack

Extract, timestamp, and analyze specific content from video collections...

45
Emerging
717 Bavest/fin-llama

LLAMA specialized on finance

45
Emerging
718 sergiomorapardo/AdvancedTopicsAnalytics

Material y notebooks del curso "Tópicos Avanzados en Analítica...

45
Emerging
719 imoneoi/openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

45
Emerging
720 FasterDecoding/Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

45
Emerging
721 TigerResearch/TigerBot

TigerBot: A multi-language multi-task LLM

45
Emerging
722 OpenMOSS/CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

45
Emerging
723 donderom/llm4s

Scala 3 bindings for llama.cpp 🦙

45
Emerging
724 alan-turing-institute/robots-in-disguise

Information and materials for the Turing's "robots-in-disguise" reading...

45
Emerging
725 YJiangcm/FollowBench

[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following...

45
Emerging
726 deepglint/unicom

Large-Scale Visual Representation Model

45
Emerging
727 inclusionAI/asystem-awex

A high-performance RL training-inference weight synchronization framework,...

45
Emerging
728 ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림...

45
Emerging
729 zhvng/open-musiclm

Implementation of MusicLM, a text to music model published by Google...

45
Emerging
730 baichuan-inc/Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

45
Emerging
731 IBM/regression-transformer

Regression Transformer (2023; Nature Machine Intelligence)

45
Emerging
732 LinkSoul-AI/Chinese-Llama-2-7b

开源社区第一个能下载、能运行的中文 LLaMA2 模型!

45
Emerging
733 kyegomez/PALM-E

Implementation of "PaLM-E: An Embodied Multimodal Language Model"

45
Emerging
734 Wang-ML-Lab/bayesian-peft

Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]

45
Emerging
735 AlignmentResearch/tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

45
Emerging
736 Tencent-Hunyuan/GradLoc

Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR...

45
Emerging
737 kyegomez/RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

45
Emerging
738 FairyFali/SLMs-Survey

Survey of Small Language Models from Penn State, ...

45
Emerging
739 NLPOptimize/flash-tokenizer

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

45
Emerging
740 NVlabs/EoRA

[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with...

45
Emerging
741 reasoning-survey/Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

45
Emerging
742 CMKRG/QiZhenGPT

QiZhenGPT: An Open Source Chinese Medical Large Language Model|一个开源的中文医疗大语言模型

45
Emerging
743 Yangyi-Chen/Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record...

45
Emerging
744 sberbank-ai-lab/LightAutoML

LAMA - automatic model creation framework

45
Emerging
745 rishikksh20/convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

45
Emerging
746 danielzuegner/code-transformer

Implementation of the paper "Language-agnostic representation learning of...

45
Emerging
747 PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data),...

45
Emerging
748 hiyouga/Dual-Contrastive-Learning

Code for our paper "Dual Contrastive Learning: Text Classification via...

45
Emerging
749 X-PLUG/mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

45
Emerging
750 open-mmlab/Multimodal-GPT

Multimodal-GPT

45
Emerging
751 Instruction-Tuning-with-GPT-4/GPT-4-LLM

Instruction Tuning with GPT-4

45
Emerging
752 SmallDoges/small-doge

Doge Family of Small Language Models

44
Emerging
753 ThinamXx/Transformers_NLP

The repository will contain a list of projects which we will work on while...

44
Emerging
754 smalltong02/keras-llm-robot

A web UI Project In order to learn the large language model. This project...

44
Emerging
755 ParthaPRay/LLM-Learning-Sources

This repo contains a list of channels and sources from where LLMs should be learned

44
Emerging
756 tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A...

44
Emerging
757 powerserve-project/PowerServe

High-speed and easy-use LLM serving framework for local deployment

44
Emerging
758 srgtuszy/llama-cpp-swift

Swift bindings for llama-cpp library

44
Emerging
759 boyiwei/alignment-attribution-code

[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and...

44
Emerging
760 QData/LaMP

ECML 2019: Graph Neural Networks for Multi-Label Classification

44
Emerging
761 hoangsonww/Spot-the-Scam-AI-Job-Fraud

🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering...

44
Emerging
762 open-compass/MixtralKit

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

44
Emerging
763 NX-AI/mlstm_kernels

Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.

44
Emerging
764 refuel-ai/autolabel

Label, clean and enrich text datasets with LLMs.

44
Emerging
765 HKUDS/LightReasoner

"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

44
Emerging
766 mounalab/Multivariate-time-series-forecasting-keras

This project provides implementations with Keras/Tensorflow of some deep...

44
Emerging
767 interestingLSY/swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose....

44
Emerging
768 adarshM84/OpenTalkGptCode

A Chrome extension hosts an Ollama UI web server on localhost and other...

44
Emerging
769 buaacyw/MeshAnything

[ICLR 2025] From anything to mesh like human artists. Official impl. of...

44
Emerging
770 CASE-Lab-UMD/Unified-MoE-Compression

The official implementation of the paper "Towards Efficient Mixture of...

44
Emerging
771 Trustworthy-ML-Lab/CB-LLMs

[ICLR 25] A novel framework for building intrinsically interpretable LLMs...

44
Emerging
772 gitabtion/BertBasedCorrectionModels

PyTorch impelementations of BERT-based Spelling Error Correction Models. ...

44
Emerging
773 iflytek/cino

CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)

44
Emerging
774 alesanfra/toons

A high-performance TOON (Token Oriented Object Notation) parser and...

44
Emerging
775 MoonshotAI/MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

44
Emerging
776 lxe/simple-llm-finetuner

Simple UI for LLM Model Finetuning

44
Emerging
777 CASIA-LMC-Lab/FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

44
Emerging
778 tommasomncttn/mergenetic

Flexible library for merging large language models (LLMs) via evolutionary...

44
Emerging
779 ziegler-ingo/cleavage_benchmark

[BIBM 2025] Code and resources for the paper "Enhancing Multi-Epitope...

44
Emerging
780 ymcui/MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

44
Emerging
781 HugAILab/HugNLP

CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP...

44
Emerging
782 Aratako/T5Gemma-TTS

Multilingual TTS model with voice cloning and duration control, based on...

44
Emerging
783 ZinYY/TreeLoRA

A pytorch implementation of the paper "TreeLoRA: Efficient Continual...

44
Emerging
784 Aatricks/llmedge-examples

Examples using the llmedge library

44
Emerging
785 xrsrke/pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of...

44
Emerging
786 CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

44
Emerging
787 deep-symbolic-mathematics/TPSR

[NeurIPS 2023] This is the official code for the paper "TPSR:...

44
Emerging
788 monologg/transformers-android-demo

📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)

44
Emerging
789 emredeveloper/Mem-LLM

Mem-LLM is a Python library for building memory-enabled AI assistants that...

44
Emerging
790 andrewkchan/deepseek.cpp

CPU inference for the DeepSeek family of large language models in C++

44
Emerging
791 volverjs/ai

Hugging Face Transformers.js wrapper for on-device AI with web-workers

44
Emerging
792 hybridgroup/yzma

Go with your own intelligence - Go applications that directly integrate...

44
Emerging
793 yoniLc/ECCT

Error Correction Code Transformer

44
Emerging
794 cdpierse/transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your...

44
Emerging
795 microsoft/interwhen

A framework for verifiable reasoning with language models.

44
Emerging
796 hiyouga/FastEdit

🩹Editing large language models within 10 seconds⚡

44
Emerging
797 pairlab/SlotFormer

Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models

44
Emerging
798 SqueezeAILab/LLMCompiler

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

44
Emerging
799 bhavsarpratik/serverless-transformers-on-aws-lambda

Deploy transformers serverless on AWS Lambda

44
Emerging
800 awaescher/llmaid

Mass-edit files with LLMs

44
Emerging
« Prev 1 2 3 6 7 8 9 10 76 77 78 Next »