All Transformer Models
7,795 models ranked by quality score · Page 8 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 701 |
huggingface/awesome-huggingface
🤗 A list of wonderful open-source projects & applications integrated with... |
|
Emerging |
| 702 |
lambdavi/SatDrive-SegFL
MLDL '23 Project: Federated Learning and Semantic Segmentation for... |
|
Emerging |
| 703 |
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for... |
|
Emerging |
| 704 |
DarshanDeshpande/jax-models
Unofficial JAX implementations of deep learning research papers |
|
Emerging |
| 705 |
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text... |
|
Emerging |
| 706 |
Kaleidophon/token2index
A lightweight but powerful library to build token indices for NLP tasks,... |
|
Emerging |
| 707 |
analyticalrohit/llms-from-scratch
Build a ChatGPT like LLM from scratch in PyTorch, explained step by step. |
|
Emerging |
| 708 |
Hugging-Face-Supporter/tftokenizers
Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels |
|
Emerging |
| 709 |
ddh0/easy-llama
Python package wrapping llama.cpp for on-device LLM inference |
|
Emerging |
| 710 |
Mann1988/awesome-claude-skills
📊 Explore high-quality Claude skills focused on business analysis and... |
|
Emerging |
| 711 |
eloialonso/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. |
|
Emerging |
| 712 |
FareedKhan-dev/train-llama4
Building LLaMA 4 MoE from Scratch |
|
Emerging |
| 713 |
RUCAIBox/TextBox
TextBox 2.0 is a text generation library with pre-trained language models |
|
Emerging |
| 714 |
Muennighoff/vilio
🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle |
|
Emerging |
| 715 |
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences |
|
Emerging |
| 716 |
ALucek/NeedleInAVidStack
Extract, timestamp, and analyze specific content from video collections... |
|
Emerging |
| 717 |
Bavest/fin-llama
LLAMA specialized on finance |
|
Emerging |
| 718 |
sergiomorapardo/AdvancedTopicsAnalytics
Material y notebooks del curso "Tópicos Avanzados en Analítica... |
|
Emerging |
| 719 |
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data |
|
Emerging |
| 720 |
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads |
|
Emerging |
| 721 |
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM |
|
Emerging |
| 722 |
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way |
|
Emerging |
| 723 |
donderom/llm4s
Scala 3 bindings for llama.cpp 🦙 |
|
Emerging |
| 724 |
alan-turing-institute/robots-in-disguise
Information and materials for the Turing's "robots-in-disguise" reading... |
|
Emerging |
| 725 |
YJiangcm/FollowBench
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following... |
|
Emerging |
| 726 |
deepglint/unicom
Large-Scale Visual Representation Model |
|
Emerging |
| 727 |
inclusionAI/asystem-awex
A high-performance RL training-inference weight synchronization framework,... |
|
Emerging |
| 728 |
ukairia777/tensorflow-nlp-tutorial
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림... |
|
Emerging |
| 729 |
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google... |
|
Emerging |
| 730 |
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
|
Emerging |
| 731 |
IBM/regression-transformer
Regression Transformer (2023; Nature Machine Intelligence) |
|
Emerging |
| 732 |
LinkSoul-AI/Chinese-Llama-2-7b
开源社区第一个能下载、能运行的中文 LLaMA2 模型! |
|
Emerging |
| 733 |
kyegomez/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model" |
|
Emerging |
| 734 |
Wang-ML-Lab/bayesian-peft
Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025] |
|
Emerging |
| 735 |
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer |
|
Emerging |
| 736 |
Tencent-Hunyuan/GradLoc
Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... |
|
Emerging |
| 737 |
kyegomez/RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action" |
|
Emerging |
| 738 |
FairyFali/SLMs-Survey
Survey of Small Language Models from Penn State, ... |
|
Emerging |
| 739 |
NLPOptimize/flash-tokenizer
EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING |
|
Emerging |
| 740 |
NVlabs/EoRA
[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with... |
|
Emerging |
| 741 |
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models |
|
Emerging |
| 742 |
CMKRG/QiZhenGPT
QiZhenGPT: An Open Source Chinese Medical Large Language Model|一个开源的中文医疗大语言模型 |
|
Emerging |
| 743 |
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record... |
|
Emerging |
| 744 |
sberbank-ai-lab/LightAutoML
LAMA - automatic model creation framework |
|
Emerging |
| 745 |
rishikksh20/convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers |
|
Emerging |
| 746 |
danielzuegner/code-transformer
Implementation of the paper "Language-agnostic representation learning of... |
|
Emerging |
| 747 |
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data),... |
|
Emerging |
| 748 |
hiyouga/Dual-Contrastive-Learning
Code for our paper "Dual Contrastive Learning: Text Classification via... |
|
Emerging |
| 749 |
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family |
|
Emerging |
| 750 |
open-mmlab/Multimodal-GPT
Multimodal-GPT |
|
Emerging |
| 751 |
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4 |
|
Emerging |
| 752 |
SmallDoges/small-doge
Doge Family of Small Language Models |
|
Emerging |
| 753 |
ThinamXx/Transformers_NLP
The repository will contain a list of projects which we will work on while... |
|
Emerging |
| 754 |
smalltong02/keras-llm-robot
A web UI Project In order to learn the large language model. This project... |
|
Emerging |
| 755 |
ParthaPRay/LLM-Learning-Sources
This repo contains a list of channels and sources from where LLMs should be learned |
|
Emerging |
| 756 |
tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A... |
|
Emerging |
| 757 |
powerserve-project/PowerServe
High-speed and easy-use LLM serving framework for local deployment |
|
Emerging |
| 758 |
srgtuszy/llama-cpp-swift
Swift bindings for llama-cpp library |
|
Emerging |
| 759 |
boyiwei/alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and... |
|
Emerging |
| 760 |
QData/LaMP
ECML 2019: Graph Neural Networks for Multi-Label Classification |
|
Emerging |
| 761 |
hoangsonww/Spot-the-Scam-AI-Job-Fraud
🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering... |
|
Emerging |
| 762 |
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI |
|
Emerging |
| 763 |
NX-AI/mlstm_kernels
Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. |
|
Emerging |
| 764 |
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs. |
|
Emerging |
| 765 |
HKUDS/LightReasoner
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?" |
|
Emerging |
| 766 |
mounalab/Multivariate-time-series-forecasting-keras
This project provides implementations with Keras/Tensorflow of some deep... |
|
Emerging |
| 767 |
interestingLSY/swiftLLM
A tiny yet powerful LLM inference system tailored for researching purpose.... |
|
Emerging |
| 768 |
adarshM84/OpenTalkGptCode
A Chrome extension hosts an Ollama UI web server on localhost and other... |
|
Emerging |
| 769 |
buaacyw/MeshAnything
[ICLR 2025] From anything to mesh like human artists. Official impl. of... |
|
Emerging |
| 770 |
CASE-Lab-UMD/Unified-MoE-Compression
The official implementation of the paper "Towards Efficient Mixture of... |
|
Emerging |
| 771 |
Trustworthy-ML-Lab/CB-LLMs
[ICLR 25] A novel framework for building intrinsically interpretable LLMs... |
|
Emerging |
| 772 |
gitabtion/BertBasedCorrectionModels
PyTorch impelementations of BERT-based Spelling Error Correction Models. ... |
|
Emerging |
| 773 |
iflytek/cino
CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型) |
|
Emerging |
| 774 |
alesanfra/toons
A high-performance TOON (Token Oriented Object Notation) parser and... |
|
Emerging |
| 775 |
MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs |
|
Emerging |
| 776 |
lxe/simple-llm-finetuner
Simple UI for LLM Model Finetuning |
|
Emerging |
| 777 |
CASIA-LMC-Lab/FLAP
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models |
|
Emerging |
| 778 |
tommasomncttn/mergenetic
Flexible library for merging large language models (LLMs) via evolutionary... |
|
Emerging |
| 779 |
ziegler-ingo/cleavage_benchmark
[BIBM 2025] Code and resources for the paper "Enhancing Multi-Epitope... |
|
Emerging |
| 780 |
ymcui/MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT) |
|
Emerging |
| 781 |
HugAILab/HugNLP
CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP... |
|
Emerging |
| 782 |
Aratako/T5Gemma-TTS
Multilingual TTS model with voice cloning and duration control, based on... |
|
Emerging |
| 783 |
ZinYY/TreeLoRA
A pytorch implementation of the paper "TreeLoRA: Efficient Continual... |
|
Emerging |
| 784 |
Aatricks/llmedge-examples
Examples using the llmedge library |
|
Emerging |
| 785 |
xrsrke/pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of... |
|
Emerging |
| 786 |
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification. |
|
Emerging |
| 787 |
deep-symbolic-mathematics/TPSR
[NeurIPS 2023] This is the official code for the paper "TPSR:... |
|
Emerging |
| 788 |
monologg/transformers-android-demo
📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile) |
|
Emerging |
| 789 |
emredeveloper/Mem-LLM
Mem-LLM is a Python library for building memory-enabled AI assistants that... |
|
Emerging |
| 790 |
andrewkchan/deepseek.cpp
CPU inference for the DeepSeek family of large language models in C++ |
|
Emerging |
| 791 |
volverjs/ai
Hugging Face Transformers.js wrapper for on-device AI with web-workers |
|
Emerging |
| 792 |
hybridgroup/yzma
Go with your own intelligence - Go applications that directly integrate... |
|
Emerging |
| 793 |
yoniLc/ECCT
Error Correction Code Transformer |
|
Emerging |
| 794 |
cdpierse/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your... |
|
Emerging |
| 795 |
microsoft/interwhen
A framework for verifiable reasoning with language models. |
|
Emerging |
| 796 |
hiyouga/FastEdit
🩹Editing large language models within 10 seconds⚡ |
|
Emerging |
| 797 |
pairlab/SlotFormer
Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models |
|
Emerging |
| 798 |
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling |
|
Emerging |
| 799 |
bhavsarpratik/serverless-transformers-on-aws-lambda
Deploy transformers serverless on AWS Lambda |
|
Emerging |
| 800 |
awaescher/llmaid
Mass-edit files with LLMs |
|
Emerging |