All Transformer Models
7,795 models ranked by quality score · Page 10 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 901 | WangRongsheng/ChatGenTitle<br>🌟 ChatGenTitle: a paper-title generation model fine-tuned on LLaMA using information from one million arXiv papers | | Emerging |
| 902 | GAIR-NLP/MegaScience<br>MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning | | Emerging |
| 903 | kastalimohammed1965/CLIP-fine-tune-registers-gated<br>Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny... | | Emerging |
| 904 | hao-ai-lab/JacobiForcing<br>Jacobi Forcing: Fast and Accurate Diffusion-style Decoding | | Emerging |
| 905 | openjlc/riscv64-library<br>Some of the libraries (docs) on the RISCV64 architecture are easy for users... | | Emerging |
| 906 | cleopatra-itn/fair_multimodal_sentiment<br>Code and Splits for the paper "A Fair and Comprehensive Comparison of... | | Emerging |
| 907 | varunkumar-dev/TransformersDataAugmentation<br>Code associated with the "Data Augmentation using Pre-trained Transformer... | | Emerging |
| 908 | cdpierse/script_buddy_v2<br>Script Buddy v2 is a film script text generation tool built using film... | | Emerging |
| 909 | magpie-align/magpie<br>[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs... | | Emerging |
| 910 | jasonvanf/llama-trl<br>LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA | | Emerging |
| 911 | obss/trapper<br>State-of-the-art NLP through transformer models in a modular design and... | | Emerging |
| 912 | mutablelogic/go-llm<br>Large Language Model API interface | | Emerging |
| 913 | AviSoori1x/Tuning-the-Finetuning<br>Tuning the Finetuning: An exploration of achieving success with QLoRA | | Emerging |
| 914 | Archimedes1618/Madlab<br>Madlab is an advanced AI development studio designed to streamline the... | | Emerging |
| 915 | eric-ai-lab/MiniGPT-5<br>Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language... | | Emerging |
| 916 | vaswdeferenss/AI-Dialogue-Memory-Based-on-Hidden-State<br>🤖 Integrate LSTM into Transformer models to enhance dialog memory, offering... | | Emerging |
| 917 | DAGroup-PKU/MHLA<br>MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head... | | Emerging |
| 918 | cambridgeltl/visual-med-alpaca<br>Visual Med-Alpaca is an open-source, multi-modal foundation model designed... | | Emerging |
| 919 | datastone-spirit/spirit-lora-trainer<br>Spirit Lora Trainer is a robust toolkit for training Flux1-LoRA models with... | | Emerging |
| 920 | CodeWithKyrian/transformers-php<br>Transformers PHP is a toolkit for PHP developers to add machine learning... | | Emerging |
| 921 | nerve-sparks/iris_android<br>IRIS is an Android app for interfacing with GGUF / llama.cpp models locally. | | Emerging |
| 922 | kyegomez/attn_res<br>A clean, single-file PyTorch implementation of Attention Residuals (Kimi... | | Emerging |
| 923 | haoliuhl/ringattention<br>Large Context Attention | | Emerging |
| 924 | VikParuchuri/textbook_quality<br>Generate textbook-quality synthetic LLM pretraining data | | Emerging |
| 925 | zalkikar/mlm-bias<br>Measuring Biases in Masked Language Models for PyTorch Transformers. Support... | | Emerging |
| 926 | mytechnotalent/RE-GPT<br>Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you... | | Emerging |
| 927 | datawhalechina/base-llm<br>A full-stack algorithm tutorial from NLP to LLM; read online at https://datawhalechina.github.io/base-llm/ | | Emerging |
| 928 | modelscope/dash-infer<br>DashInfer is a native LLM inference engine aiming to deliver... | | Emerging |
| 929 | ethicalabs-ai/kurtis<br>Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small... | | Emerging |
| 930 | RManLuo/graph-constrained-reasoning<br>Official Implementation of ICML 2025 Paper: "Graph-constrained Reasoning:... | | Emerging |
| 931 | CLAIRE-Labo/EvoTune<br>Efficiently discovering algorithms via LLMs with evolutionary search and... | | Emerging |
| 932 | ruimalheiro/training-custom-llama<br>Llama-style transformer in PyTorch with multi-node / multi-GPU training.... | | Emerging |
| 933 | aliemo/transfomers-silicon-research<br>Research and Materials on Hardware implementation of Transformer Model | | Emerging |
| 934 | michael-borck/study-buddy<br>Desktop AI tutoring app with local inference using Ollama for... | | Emerging |
| 935 | Tongjilibo/build_MiniLLM_from_scratch<br>Building a MiniLLM from scratch (pretrain + SFT + DPO, work in progress) | | Emerging |
| 936 | harveybc/predictor<br>Predictor that uses a configurable plugin-based predictive supervised... | | Emerging |
| 937 | amirhossein-kz/HiFormer<br>HiFormer: Hierarchical Multi-scale Representations Using Transformers for... | | Emerging |
| 938 | DC-research/TEMPO<br>The official code for "TEMPO: Prompt-based Generative Pre-trained... | | Emerging |
| 939 | ShivamRajSharma/Transformer-Architectures-From-Scratch<br>Implementation of transformer-based architectures in PyTorch. | | Emerging |
| 940 | Eiztrips/ai-responder<br>A tool for creating and training models that imitate a communication style... | | Emerging |
| 941 | skylight-org/sparse-attention-hub<br>Advancing the frontier of efficient AI | | Emerging |
| 942 | soldni/pyterrier_sentence_transformers<br>Create PyTerrier compatible dense indices using any sentence_transformers model | | Emerging |
| 943 | alibaba/GraphTranslator<br>GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | | Emerging |
| 944 | Michael-A-Kuykendall/shimmytok<br>Pure Rust tokenizer for GGUF models - llama.cpp compatible | | Emerging |
| 945 | dipanjanS/adv_nlp_workshop_odsc_europe22<br>Extensive tutorials for the Advanced NLP Workshop in Open Data Science... | | Emerging |
| 946 | datamllab/LongLM<br>[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning | | Emerging |
| 947 | gohjiayi/suicidal-text-detection<br>Building a suicidal text detection model and mental health chatbot with deep... | | Emerging |
| 948 | zeozeozeo/ellama<br>Friendly interface to chat with an Ollama instance. | | Emerging |
| 949 | jianghoucheng/AnyEdit<br>AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025 | | Emerging |
| 950 | huangwl18/language-planner<br>Official Code for "Language Models as Zero-Shot Planners: Extracting... | | Emerging |
| 951 | IlyaGusev/rulm<br>Language modeling and instruction tuning for Russian | | Emerging |
| 952 | lxuechen/private-transformers<br>A codebase that makes differentially private training of transformers easy. | | Emerging |
| 953 | armbues/SiLLM<br>SiLLM simplifies the process of training and running Large Language Models... | | Emerging |
| 954 | xlang-ai/Binder<br>[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages" | | Emerging |
| 955 | csiro-robotics/HOTFormerLoc<br>[IEEE/CVF CVPR 2025] Hierarchical Octree Transformer for Versatile Lidar... | | Emerging |
| 956 | chanind/linear-relational<br>Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)... | | Emerging |
| 957 | njchoma/transformer_image_caption<br>Image Captioning based on Bottom-Up and Top-Down Attention model | | Emerging |
| 958 | Nuked88/ComfyUI-N-Nodes<br>A suite of custom nodes for ComfyUI that includes GPT text-prompt... | | Emerging |
| 959 | SomeBottle/Konnyaku<br>A simple and robust LLM workflow for anime subtitle file translation. / Based on... | | Emerging |
| 960 | canyuchen/ClinicalBench<br>Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in... | | Emerging |
| 961 | Yachay-AI/byt5-geotagging<br>Confidence and ByT5-based geotagging model predicting coordinates from text alone. | | Emerging |
| 962 | deepreinforce-ai/CUDA-L2<br>CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through... | | Emerging |
| 963 | mala-lab/SEMPO<br>[NeurIPS 2025] Official implementation of "SEMPO: Lightweight Foundation... | | Emerging |
| 964 | phronmophobic/llama.clj<br>Run LLMs locally. A Clojure wrapper for llama.cpp. | | Emerging |
| 965 | ssbuild/deep_training<br>deep learning | | Emerging |
| 966 | zetavg/LLaMA-LoRA-Tuner<br>UI tool for fine-tuning and testing your own LoRA models based on LLaMA,... | | Emerging |
| 967 | AntixK/PyTorch-Model-Compare<br>Compare neural networks by their feature similarity | | Emerging |
| 968 | hellotransformers/Natural_Language_Processing_with_Transformers<br>Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial | | Emerging |
| 969 | illiterate/BertClassifier<br>BERT Chinese text classification model implemented in PyTorch | | Emerging |
| 970 | KolosalAI/kolosal-server<br>Kolosal AI is an OpenSource and Lightweight alternative to Ollama to run... | | Emerging |
| 971 | NetEase-Media/grps_trtllm<br>Higher performance OpenAI LLM service than vLLM serve: A pure C++... | | Emerging |
| 972 | princeton-nlp/LLM-Shearing<br>[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via... | | Emerging |
| 973 | bahree/helloLondon<br>Historical Language Model for London - A specialized LLM trained on... | | Emerging |
| 974 | ruanchaves/napolab<br>The Natural Portuguese Language Benchmark (Napolab). Stay up to date with... | | Emerging |
| 975 | the-crypt-keeper/can-ai-code<br>Self-evaluating interview for AI coders | | Emerging |
| 976 | txsun1997/Black-Box-Tuning<br>ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022:... | | Emerging |
| 977 | EleutherAI/DALLE-mtf<br>OpenAI's DALL-E for large scale training in mesh-tensorflow. | | Emerging |
| 978 | uclaml/SPPO<br>The official implementation of Self-Play Preference Optimization (SPPO) | | Emerging |
| 979 | nlp-uoregon/mlmm-evaluation<br>Multilingual Large Language Models Evaluation Benchmark | | Emerging |
| 980 | ArdaGnsrn/ollama-php<br>This is a PHP library for Ollama. Ollama is an open-source project that... | | Emerging |
| 981 | luchangli03/export_llama_to_onnx<br>Export LLaMA to ONNX | | Emerging |
| 982 | AviSoori1x/seemore<br>From scratch implementation of a vision language model in pure PyTorch | | Emerging |
| 983 | hitz-zentroa/whisper-lm-transformers<br>Add n-gram and LLM language model support to HF Transformers Whisper models. | | Emerging |
| 984 | adarshM84/TextLLaMACode<br>Transform your writing with TextLLaMA! ✍️🚀 Simplify grammar, translate... | | Emerging |
| 985 | CVxTz/music_genre_classification<br>Music genre classification: LSTM vs Transformer | | Emerging |
| 986 | scientific-discovery/LLEMA<br>[ICLR 2026] LLEMA: Evolutionary Search with LLMs for Multi-Objective... | | Emerging |
| 987 | RobertCsordas/ndr<br>The official repository for our paper "The Neural Data Router: Adaptive... | | Emerging |
| 988 | jingedawang/TutorialLLM<br>LLM Tutorial for Everyone. | | Emerging |
| 989 | argosopentech/MetalTranslate<br>Customizable machine translation in C++ | | Emerging |
| 990 | ariya/chat-llm<br>Chat with an LLM | | Emerging |
| 991 | jd-coderepos/llms4subjects<br>The official SemEval 2025 Task 5 - LLMs4Subjects - Shared Task Dataset repository | | Emerging |
| 992 | Dartvauder/NeuroSandboxWebUI<br>(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,... | | Emerging |
| 993 | withcaer/curtana<br>Simplified zero-cost wrapper over llama.cpp powered by the llama-cpp-2 crate. | | Emerging |
| 994 | Alpha-VLLM/Lumina-T2X<br>Lumina-T2X is a unified framework for Text to Any Modality Generation | | Emerging |
| 995 | HamedBabaei/LLMs4OM<br>LLMs4OM: Matching Ontologies with Large Language Models | | Emerging |
| 996 | AbdelStark/attnres<br>Rust implementation of Attention Residuals from MoonshotAI/Kimi | | Emerging |
| 997 | nv-tlabs/LLaMA-Mesh<br>Unifying 3D Mesh Generation with Language Models | | Emerging |
| 998 | USC-FORTIS/AD-LLM<br>[ACL Findings 2025] A benchmark for anomaly detection using large language... | | Emerging |
| 999 | tosiyuki/LLaVA-JP<br>LLaVA-JP is a Japanese VLM trained by LLaVA method | | Emerging |
| 1000 | FreeOCR-AI/layoutreader<br>A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. | | Emerging |