All Transformer Models
7,795 models ranked by quality score · Page 18 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 1701 |
princeton-nlp/LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following |
|
Emerging |
| 1702 |
mu-cai/matryoshka-mm
Matryoshka Multimodal Models |
|
Emerging |
| 1703 |
zRzRzRzRzRzRzR/lm-fly
大模型推理框架加速,让 LLM 飞起来 |
|
Emerging |
| 1704 |
FlatlinerDOA/PerceptivePyro
Run and train Transformer based Large Language Models (LLMS) natively in... |
|
Emerging |
| 1705 |
rdenadai/BR-BERTo
Transformer model for Portuguese language (Brazil pt_BR) |
|
Emerging |
| 1706 |
pdfosborne/elsciRL
The core repository of the elsciRL framework. |
|
Emerging |
| 1707 |
AlexanderVNikitin/kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic... |
|
Emerging |
| 1708 |
UCSB-NLP-Chang/SemanticSmooth
Implementation of paper 'Defending Large Language Models against Jailbreak... |
|
Emerging |
| 1709 |
ariannamethod/chuck.optimizer
Adam is blind. Chuck sees. Lee 4ever. |
|
Emerging |
| 1710 |
mkuchnik/relm
ReLM is a Regular Expression engine for Language Models |
|
Emerging |
| 1711 |
ai-glimpse/toyllm
ToyLLM: Learning LLM from Scratch |
|
Emerging |
| 1712 |
DebeshJha/TransNetR
Official implementation of TransNetR: Transformer-based Residual Network for... |
|
Emerging |
| 1713 |
surrey-nlp/NLP-2025
Labs for COM3029/COMM061 at University of Surrey |
|
Emerging |
| 1714 |
TIGER-AI-Lab/StructLM
Code and data for "StructLM: Towards Building Generalist Models for... |
|
Emerging |
| 1715 |
horus-ai-labs/DistillFlow
Library for model distillation |
|
Emerging |
| 1716 |
chziakas/redeval
A library for red-teaming LLM applications with LLMs. |
|
Emerging |
| 1717 |
shinomakoi/AI-Messenger
A QT GUI for large language models |
|
Emerging |
| 1718 |
Bruce-Lee-LY/flash_attention_inference
Performance of the C++ interface of flash attention and flash attention v2... |
|
Emerging |
| 1719 |
Ethyros-AI/ModelCypher
ModelCypher - Decipher the high dimensional geometry of LLMs. An open source... |
|
Emerging |
| 1720 |
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集 |
|
Emerging |
| 1721 |
osainz59/t5-encoder
A extension of Transformers library to include T5ForSequenceClassification class. |
|
Emerging |
| 1722 |
avnlp/llm-blender
LLM-Blender: Ensembling framework that maximizes LLM performance via... |
|
Emerging |
| 1723 |
olaflaitinen/llm-proteomics-hallucination
Systematic evaluation of hallucination risks in Large Language Models... |
|
Emerging |
| 1724 |
sandylaker/ib-edl
Calibrating LLMs with Information-Theoretic Evidential Deep Learning (ICLR 2025) |
|
Emerging |
| 1725 |
westlake-repl/NRPStransformer
A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)... |
|
Emerging |
| 1726 |
monologg/KoELECTRA-Pipeline
Transformers Pipeline with KoELECTRA |
|
Emerging |
| 1727 |
openmedlab/PULSE
PULSE: Pretrained and Unified Language Service Engine |
|
Emerging |
| 1728 |
DebarshiChanda/Amazon-ML-Challenge2021
Scripts and Approach for Amazon ML Challenge |
|
Emerging |
| 1729 |
LMLK-seal/HuggingGGUF
Hugging Face Model downloader and GGUF Converter. |
|
Emerging |
| 1730 |
benitomartin/food-images-finetuning
Fine-tuning of LiquidAI LFM2-VL vision-language models on food image... |
|
Emerging |
| 1731 |
zd11024/NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for... |
|
Emerging |
| 1732 |
CoderLSF/fast-llama
Runs LLaMA with Extremely HIGH speed |
|
Emerging |
| 1733 |
Grenzlinie/MgBERT_LLM_Classification_for_Materials_Science
Source code and result for Paper 'A Prompt-Engineered Large Language Model,... |
|
Emerging |
| 1734 |
arrmansa/Gpt-Neo-Limited-Vram-Cuda
A notebook that runs GPT-Neo with low vram (6 gb) and cuda acceleration by... |
|
Emerging |
| 1735 |
toriving/text-classification-transformers
Easy text classification for everyone : Bert based models via Huggingface... |
|
Emerging |
| 1736 |
joslefaure/HERMES
[ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes... |
|
Emerging |
| 1737 |
FuxiaoLiu/LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust... |
|
Emerging |
| 1738 |
MalihehIzadi/SoftwareTagRecommender
A tag recommender based on SOTA machine learning algorithms to automatically... |
|
Emerging |
| 1739 |
gpustack/gguf-packer-go
Deliver LLMs of GGUF format via Dockerfile. |
|
Emerging |
| 1740 |
losttech/Torch.MinGPT
A C# implementation of GPT |
|
Emerging |
| 1741 |
microsoft/AdaMix
This is the implementation of the paper AdaMix: Mixture-of-Adaptations for... |
|
Emerging |
| 1742 |
kyegomez/VortexFusion
Transformers + Mambas + LSTMS All in One Model |
|
Emerging |
| 1743 |
amazon-science/transformers-data-augmentation
Code associated with the "Data Augmentation using Pre-trained Transformer... |
|
Emerging |
| 1744 |
YassWorks/Tuna
Python library that makes fine-tuning transformer-based models easier and faster. |
|
Emerging |
| 1745 |
LlamaFamily/Llama-Chinese
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用 |
|
Emerging |
| 1746 |
SALT-NLP/LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for... |
|
Emerging |
| 1747 |
JinjieNi/MixEval
The official evaluation suite and dynamic data release for MixEval. |
|
Emerging |
| 1748 |
juyongjiang/CodeUp
CodeUp: A Multilingual Code Generation Llama-X Model with... |
|
Emerging |
| 1749 |
leap-laboratories/PIZZA
An attribution library for LLMs |
|
Emerging |
| 1750 |
jrobine/twm
Transformer-based World Models |
|
Emerging |
| 1751 |
conceptofmind/t5-pytorch
Implementation of Exploring the Limits of Transfer Learning with a Unified... |
|
Emerging |
| 1752 |
kyegomez/MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from... |
|
Emerging |
| 1753 |
lukechilds/humanscript
A truly natural scripting language |
|
Emerging |
| 1754 |
rbitr/llm.f90
LLM inference in Fortran |
|
Emerging |
| 1755 |
liupras/Practical-local-LLM-programming
Programming with local large language model. |
|
Emerging |
| 1756 |
kyegomez/MC-ViT
Implementation of the model: "(MC-ViT)" from the paper: "Memory... |
|
Emerging |
| 1757 |
ShinoharaHare/LLM-Training
A distributed training framework for large language models powered by Lightning. |
|
Emerging |
| 1758 |
princeton-pli/AdaptMI
[COLM 2025] Adaptive Skill-based In-context Math Instruction for Small... |
|
Emerging |
| 1759 |
kyegomez/MegaVIT
The open source implementation of the model from "Scaling Vision... |
|
Emerging |
| 1760 |
Thrasher-Software/sigil
A local-first LLM development studio. Build, test, and customize inference... |
|
Emerging |
| 1761 |
earthai-tech/fusionlab-learn
fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures |
|
Emerging |
| 1762 |
sytelus/nanuGPT
Simple, reliable and well tested training code for quick experiments with... |
|
Emerging |
| 1763 |
iMoonLab/LLM4Hypergraph
The source code of ICLR 2025 "Beyond Graphs: Can Large Language Models... |
|
Emerging |
| 1764 |
Rishit-dagli/GLU
An easy-to-use library for GLU (Gated Linear Units) and GLU variants in TensorFlow. |
|
Emerging |
| 1765 |
readme-generator/alreadyme-ai-research
Generate README.md with GPT-3 few-shot learning |
|
Emerging |
| 1766 |
StarRing2022/ChatGPTX-Uni
实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为... |
|
Emerging |
| 1767 |
teelinsan/camoscio
Camoscio: An Italian instruction-tuned language model based on LLaMA |
|
Emerging |
| 1768 |
invergent-ai/surogate
Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs.... |
|
Emerging |
| 1769 |
tirtharajdash/LMLFStar
Generating target-specific novel lead molecules using an LLM |
|
Emerging |
| 1770 |
ksm26/Finetuning-Large-Language-Models
Unlock the potential of finetuning Large Language Models (LLMs). Learn from... |
|
Emerging |
| 1771 |
IDSIA/fpainter
Official repository for the paper "Images as Weight Matrices: Sequential... |
|
Emerging |
| 1772 |
luohongyin/LangCode
LangCode - Improving alignment and reasoning of large language models (LLMs)... |
|
Emerging |
| 1773 |
ivonajdenkoska/tulip
[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP" |
|
Emerging |
| 1774 |
huggingface/large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training... |
|
Emerging |
| 1775 |
desaixie/zeroverse
Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction... |
|
Emerging |
| 1776 |
poteminr/instruct-ner
Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models... |
|
Emerging |
| 1777 |
hyintell/awesome-refreshing-llms
EMNLP'23 survey: a curation of awesome papers and resources on refreshing... |
|
Emerging |
| 1778 |
ariya/gamal
Research tool leveraging LLM for answers |
|
Emerging |
| 1779 |
Troyanovsky/llama-vision-image-tagger
Use Llama3.2 Vision for tagging and searching images on your local machine. |
|
Emerging |
| 1780 |
zjunlp/Mol-Instructions
[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset... |
|
Emerging |
| 1781 |
JonSnow1807/Medical-Prescription-OCR
OCR system for handwritten medical prescriptions using Donut transformer and... |
|
Emerging |
| 1782 |
VityaVitalich/STASC
[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models |
|
Emerging |
| 1783 |
poloclub/Fine-tuning-LLMs
Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial |
|
Emerging |
| 1784 |
pier-maker92/bachsformer
A Bach music generator with Artificial Intelligence. This model is made by a... |
|
Emerging |
| 1785 |
jellydn/gpt4all-cli
By utilizing GPT4All-CLI, developers can effortlessly tap into the power of... |
|
Emerging |
| 1786 |
MurrellGroup/InvariantPointAttention.jl
Julia implementation of AlphaFold 2's Invariant Point Attention |
|
Emerging |
| 1787 |
partarstu/transformers-in-java
Experimental project for AI and NLP based on Transformer Architecture |
|
Emerging |
| 1788 |
rendezqueue/rendezllama
CLI for llama.cpp with various commands to guide, edit, and regenerate... |
|
Emerging |
| 1789 |
otvam/pyscalexfmr
Optimization and Scaling of Medium-Frequency Transformers |
|
Emerging |
| 1790 |
openshieldai/openshield
OpenShield is a new generation security layer for AI models |
|
Emerging |
| 1791 |
alexeykarnachev/full_stack_transformer
Pytorch library for end-to-end transformer models training, inference and serving |
|
Emerging |
| 1792 |
andrewkchan/yalm
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O |
|
Emerging |
| 1793 |
babycommando/machinascript-for-robots
Build LLM-powered robots in your garage with MachinaScript For Robots! |
|
Emerging |
| 1794 |
DeepLangAI/LingoWhale-8B
LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型 |
|
Emerging |
| 1795 |
guxm2021/ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription |
|
Emerging |
| 1796 |
SuyogKamble/simpleVLM
building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2... |
|
Emerging |
| 1797 |
c0sogi/llama-api
An OpenAI-like LLaMA inference API |
|
Emerging |
| 1798 |
neosantara-xyz/glm-ocr-inference
Fast and lightweight GLM-OCR inference on Modal with an OpenAI-compatible... |
|
Emerging |
| 1799 |
ASSERT-KTH/agentic-evals-lab
Framework for training and evaluating LLMs with reinforcement learning in... |
|
Emerging |
| 1800 |
Selozhd/FNet-tensorflow
Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms." |
|
Emerging |