All Transformer Models
7,795 models ranked by quality score · Page 19 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 1801 |
niclasgriesshaber/llm_patent_pipeline
LLMs for Historical Dataset Construction from Archival Image Scans |
|
Emerging |
| 1802 |
fmueller/scribae
CLI to turn Markdown notes into SEO briefs, drafts, metadata, and... |
|
Emerging |
| 1803 |
nanxiang11/CodeLab_LLM
🌟 从LLaMA2开启大语言模型原理与实践教程 |
|
Emerging |
| 1804 |
ykjaat6104/LLM-Cost-and-Token-Efficiency-Analysis
A benchmark study analyzing cost and token efficiency across 14 LLMs from 5... |
|
Emerging |
| 1805 |
clint-kristopher-morris/llm-guided-evolution
LLM Guided Evolution - The Automation of Models Advancing Models |
|
Emerging |
| 1806 |
jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer... |
|
Emerging |
| 1807 |
MaxiDonkey/DelphiGroqCloud
The GroqCloud API wrapper for Delphi provides access to models from Meta,... |
|
Emerging |
| 1808 |
jagilley/fact-checker
Fact-checking LLM outputs with self-ask |
|
Emerging |
| 1809 |
OSU-STARLAB/Simul-LLM
[ACL 2024] An easily extensible framework for simultaneous, text-to-text... |
|
Emerging |
| 1810 |
PaddlePaddle/PALM
a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and... |
|
Emerging |
| 1811 |
assembly-automation-hub/repo-governance
⚙️ Reusable GitHub repository governance kit: CI/CD workflows, CodeQL SAST,... |
|
Emerging |
| 1812 |
zrr1999/emotion-recognition
多模态情绪识别方法研究(Multimodal Emotion Recognition) |
|
Emerging |
| 1813 |
JRC1995/BERT-Disaster-Classification-Capsule-Routing
Exploration of BERT-BiLSTM models with Layer Aggregation (attention-based... |
|
Emerging |
| 1814 |
ariya/query-llm
Query LLM with Chain-of-Tought |
|
Emerging |
| 1815 |
SimeonHristov99/DL_25-26
Practice sessions for the course "Introduction to deep learning" in the... |
|
Emerging |
| 1816 |
martin-wey/peft-llm-code
Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... |
|
Emerging |
| 1817 |
jaketae/alibi
PyTorch implementation of Train Short, Test Long: Attention with Linear... |
|
Emerging |
| 1818 |
ziplab/HVT
[ICCV 2021] Official implementation of "Scalable Vision Transformers with... |
|
Emerging |
| 1819 |
luciusssss/ZhuangBench
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly |
|
Emerging |
| 1820 |
antonyvigouret/Pay-Attention-to-MLPs
My implementation of the gMLP model from the paper "Pay Attention to MLPs". |
|
Emerging |
| 1821 |
zhongkaifu/TensorSharp
A C# inference engine for running large language models (LLMs) locally using... |
|
Emerging |
| 1822 |
AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation
Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD)... |
|
Emerging |
| 1823 |
aquadzn/deploy-transformers
Easily deploy a state-of-the-art language model from HuggingFace's Transformers |
|
Emerging |
| 1824 |
sigeisler/reinforce-attacks-llms
REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,... |
|
Emerging |
| 1825 |
deep-symbolic-mathematics/Multimodal-Math-Pretraining
[ICLR 2024 Spotlight] This is the official code for the paper "SNIP:... |
|
Emerging |
| 1826 |
camenduru/alpaca-lora-colab
Alpaca Lora |
|
Emerging |
| 1827 |
AutonomicPerfectionist/PipeInfer
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation |
|
Emerging |
| 1828 |
shoppollama/shoppollama
Open Source Agentic Commerce Platform built on Ollama and Stripe — Run... |
|
Emerging |
| 1829 |
JHansiduYapa/Fine-Tuning-a-Small-Language-Model-for-Cypher-Query-Generation
This project fine-tunes Unsloth's Gemma-3 4B IT (4-bit) model to translate... |
|
Emerging |
| 1830 |
sinanuozdemir/oreilly-bert-nlp
This repository contains code for the O'Reilly Live Online Training for BERT |
|
Emerging |
| 1831 |
NgJaBach/dark-kit
Collect and share guidance + code snippets for running LM-related tasks. |
|
Emerging |
| 1832 |
dvgodoy/LLM-visuals
Over 60 figures and diagrams of LLMs, quantization, low-rank adapters... |
|
Emerging |
| 1833 |
flowersteam/LLM-Culture
Code for the "Cultural evolution in populations of Large Language Models" paper |
|
Emerging |
| 1834 |
alantess/gtrxl-torch
Gated Transformer Model for Computer Vision |
|
Emerging |
| 1835 |
thanhlecongg/Invalidator
Invalidator: Automated Patch Correctness Assessment via Semantic and... |
|
Emerging |
| 1836 |
gotzmann/booster
Booster - open accelerator for LLM models. Better inference and debugging... |
|
Emerging |
| 1837 |
m-horky/sllm
Tools using small Large Language Models |
|
Emerging |
| 1838 |
yongchao98/R1-Code-Interpreter
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and... |
|
Emerging |
| 1839 |
RishabSA/Sketch2Graphviz
Sketch2Graphviz allows you to convert sketches or images of graphs and... |
|
Emerging |
| 1840 |
crux82/BISS-2024
This repository hosts materials from the Bertinoro International Spring... |
|
Emerging |
| 1841 |
abdur75648/V-Zen
V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel... |
|
Emerging |
| 1842 |
trzy/llava-cpp-server
LLaVA server (llama.cpp). |
|
Emerging |
| 1843 |
sashazykov/json-repair-rb
A simple Ruby gem designed to repair broken JSON strings |
|
Emerging |
| 1844 |
TIGER-AI-Lab/VisualWebInstruct
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction... |
|
Emerging |
| 1845 |
etaoxing/multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch |
|
Emerging |
| 1846 |
qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM
A professional list on Large (Language) Models and Foundation Models (LLM,... |
|
Emerging |
| 1847 |
phvv-me/frame-representation-hypothesis
Official Repository for Frame Representation Hypothesis paper |
|
Emerging |
| 1848 |
sampathkethineedi/bert-topic-sentiment
Topic Based Sentiment Detection using BERT |
|
Emerging |
| 1849 |
dravenk/ollama-zig
Ollama Zig library |
|
Emerging |
| 1850 |
rickiepark/the-lm-book
<대규모 언어 모델, 핵심만 빠르게!>(인사이트, 2025)의 코드 저장소 |
|
Emerging |
| 1851 |
weiserlab/TinyLLM
Bringing Language Models to the Most Resource Constrained Devices |
|
Emerging |
| 1852 |
HenryNdubuaku/nanodl
Build GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more in JAX. |
|
Emerging |
| 1853 |
warner-benjamin/commented-transformers
Highly commented implementations of Transformers in PyTorch |
|
Emerging |
| 1854 |
urban-mobility-generation/Language-Modeling-for-Urban-Mobility
Language Modeling for Urban Mobility: A Data-Centric Review and Guidelines |
|
Emerging |
| 1855 |
saeeddhqan/tiny-transformer
Tiny transformer models implemented in pytorch. |
|
Emerging |
| 1856 |
grigio/llm-eval-simple
llm-eval-simple is a simple LLM evaluation framework with intermediate... |
|
Emerging |
| 1857 |
yihongXU/TransCenter
This is the official implementation of TransCenter (TPAMI). The code and... |
|
Emerging |
| 1858 |
nehalvaghasiya/interview-bot
AI-powered virtual interview bot to simulate real interview practice. |
|
Emerging |
| 1859 |
markusaksli/ai-music
A vanilla Trasformer Decoder music generation model trained on Final Fantasy... |
|
Emerging |
| 1860 |
seedatnabeel/CLLM
Curated LLM (ICML 2024) |
|
Emerging |
| 1861 |
DAMO-NLP-SG/LLM-Zoo
LLM Zoo collects information of various open- and close-sourced LLMs |
|
Emerging |
| 1862 |
Alsace08/Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding... |
|
Emerging |
| 1863 |
SingleZombie/LLSA
Official implementation of Log-linear Sparse Attention (LLSA). |
|
Emerging |
| 1864 |
jaketae/vit-breast-cancer
Transfer learning pretrained vision transformers for breast histopathology |
|
Emerging |
| 1865 |
laurab222/TSAD
Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate... |
|
Emerging |
| 1866 |
Praveengovianalytics/falcon-evaluate
Falcon Evaluate is an open-source Python library aims to revolutionise the... |
|
Emerging |
| 1867 |
Azure/nlp-samples
Japanese NLP sample codes |
|
Emerging |
| 1868 |
0x7o/text2keywords
Trained T5 and T5-large model for creating keywords from text |
|
Emerging |
| 1869 |
UCSC-REAL/DS2
[ICLR 2025] Official implementation of paper "Improving Data Efficiency via... |
|
Emerging |
| 1870 |
TIGER-AI-Lab/LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"... |
|
Emerging |
| 1871 |
MME-Benchmarks/Video-MME
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark... |
|
Emerging |
| 1872 |
SculptAI/GIMKit
Guided Infilling Modeling Toolkit |
|
Emerging |
| 1873 |
abenechehab/dicl
[ICLR 2025] Official implementation of DICL (Disentangled In-Context... |
|
Emerging |
| 1874 |
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving |
|
Emerging |
| 1875 |
ray-project/ray-llm
RayLLM - LLMs on Ray (Archived). Read README for more info. |
|
Emerging |
| 1876 |
styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP. |
|
Emerging |
| 1877 |
monologg/HanBert-Transformers
HanBert on 🤗 Huggingface Transformers 🤗 |
|
Emerging |
| 1878 |
Aloereed/llama.cpp-server-ohos
Llama.cpp server for OpenHarmony |
|
Emerging |
| 1879 |
suamin/T2NER
T2NER: Transformers based Transfer Learning Framework for Named Entity... |
|
Emerging |
| 1880 |
hristijanpeshov/SHAP-Explainable-Lexicon-Model
This project proposes a novel methodology to automatically learn financial... |
|
Emerging |
| 1881 |
bvanaken/visbert
VisBERT: Demo web app for "How Does BERT Answer Questions?" |
|
Emerging |
| 1882 |
gbaptista/ollama-ai
A Ruby gem for interacting with Ollama's API that allows you to run open... |
|
Emerging |
| 1883 |
cakshat/AlloyBERT
Introducing AlloyBERT: a transformer encoder-based model for predicting... |
|
Emerging |
| 1884 |
epfl-dlab/llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent... |
|
Emerging |
| 1885 |
cosbidev/NAIM
Official implementation for the paper ``Not Another Imputation Method: A... |
|
Emerging |
| 1886 |
JinjieNi/MegaDLMs
GPU-optimized framework for training diffusion language models at any scale.... |
|
Emerging |
| 1887 |
kyegomez/M2PT
Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:... |
|
Emerging |
| 1888 |
AaronFeng753/Ollama-Model-Dumper
Export and Backup Ollama models into GGUF and ModelFile |
|
Emerging |
| 1889 |
aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow
This repository contains the implementation of paper Temporal Fusion... |
|
Emerging |
| 1890 |
yaph/charla
A terminal based chat application that works with AI language models. |
|
Emerging |
| 1891 |
abhilashreddys/Fake-News-Article
Detecting fake news articles by analyzing patterns in writing. |
|
Emerging |
| 1892 |
hao-ai-lab/DistCA
Efficient Long-context Language Model Training by Core Attention Disaggregation |
|
Emerging |
| 1893 |
GeorgeMichailidis/multi-task-mixed-freq
Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on... |
|
Emerging |
| 1894 |
real-stanford/reflect
[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation... |
|
Emerging |
| 1895 |
BUAADreamer/Chinese-LLaVA-Med
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine |
|
Emerging |
| 1896 |
robinhad/kruk
Ukrainian instruction-tuned language models and datasets |
|
Emerging |
| 1897 |
ahans30/goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs |
|
Emerging |
| 1898 |
asigalov61/Perceiver-Music-Transformer
SOTA Google's Perceiver-AR Music Transformer Implementation and Model |
|
Emerging |
| 1899 |
zhilizju/Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and... |
|
Emerging |
| 1900 |
DFKI-NLP/thermostat
Collection of NLP model explanations and accompanying analysis tools |
|
Emerging |