All Transformer Models
7,795 models ranked by quality score · Page 23 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2201 |
michaelnny/QLoRA-LLM
A simple custom QLoRA implementation for fine-tuning a language model (LLM)... |
|
Emerging |
| 2202 |
johndpope/OmniTransfer-hack
OmniTransfer implementation for LTX-2 (work in progress) |
|
Emerging |
| 2203 |
liaoyuhua/LLM4TS
Large Language & Foundation Models for Time Series. |
|
Emerging |
| 2204 |
steinbergmedia/libmusictok
C++ Library for tokenizing MIDI files, designed to be compatible with the... |
|
Emerging |
| 2205 |
OneInterface/realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see |
|
Emerging |
| 2206 |
zerovl/ZeroVL
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources |
|
Emerging |
| 2207 |
Adversing/hf-model-checker
A tool to analyze HuggingFace models and determine their compatibility with... |
|
Emerging |
| 2208 |
jaygala24/fed-hate-speech
The official code repository for the paper titled "A Federated Approach for... |
|
Emerging |
| 2209 |
nanowell/Differential-Transformer-PyTorch
PyTorch implementation of the Differential-Transformer architecture for... |
|
Emerging |
| 2210 |
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO. |
|
Emerging |
| 2211 |
google/curie
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long... |
|
Emerging |
| 2212 |
Sunona-AI-labs/sunona
Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,... |
|
Emerging |
| 2213 |
Kaleidophon/nlp-uncertainty-zoo
Model zoo for different kinds of uncertainty quantification methods used in... |
|
Emerging |
| 2214 |
CEC-Agent/CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for... |
|
Emerging |
| 2215 |
moritztng/fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
|
Emerging |
| 2216 |
mcp-tool-shop-org/backpropagate
Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch... |
|
Emerging |
| 2217 |
kkahatapitiya/LangRepo
Code for our ACL 2025 paper "Language Repository for Long Video Understanding" |
|
Emerging |
| 2218 |
suyash/mlt
Multilingual Neural Machine Translation using Transformers with Conditional... |
|
Emerging |
| 2219 |
hesamsheikh/llm-mechanics
Coding an LLM and its building blocks from scratch. |
|
Emerging |
| 2220 |
florist-notes/aicore_n
Artificial Intelligence > Machine Learning > Deep Learning |
|
Emerging |
| 2221 |
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on... |
|
Emerging |
| 2222 |
starmpcc/CAMEL
Clinically Adapted Model Enhanced from LLaMA |
|
Emerging |
| 2223 |
Hamtech-ai/Persian-Image-Captioning
A Persian Image Captioning model based on Vision Encoder Decoder Models of... |
|
Emerging |
| 2224 |
18907305772/Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage... |
|
Emerging |
| 2225 |
hpdps-group/ElasticMM
ElasticMM: Elastic and Efficient MLLM Serving System |
|
Emerging |
| 2226 |
JessicaLopezEspejel/HazPi
HazPi is a modified Transformer(Vaswani et al., 2017) neural network... |
|
Emerging |
| 2227 |
GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning
Llama 3 ORPO Fine Tuning on A100 in Colab Pro. |
|
Emerging |
| 2228 |
holarissun/RewardModelingBeyondBradleyTerry
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models... |
|
Emerging |
| 2229 |
egaoharu-kensei/flash-attention-triton
Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with... |
|
Emerging |
| 2230 |
nestordemeure/stop_word
Huggingface transformers stopping criteria that halts the generation when a... |
|
Emerging |
| 2231 |
deep-div/PlotLLM
Data Visualization with LLM automatically analyzes data and generates... |
|
Emerging |
| 2232 |
StargazerX0/ScaleKV
[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with... |
|
Emerging |
| 2233 |
DunnBC22/Vision_Audio_and_Multimodal_Projects
This repository includes all computer vision, audio, document AI, and... |
|
Emerging |
| 2234 |
Beomi/BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of... |
|
Emerging |
| 2235 |
hhy-huang/GraphJudge
[EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good... |
|
Emerging |
| 2236 |
asigalov61/Giant-Music-Transformer
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with... |
|
Emerging |
| 2237 |
CristiVlad25/ai-papers
Tracing the evolution of AI and large language models from early neural... |
|
Emerging |
| 2238 |
wang2226/Awesome-LLM-Decoding
📜 Paper list on decoding methods for LLMs and LVLMs |
|
Emerging |
| 2239 |
fboulnois/llm-leaderboard-csv
CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to... |
|
Emerging |
| 2240 |
Gen-Verse/ReasonFlux
[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,... |
|
Emerging |
| 2241 |
llmapi-io/llmapi-cli
Command-line client and python development library for accessing LLM's... |
|
Emerging |
| 2242 |
SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with... |
|
Emerging |
| 2243 |
bayartsogt-ya/albert-mongolian
ALBERT trained on Mongolian text corpus |
|
Emerging |
| 2244 |
kingabzpro/French-to-Fongbe-and-Ewe-MT
The objective of this challenge is to create a machine translation system... |
|
Emerging |
| 2245 |
tugot17/Discord-Language-Detection-Bot
Restrict the use of forbidden languages on your discord server with transformers |
|
Emerging |
| 2246 |
VITA-Group/Ms-PoE
"Found in the Middle: How Language Models Use Long Contexts Better via... |
|
Emerging |
| 2247 |
CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths
The open-source Mixture of Depths code and the official implementation of... |
|
Emerging |
| 2248 |
bobazooba/xllm-demo
Demo project using XLLM |
|
Emerging |
| 2249 |
DAMO-NLP-SG/multilingual-safety-for-LLMs
[ICLR 2024]Data for "Multilingual Jailbreak Challenges in Large Language Models" |
|
Emerging |
| 2250 |
asahi417/lm-vocab-trimmer
Vocabulary Trimming (VT) is a model compression technique, which reduces a... |
|
Emerging |
| 2251 |
Scicrop/llm-vision-basics
Educational notebooks that demystify Large Language Models and Computer... |
|
Emerging |
| 2252 |
SuperBianC/scMulan
Repository for paper scMulan: a multitask generative pre-trained language... |
|
Emerging |
| 2253 |
JoelDeonDsouza/Zenpool_LLM
Zenpool is a compact, fine-tuned MLL (Mini Language Learner) model |
|
Emerging |
| 2254 |
lpalbou/AbstractLLM
A unified interface for Large Language Models with memory, reasoning, and... |
|
Emerging |
| 2255 |
rafalposwiata/depression-detection-lt-edi-2022
This repository contains the code of our winning solution for the Shared... |
|
Emerging |
| 2256 |
deepmancer/advanced-recommender-system
Advance information retrieval system that combines advanced indexing,... |
|
Emerging |
| 2257 |
AchiraNadeeshan/social-activity-job-matcher
PathFinder is a job recommendation web application that allows users to... |
|
Emerging |
| 2258 |
GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs
Multi GPU Fine Training LLMs using DeepSpeed and Accelerate. |
|
Emerging |
| 2259 |
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language... |
|
Emerging |
| 2260 |
daskol/llama.py
Python bindings to llama.cpp |
|
Emerging |
| 2261 |
adithya-s-k/CompanionLLM
CompanionLLM - A framework to finetune LLMs to be your own sentient... |
|
Emerging |
| 2262 |
microsoft/MMLU-CF
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025] |
|
Emerging |
| 2263 |
maciekt07/Lecture-Note-Generator-POC
📒 A proof-of-concept app that transcribes lecture recordings into text and... |
|
Emerging |
| 2264 |
CodingPlatelets/transformer_MM
Accelerator for LLM Based on Chisel3 |
|
Emerging |
| 2265 |
davzoku/cria
An end-to-end LLM app prototype based on Llama 2 |
|
Emerging |
| 2266 |
hasanisaeed/C-Transformer
Implementation of the core Transformer architecture in pure C |
|
Emerging |
| 2267 |
IIT-DM/BattleofLLMs
Benchmarks of LLMs with Conversational QA datasets. |
|
Emerging |
| 2268 |
SachinKalsi/annotated-research-papers
This repository is a comprehensive collection of research papers,... |
|
Emerging |
| 2269 |
isaacus-dev/emubert-creator
The training code behind EmuBert, the largest open-source masked language... |
|
Emerging |
| 2270 |
JonnoB/training_lms_with_synthetic_data
A repo for training Language models to correct errors in OCR text |
|
Emerging |
| 2271 |
zatevakhin/obsidian-local-llm
Obsidian Local LLM is a plugin for Obsidian that provides access to a... |
|
Emerging |
| 2272 |
GiorgiaAuroraAdorni/gansformer-reproducibility-challenge
Replication of the novel Generative Adversarial Transformer. |
|
Emerging |
| 2273 |
SertraFurr/DuckDuckAI
Python API Wrapper to interact with DuckDuckAI |
|
Emerging |
| 2274 |
XavierSpycy/hands-on-lora
Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples... |
|
Emerging |
| 2275 |
krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers
Rank 1 / 216 |
|
Emerging |
| 2276 |
martin-wey/CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) |
|
Emerging |
| 2277 |
HKUDS/RecLM
[ACL2025] "RecLM: Recommendation Instruction Tuning" |
|
Emerging |
| 2278 |
ISNE11/CheatSheet-LLM
Run local Large Language Models (LLMs) offline using Ollama – interact with... |
|
Emerging |
| 2279 |
Srijan-D/LangChain-v0.2-HuggingFace-Llama3
This project integrates LangChain v0.2.6, HuggingFace Serverless Inference... |
|
Emerging |
| 2280 |
zake7749/Kyara
[Kaggle-2nd] Lightweight yet Effective Chinese LLM. |
|
Emerging |
| 2281 |
NotYuSheng/Multimodal-Large-Language-Model
Localized Multimodal Large Language Model (MLLM) integrated with Streamlit... |
|
Emerging |
| 2282 |
wangcongcong123/transection
Transection: Transformers for English to Chinese Translation |
|
Emerging |
| 2283 |
yingding/applyllm
A python package for applying LLM with LangChain and Hugging Face on local... |
|
Emerging |
| 2284 |
DoubleVII/lithft
Pretrain, finetune any LLMs from huggingface on your own data. |
|
Emerging |
| 2285 |
micahondiwa/applied-ai
Deep Learning for Computer Vision: A collection of 6 end-to-end applied AI... |
|
Emerging |
| 2286 |
caua1503/llm-tool-fusion
llm-tool-fusion é uma biblioteca Python que unifica e simplifica o uso de... |
|
Emerging |
| 2287 |
TheAnkurGoswami/Neural-Networks-from-Scratch
Implementation of different neural networks with back-propagation logic. |
|
Emerging |
| 2288 |
rabiloo/llm-finetuning
Sample for Fine-Tuning LLMs & VLMs |
|
Emerging |
| 2289 |
gabe00122/jaxrl
Partially Observable Multi-Agent RL with Transformers |
|
Emerging |
| 2290 |
lennartpollvogt/ollama-instructor
Python library for the instruction and reliable validation of structured... |
|
Emerging |
| 2291 |
black-roland/homeassistant-cloud-ru-ai
Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant |
|
Emerging |
| 2292 |
KRR-Oxford/LLMap-Prelim
A preliminary investigation for ontology alignment (OM) with large language... |
|
Emerging |
| 2293 |
levashi/reprobe
Phase-aware LLM activation steering and linear probing. A memory-efficient,... |
|
Emerging |
| 2294 |
gunnarnordqvist/opencode-context-filter
Transparent HTTP proxy that automatically filters repository context for... |
|
Emerging |
| 2295 |
yonahgraphics/openevalkit
Production-grade Python framework for evaluating LLM and agentic systems... |
|
Emerging |
| 2296 |
Naman-ntc/FastCode
Utilities for efficient fine-tuning, inference and evaluation of code... |
|
Emerging |
| 2297 |
dhpollack/huggingface_libtorch
Minimal example of using a traced huggingface transformers model with libtorch |
|
Emerging |
| 2298 |
sajjjadayobi/ParsBigBird
Persian Bert For Long-Range Sequences |
|
Emerging |
| 2299 |
shizhouxing/Robustness-Verification-for-Transformers
[ICLR 2020] Code for paper "Robustness Verification for Transformers" |
|
Emerging |
| 2300 |
aniass/Spam-detection
Spam detection in SMS messages with BERT model and Machine Learning algorithms |
|
Emerging |