All Transformer Models
7,795 models ranked by quality score · Page 33 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3201 |
Loguru-AI/Loguru-CLI
An interactive commandline interface that brings intelligence to your logs. |
|
Experimental |
| 3202 |
HamidrezaGholamrezaei/LLM-Text-Classification-with-RoBERTa
A project demonstrating the use of Large Language Models (LLMs) for text... |
|
Experimental |
| 3203 |
rasyosef/amharic-news-category-classification
notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and... |
|
Experimental |
| 3204 |
HemantBK/LLaMA-Sum-Fine-Tuning
Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit... |
|
Experimental |
| 3205 |
sayannath/ViT-TF-Hub-Application
Build and fine-tune your Image Classifier using a Vision Transformer Model... |
|
Experimental |
| 3206 |
dimiz51/DETR-Factory-PyTorch
This project is an implementation of the Detection Transformer (DETR) and... |
|
Experimental |
| 3207 |
FuxiaoLiu/VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning |
|
Experimental |
| 3208 |
aaronbriel/absum
Abstractive Summarization for Data Augmentation |
|
Experimental |
| 3209 |
iovdin/tune-models
LLM models for tune, from openai, anthropic, openrouter, groq, ollama, mistral |
|
Experimental |
| 3210 |
Riko0/messenger_logger_callback
messenger-logger-callback — Send ML training logs to Telegram. Standalone... |
|
Experimental |
| 3211 |
LegendLeoChen/llm-finetune
使用trl、peft、transformers等库,实现对huggingface上模型的微调。 |
|
Experimental |
| 3212 |
alxfgh/Large-Language-Models-in-Chemistry
Working collection of papers, repos and models of transformer based language... |
|
Experimental |
| 3213 |
EdgeTypE/FolderToLLM
A PowerShell tool for Windows that recursively scans a directory, captures... |
|
Experimental |
| 3214 |
kenzic/run-models-in-the-browser-with-transformers.js-demo
Working demo for the article Run Models in the Browser With Transformers.js |
|
Experimental |
| 3215 |
Furyton/awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of... |
|
Experimental |
| 3216 |
KimJaehee0725/YoYAK
[제 13회 투빅스 컨퍼런스] YoYAK - Yes or Yes, Attention with gap-sentence for Korean... |
|
Experimental |
| 3217 |
MalihehIzadi/catiss
CatIss is an intelligent tool for automatic categorization of issue reports... |
|
Experimental |
| 3218 |
cpuheater/cause-life-is-a-game
Solving games with reinforcement learning |
|
Experimental |
| 3219 |
ranpy13/Learning-LLM
Learning to build LLM from scratch, following rasbt/LLMs-from-scratch footsteps. |
|
Experimental |
| 3220 |
tanishqgautam/SETI-Breakthrough-Listen
Solution to the SETI Breakthrough Listen Competition hosted on Kaggle |
|
Experimental |
| 3221 |
fuglede/llama.ttf
A font for writing tiny stories |
|
Experimental |
| 3222 |
KillerShoaib/RLM-From-Scratch
Implementation of Recursive Language Model paper from scratch |
|
Experimental |
| 3223 |
IsaacRodgz/multimodal-transformers-movies
Experiments with multimodal deep learning models based on transformers |
|
Experimental |
| 3224 |
sayakpaul/vision-transformers-tf
A non-exhaustive collection of vision transformer models implemented in TensorFlow. |
|
Experimental |
| 3225 |
InternLM/Spark
An official implementation of "SPARK: Synergistic Policy And Reward... |
|
Experimental |
| 3226 |
Shekswess/tiny-think
Reasoning-first post-training for tiny language models (140M) on a single GPU. |
|
Experimental |
| 3227 |
dkurt/optimum-openvino
Intel OpenVINO extension for Hugging Face Transformers |
|
Experimental |
| 3228 |
rhubarbwu/linguistic-collapse
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models... |
|
Experimental |
| 3229 |
Kevo-03/AttentionNet
AttentionNet: Encrypted Network Traffic Classification Solution with... |
|
Experimental |
| 3230 |
grouzen/ollana
Ollama over LAN - Auto-discover your Ollama server on your local network... |
|
Experimental |
| 3231 |
mintaywon/IF_RLHF
Source code for 'Understanding impacts of human feedback via influence functions' |
|
Experimental |
| 3232 |
kaylode/vqa-transformer
Visual Question Answering using Transformer and Bottom-Up attention.... |
|
Experimental |
| 3233 |
knoveleng/steering
Official repo for the paper: "Selective Steering: Norm-Preserving Control... |
|
Experimental |
| 3234 |
Beomi/easy-lm-trainer
🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드 |
|
Experimental |
| 3235 |
nullHawk/simple-transformer
Implementation of Transformer model in PyTorch |
|
Experimental |
| 3236 |
sober-clever/ReRe
The implementations of paper "Reinforced Preference Optimization for... |
|
Experimental |
| 3237 |
Hexastack/hexabot-template-starter
Hexabot Project Starter Template, fork this project to create you own... |
|
Experimental |
| 3238 |
rozek/node-red-flow-llama
Node-RED Flow (and web page example) for the LLaMA AI model |
|
Experimental |
| 3239 |
YanSte/NLP-LLM-Fine-tuning-Llame-2-QLoRA-2024
Natural Language Processing (NLP) and Large Language Models (LLM) with... |
|
Experimental |
| 3240 |
line/sacpo
[NeurIPS 2024] SACPO (Stepwise Alignment for Constrained Policy Optimization) |
|
Experimental |
| 3241 |
rahul13ramesh/compositional_capabilities
Compositional Capabilities of Autoregressive Transformers: A Study on... |
|
Experimental |
| 3242 |
subhasisj/FastAPI-Streamlit-Docker-NLP
Text Classification model deployment using FastAPI, Streamlit and Docker Compose |
|
Experimental |
| 3243 |
namuan/snap-assist
Summon intelligence in a snap |
|
Experimental |
| 3244 |
OSU-NLP-Group/QA4RE
[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models... |
|
Experimental |
| 3245 |
kaistAI/LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision |
|
Experimental |
| 3246 |
fattorib/transformer_shmap
Tensor Parallelism with JAX + Shard Map |
|
Experimental |
| 3247 |
GraphPKU/CoI
Chain of Images for Intuitively Reasoning |
|
Experimental |
| 3248 |
eilamc14/Simplify-This
Comparative Analysis of Prompt-Based and Fine-Tuned LLMs |
|
Experimental |
| 3249 |
yukyunglee/transformers-resources
huggingface transformers tutorial, code, resources |
|
Experimental |
| 3250 |
Qwen-Applications/STAR
STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function... |
|
Experimental |
| 3251 |
raghavbali/text_generation
Notebooks to better understand text generation |
|
Experimental |
| 3252 |
matt-k-wong/mlx-flash
Lightning-fast MLX utilities and optimizations for Apple Silicon |
|
Experimental |
| 3253 |
yucc2018/share
一些代码实践分享。 |
|
Experimental |
| 3254 |
robjsliwa/mlx-sd-single-file-models
Single safetensors file support Apple MLX Stable Diffusion |
|
Experimental |
| 3255 |
LEL-A/doc
Overarching documentation and planning to build so-called... |
|
Experimental |
| 3256 |
vbario/sleeping-llm
A language model that forms persistent memories from conversation and... |
|
Experimental |
| 3257 |
SJTU-IPADS/Bamboo
Bamboo-7B Large Language Model |
|
Experimental |
| 3258 |
Cre4T3Tiv3/unsloth-llama3-alpaca-lora
Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with... |
|
Experimental |
| 3259 |
cbacary/MoDeGPT
An implementation of the MoDeGPT LLM compression from the ICLR 2025... |
|
Experimental |
| 3260 |
jhuang265/Calibrating-LLMs-with-Label-Smoothing
Code to our ICML 2025 Paper "Calibrated Language Models and How to Find Them... |
|
Experimental |
| 3261 |
vbercy/g2tm-segmenter
Graph-Guided Token Merging (G2TM) is a lightweight one-shot module designed... |
|
Experimental |
| 3262 |
YukinoshitaKaren/Reason-KE
[EMNLP 2025 Findings] Robust Knowledge Editing via Explicit Reasoning Chains... |
|
Experimental |
| 3263 |
nerdimite/bert-finetuning-webinar
Code for the FullStack AI Live Coding Series- Part 1 (CellStrat AI Lab) |
|
Experimental |
| 3264 |
liziniu/GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large... |
|
Experimental |
| 3265 |
technion-cs-nlp/BiologicalTokenizers
Effect of tokenization on transformers for biological sequence |
|
Experimental |
| 3266 |
sanskar9999/CodeEvolveLLM
A framework for using local LLMs (Qwen2.5-coder 7B) that are fine-tuned... |
|
Experimental |
| 3267 |
robinzixuan/FROST
[ICLR 2026] FROST: Filtering Reasoning Outliers with Attention for Efficient... |
|
Experimental |
| 3268 |
frncscp/patacognition
Legacy repo for the Artificial Intelligence capable of patacón recognition... |
|
Experimental |
| 3269 |
lix19937/llm-deploy
AI Infra LLM infer/ tensorrt-llm/ vllm |
|
Experimental |
| 3270 |
llaraspata/HallucinationDetection
Analyzing the correlation between Hallucinations and Knowledge Conflicts in... |
|
Experimental |
| 3271 |
Frozen-Projects/AI_Cactus
Cactus AI framework plugin for UE5 to run local LLMs at runtime.... |
|
Experimental |
| 3272 |
LightDopper/skill-codex
🚀 Enable automated code analysis and editing with Claude Code using Codex... |
|
Experimental |
| 3273 |
EMalagoli92/CvT-TensorFlow
TensorFlow 2.X reimplementation of CvT: Introducing Convolutions to Vision... |
|
Experimental |
| 3274 |
Beomi/transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 |
|
Experimental |
| 3275 |
YuanheZ/LoRA-One
LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large ... |
|
Experimental |
| 3276 |
tsinghua-fib-lab/AAAI2025_MIA-Tuner
[AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training... |
|
Experimental |
| 3277 |
mts-ai/OpenAutoNLU
An open-source pipeline for training natural language understanding models |
|
Experimental |
| 3278 |
ArneBinder/pytorch-ie-hydra-template-1
PyTorch-IE Hydra Template |
|
Experimental |
| 3279 |
BenChaliah/NVFP4-on-4090-vLLM
AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with... |
|
Experimental |
| 3280 |
JIA-Lab-research/Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for... |
|
Experimental |
| 3281 |
proycon/deepfrog
An NLP-suite powered by deep learning |
|
Experimental |
| 3282 |
PKU-Alignment/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct |
|
Experimental |
| 3283 |
julienokumu/Solving-ML-Papers
Attempts at solving machine learning papers(9/_)✅ |
|
Experimental |
| 3284 |
atapour/rank-over-class
Source code for the training pipeline of the text ranking model used in the... |
|
Experimental |
| 3285 |
TamSiuhin/OPPU
Official Implementation of "Democratizing Large Language Models via... |
|
Experimental |
| 3286 |
dnbaker/bioseq
Tokenizers and Machine Learning Models for biological sequence data |
|
Experimental |
| 3287 |
osiriszjq/impulse_init
Convolutional Initialization for Data-Efficient Vision Transformers |
|
Experimental |
| 3288 |
wjn1996/HugNLP
HugNLP is a unified and comprehensive NLP library based on HuggingFace... |
|
Experimental |
| 3289 |
Yusuf80216/QnATables-An-Intelligent-Question-Answering-System
Question Answering System to answer question over tables in a document |
|
Experimental |
| 3290 |
bhavsarpratik/transformers
Implementations of transformer models in pytorch |
|
Experimental |
| 3291 |
kyegomez/Multi-Model-Training
An experimental repository on research for training multiple models all at... |
|
Experimental |
| 3292 |
LehengTHU/AlphaRec
[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can... |
|
Experimental |
| 3293 |
ArenRedd/AI-Chatbot-using-LLaMA2
Uncensored AI: An open-source, unrestricted LLaMA 2 model for free and raw... |
|
Experimental |
| 3294 |
Armaggheddon/ClipServe
🚀 ClipServe: A fast API server for embedding text, images, and performing... |
|
Experimental |
| 3295 |
arifulislamat/local-voice-cloning-app
Powered by ChatterboxTTS | Transformer | Llama | Gradio |
|
Experimental |
| 3296 |
TobyYang7/Llava_Qwen2
Visual Instruction Tuning for Qwen2 Base Model |
|
Experimental |
| 3297 |
unipr-org/AI
AI - Intelligenza Artificiale presso l'Università degli Studi di Parma (6 CFU). |
|
Experimental |
| 3298 |
aakinlalu/GenerativeAI
Series of generative artificial intelligence (AI) for creating new content,... |
|
Experimental |
| 3299 |
yoniLc/GeometricTransformerMolecule
Transformer for End to End Molecule Property Prediction |
|
Experimental |
| 3300 |
czg1225/VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient |
|
Experimental |