Trending Transformer Models
Models with the biggest quality score improvements over the last 14 days.
| # | Model | Change | Score | Tier |
|---|---|---|---|---|
| 1 |
Knuckles-Team/genius-chatbot
Chatbot that uses any desired hugging face model or allows for scalable... |
+20 | 42 | Emerging |
| 2 |
YashrajBaila7/GPT2LM
A implimentation of GPT2 varient. |
+19 | 29 | Experimental |
| 3 |
rameshvarun/magic-lamp
Magic LLM-powered Python functions that return anything you ask for. Many caveats. |
+19 | 29 | Experimental |
| 4 |
elinx/safe-view
A terminal-based application for visualizing and analyzing safetensors files. |
+17 | 25 | Experimental |
| 5 |
rxn4chemistry/rxn-onmt-models
Training of OpenNMT-based RXN models |
+16 | 47 | Emerging |
| 6 |
kmaurinjones/AllMeans
Automatic topic modelling using minimal external input and computational resources |
+16 | 33 | Emerging |
| 7 |
sagorbrur/fillblank
Fill The Blank |
+16 | 27 | Experimental |
| 8 |
cui-shaobo/causal-strength
evaluating the causal strength between cause and effect |
+16 | 27 | Experimental |
| 9 |
duck4i/retro-ui
Retro Llama |
+16 | 19 | Experimental |
| 10 |
ndoll1998/active-transformers
Active Learning for Transformer with focus on Sequence Tagging tasks |
+16 | 27 | Experimental |
| 11 |
yingding/applyllm
A python package for applying LLM with LangChain and Hugging Face on local... |
+16 | 33 | Emerging |
| 12 |
ash-01xor/Imgcap
A CLI to generate captions for images |
+16 | 19 | Experimental |
| 13 |
lpalbou/model-quantizer
Effortlessly quantize, benchmark, and publish Hugging Face models with... |
+16 | 27 | Experimental |
| 14 |
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin |
+16 | 51 | Established |
| 15 |
ffreemt/convbot
A conversational bot based on huggingface transformers |
+14 | 24 | Experimental |
| 16 |
bodaay/HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets |
+14 | 63 | Established |
| 17 |
earthai-tech/fusionlab-learn
fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures |
+13 | 37 | Emerging |
| 18 |
fmueller/scribae
CLI to turn Markdown notes into SEO briefs, drafts, metadata, and... |
+13 | 36 | Emerging |
| 19 |
Riko0/messenger_logger_callback
messenger-logger-callback — Send ML training logs to Telegram. Standalone... |
+12 | 28 | Experimental |
| 20 |
tue-mps/eomt
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask... |
+10 | 56 | Established |
| 21 |
NexaAI/nexa-sdk
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and... |
+10 | 60 | Established |
| 22 |
EricLBuehler/mistral.rs
Fast, flexible LLM inference |
+10 | 65 | Established |
| 23 |
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training... |
+10 | 68 | Established |
| 24 |
mukel/llama3.java
Practical Llama 3 inference in Java |
+10 | 59 | Established |
| 25 |
argosopentech/argos-translate
Open-source offline translation library written in Python |
+10 | 58 | Established |
| 26 |
ggml-org/llama.vim
Vim plugin for LLM-assisted code/text completion |
+10 | 55 | Established |
| 27 |
jingyaogong/minimind
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h! |
+10 | 67 | Established |
| 28 |
levashi/reprobe
Phase-aware LLM activation steering and linear probing. A memory-efficient,... |
+9 | 33 | Emerging |
| 29 |
changyeyu/LLM-RL-Visualized
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps ) |
+9 | 58 | Established |
| 30 |
NVIDIA/kvpress
LLM KV cache compression made easy |
+8 | 63 | Established |
| 31 |
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models |
+8 | 53 | Established |
| 32 |
dirmacs/lancor
A Rust client library for llama.cpp's OpenAI-compatible API server |
+8 | 37 | Emerging |
| 33 |
homerjed/transformer_flows
Implementation of Apple ML's Transformer Flow (or TARFlow) from "Normalising... |
+7 | 22 | Experimental |
| 34 |
telekom/transformer-tools
Transformers Training Tools |
+7 | 33 | Emerging |
| 35 |
SalehAhmedShafin/Multimodal-Disaster-Event-Identification-from-Social-Media-Posts
We have proposed a multimodal approach. Where we first took the best... |
+7 | 12 | Experimental |
| 36 |
ai-center-kth/cuBERT-source-code-clustering
Fine-tuning cuBERT embeddings for clustering source code by functionality |
+7 | 25 | Experimental |
| 37 |
vkhamesi/proteins
🧬 Fine-Tuning Large Language and Protein Models on a single T4 GPU via... |
+7 | 21 | Experimental |
| 38 |
BoCtrl-C/attention-rollout
Unofficial PyTorch implementation of Attention Rollout |
+7 | 22 | Experimental |
| 39 |
Mussabat/HateSpeech-EACL-2024
This repository contains the system description and the codes that we... |
+7 | 12 | Experimental |
| 40 |
ertosns/wiki-summary
wikipedia summarizer transformer |
+7 | 21 | Experimental |
| 41 |
Zefan-Cai/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models |
+7 | 47 | Emerging |
| 42 |
ExposedCat/tg-local-llm
Run local LLMs powered up by tools in Telegram Messenger |
+7 | 24 | Experimental |
| 43 |
robertocarlosmedina/attention-transformer-translator-1
Sequence to Sequence Transformer implementation in order to train a model to... |
+7 | 17 | Experimental |
| 44 |
alta3/llm-the-alta3-way
The greatest LLMs on the planet! |
+7 | 22 | Experimental |
| 45 |
wiktor-k/llama-chat
Implements a simple REPL chat with a locally running instance of Ollama. |
+7 | 22 | Experimental |
| 46 |
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of... |
+7 | 34 | Emerging |
| 47 |
somosnlp/the-annotated-transformer
Traducción al español del notebook "The Annotated Transformer" de Harvard... |
+7 | 27 | Experimental |
| 48 |
Riccorl/ner-serve
Simple NER model using Docker, FastAPI, ONNX and Multilingual Mini-LM. |
+7 | 12 | Experimental |
| 49 |
Avinash-Acharya/Arishtha
A Proof-of-Concept for a kids specific browser which provide... |
+7 | 17 | Experimental |
| 50 |
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method... |
+7 | 42 | Emerging |
| 51 |
abhimishra91/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks |
+7 | 51 | Established |
| 52 |
AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain) |
+7 | 48 | Emerging |
| 53 |
locuslab/wanda
A simple and effective LLM pruning approach. |
+7 | 47 | Emerging |
| 54 |
styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP. |
+7 | 36 | Emerging |
| 55 |
Lostefra/deep_comedy
A TensorFlow Transformer able to generate verses in the style of Dante's... |
+7 | 17 | Experimental |
| 56 |
marksgraham/transformer-ood
Official PyTorch code for "Transformer-based out-of-distribution detection... |
+7 | 20 | Experimental |
| 57 |
swainshashwat/Flock
Craft custom Language Model Models (LLMs) effortlessly using Flock. Build... |
+7 | 33 | Emerging |
| 58 |
maximkm/DLA_ASR_HW
ASR pytorch project |
+7 | 19 | Experimental |
| 59 |
ShreyJaiswal1/aichatbot
This is a Simple AI chatbot website ;) still learning to make it better |
+7 | 21 | Experimental |
| 60 |
georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. |
+7 | 46 | Emerging |
| 61 |
HEMANGANI/LLM-Recommendation-Systems
This project fine-tunes large language models (LLMs) for text-based... |
+7 | 21 | Experimental |
| 62 |
eriknovak/LM-EMD
Interpretable cross-lingual document ranking using a multilingual language... |
+7 | 17 | Experimental |
| 63 |
X-rayLaser/DistributedLLM
Run LLM inference by spliting models into parts and hosting each part on a... |
+7 | 20 | Experimental |
| 64 |
KishanBagaria/dAbot
🤖 CLI tool to automate stuff on DeviantArt.com |
+7 | 38 | Emerging |
| 65 |
NS027/medical_chatbot_project_genAI
Multimodal AI-powered medical assistant with LLMs, speech, and image understanding. |
+7 | 24 | Experimental |
| 66 |
AbhinavGH/AI-Chatbot-Bol-Bhai
This is an AI chatbot that uses Google's SpeechRecognition API and... |
+7 | 22 | Experimental |
| 67 |
SMMousaviSP/huggingface_transformers_tutorial
How to fine-tune transformer models for text classification using Hugging... |
+7 | 17 | Experimental |
| 68 |
CtrlAltFly/AIML-Projects
these are my projects that i submitted for AIML course with great lakes &... |
+7 | 17 | Experimental |
| 69 |
rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice
AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and... |
+7 | 33 | Emerging |
| 70 |
ariya/query-llm
Query LLM with Chain-of-Tought |
+7 | 36 | Emerging |
| 71 |
Quotify-Bot/quotify-frontend
AI-powered inspirational quote generator |
+7 | 27 | Experimental |
| 72 |
gokul-pv/PanopticSegmentation
Panoptic segmentation on custom construction objects using DETR |
+7 | 25 | Experimental |
| 73 |
eljandoubi/PaliGemma
Coding PaliGemma from scratch using pytorch for inference. |
+7 | 17 | Experimental |
| 74 |
Ahwar/NER-NLP-with-ONNX-Java
A Java NLP application that identifies names, organizations, and locations... |
+7 | 25 | Experimental |
| 75 |
nlpaueb/greek-bert
A Greek edition of BERT pre-trained language model |
+7 | 38 | Emerging |
| 76 |
vijaydwivedi75/gnn-lspe
Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural... |
+7 | 44 | Emerging |
| 77 |
nerve-sparks/iris_android
IRIS is an android app for interfacing with GGUF / llama.cpp models locally. |
+7 | 43 | Emerging |
| 78 |
NTT123/sketch-transformer
Modeling Draw, Quick! dataset using transformers |
+7 | 29 | Experimental |
| 79 |
sak96/rust_llama_app
Chat bot (llama) written in rust using Yew and Tauri. |
+7 | 21 | Experimental |
| 80 |
AnkitNayak-eth/Llama-AI
Powered by the Llama 3.3 70B API, it delivers advanced, context-aware, and... |
+7 | 23 | Experimental |
| 81 |
gmongaras/Wizard_QLoRA_Finetuning
Finetuning Some Wizard Models With QLoRA |
+7 | 29 | Experimental |
| 82 |
abhimishra91/jarvis-service
NLP Service to perform text classification. This is the first part of... |
+7 | 17 | Experimental |
| 83 |
Hexastack/hexabot-template-starter
Hexabot Project Starter Template, fork this project to create you own... |
+7 | 28 | Experimental |
| 84 |
bytedance/SALMONN
SALMONN family: A suite of advanced multi-modal LLMs |
+7 | 54 | Established |
| 85 |
mujaffarbhati/AI-Chatbot-End-to-End-via-Flask
Chatbot made via NLP for Question - Answering purposes as of a support... |
+7 | 21 | Experimental |
| 86 |
KasperGroesLudvigsen/influenza_transformer
PyTorch implementation of Transformer model used in "Deep Transformer Models... |
+7 | 41 | Emerging |
| 87 |
alexrozanski/LlamaChat
Chat with your favourite LLaMA models in a native macOS app |
+7 | 40 | Emerging |
| 88 |
IanConceicao/Com2Sense-Challenge
Applying natural language processing for common sense evaluation. |
+7 | 17 | Experimental |
| 89 |
tasketh/tasketh
tasketh is a simple discord bot that lets moderators assign, and users claim tasks. |
+7 | 32 | Emerging |
| 90 |
RohitMurali18/Music-Generation-Emotion-Adaptive
This project implements an Emotion-Aware Music Generator (EAMG) that turns... |
+7 | 11 | Experimental |
| 91 |
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way |
+7 | 45 | Emerging |
| 92 |
jwbay/misc-ts-transformers
Miscellaneous TypeScript transformers |
+7 | 17 | Experimental |
| 93 |
huggingface/llm_training_handbook
An open collection of methodologies to help with successful training of... |
+7 | 41 | Emerging |
| 94 |
claw1200/llama-cord
Discord App for Interacting with local Ollama Models. Multiple Agents Supported! |
+7 | 19 | Experimental |
| 95 |
HandsOnLLM/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models" |
+7 | 57 | Established |
| 96 |
hexuandeng/DRPruning
Implementation for our paper “DRPruning: Efficient Large Language Model... |
+7 | 30 | Emerging |
| 97 |
spongedsc/pathways
Pathways: multi-modal AI/ML models on discord |
+7 | 19 | Experimental |
| 98 |
atomlayer/llamachan
llamachan is a project that realises the idea of a dead internet for an imageboard |
+7 | 19 | Experimental |
| 99 |
Spectrewolf8/PHi-3-SQL-generation-fine-tune-experiment
A fine-tuned version of Phi-3-mini-4k-instruct for generating SQL queries... |
+7 | 20 | Experimental |
| 100 |
JaspreetSingh-exe/Music-Genre-Classification
This project builds a Music Genre Classification System using SVM, CNN,... |
+7 | 17 | Experimental |