Trending Transformer Models

Models with the biggest quality score improvements over the last 14 days.

# Model Change Score Tier
1 Knuckles-Team/genius-chatbot

Chatbot that uses any desired hugging face model or allows for scalable...

+20 42 Emerging
2 YashrajBaila7/GPT2LM

A implimentation of GPT2 varient.

+19 29 Experimental
3 rameshvarun/magic-lamp

Magic LLM-powered Python functions that return anything you ask for. Many caveats.

+19 29 Experimental
4 elinx/safe-view

A terminal-based application for visualizing and analyzing safetensors files.

+17 25 Experimental
5 rxn4chemistry/rxn-onmt-models

Training of OpenNMT-based RXN models

+16 47 Emerging
6 kmaurinjones/AllMeans

Automatic topic modelling using minimal external input and computational resources

+16 33 Emerging
7 sagorbrur/fillblank

Fill The Blank

+16 27 Experimental
8 cui-shaobo/causal-strength

evaluating the causal strength between cause and effect

+16 27 Experimental
9 duck4i/retro-ui

Retro Llama

+16 19 Experimental
10 ndoll1998/active-transformers

Active Learning for Transformer with focus on Sequence Tagging tasks

+16 27 Experimental
11 yingding/applyllm

A python package for applying LLM with LangChain and Hugging Face on local...

+16 33 Emerging
12 ash-01xor/Imgcap

A CLI to generate captions for images

+16 19 Experimental
13 lpalbou/model-quantizer

Effortlessly quantize, benchmark, and publish Hugging Face models with...

+16 27 Experimental
14 OpenVoiceOS/ovos-audio-transformer-plugin-ggwave

data over sound plugin

+16 51 Established
15 ffreemt/convbot

A conversational bot based on huggingface transformers

+14 24 Experimental
16 bodaay/HuggingFaceModelDownloader

Simple go utility to download HuggingFace Models and Datasets

+14 63 Established
17 earthai-tech/fusionlab-learn

fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures

+13 37 Emerging
18 fmueller/scribae

CLI to turn Markdown notes into SEO briefs, drafts, metadata, and...

+13 36 Emerging
19 Riko0/messenger_logger_callback

messenger-logger-callback — Send ML training logs to Telegram. Standalone...

+12 28 Experimental
20 tue-mps/eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask...

+10 56 Established
21 NexaAI/nexa-sdk

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and...

+10 60 Established
22 EricLBuehler/mistral.rs

Fast, flexible LLM inference

+10 65 Established
23 shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training...

+10 68 Established
24 mukel/llama3.java

Practical Llama 3 inference in Java

+10 59 Established
25 argosopentech/argos-translate

Open-source offline translation library written in Python

+10 58 Established
26 ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

+10 55 Established
27 jingyaogong/minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

+10 67 Established
28 levashi/reprobe

Phase-aware LLM activation steering and linear probing. A memory-efficient,...

+9 33 Emerging
29 changyeyu/LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

+9 58 Established
30 NVIDIA/kvpress

LLM KV cache compression made easy

+8 63 Established
31 BradyFU/Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

+8 53 Established
32 dirmacs/lancor

A Rust client library for llama.cpp's OpenAI-compatible API server

+8 37 Emerging
33 homerjed/transformer_flows

Implementation of Apple ML's Transformer Flow (or TARFlow) from "Normalising...

+7 22 Experimental
34 telekom/transformer-tools

Transformers Training Tools

+7 33 Emerging
35 SalehAhmedShafin/Multimodal-Disaster-Event-Identification-from-Social-Media-Posts

We have proposed a multimodal approach. Where we first took the best...

+7 12 Experimental
36 ai-center-kth/cuBERT-source-code-clustering

Fine-tuning cuBERT embeddings for clustering source code by functionality

+7 25 Experimental
37 vkhamesi/proteins

🧬 Fine-Tuning Large Language and Protein Models on a single T4 GPU via...

+7 21 Experimental
38 BoCtrl-C/attention-rollout

Unofficial PyTorch implementation of Attention Rollout

+7 22 Experimental
39 Mussabat/HateSpeech-EACL-2024

This repository contains the system description and the codes that we...

+7 12 Experimental
40 ertosns/wiki-summary

wikipedia summarizer transformer

+7 21 Experimental
41 Zefan-Cai/KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

+7 47 Emerging
42 ExposedCat/tg-local-llm

Run local LLMs powered up by tools in Telegram Messenger

+7 24 Experimental
43 robertocarlosmedina/attention-transformer-translator-1

Sequence to Sequence Transformer implementation in order to train a model to...

+7 17 Experimental
44 alta3/llm-the-alta3-way

The greatest LLMs on the planet!

+7 22 Experimental
45 wiktor-k/llama-chat

Implements a simple REPL chat with a locally running instance of Ollama.

+7 22 Experimental
46 Tebmer/Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of...

+7 34 Emerging
47 somosnlp/the-annotated-transformer

Traducción al español del notebook "The Annotated Transformer" de Harvard...

+7 27 Experimental
48 Riccorl/ner-serve

Simple NER model using Docker, FastAPI, ONNX and Multilingual Mini-LM.

+7 12 Experimental
49 Avinash-Acharya/Arishtha

A Proof-of-Concept for a kids specific browser which provide...

+7 17 Experimental
50 tatsu-lab/alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method...

+7 42 Emerging
51 abhimishra91/transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP tasks

+7 51 Established
52 AndrewZhe/lawyer-llama

中文法律LLaMA (LLaMA for Chinese legel domain)

+7 48 Emerging
53 locuslab/wanda

A simple and effective LLM pruning approach.

+7 47 Emerging
54 styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

+7 36 Emerging
55 Lostefra/deep_comedy

A TensorFlow Transformer able to generate verses in the style of Dante's...

+7 17 Experimental
56 marksgraham/transformer-ood

Official PyTorch code for "Transformer-based out-of-distribution detection...

+7 20 Experimental
57 swainshashwat/Flock

Craft custom Language Model Models (LLMs) effortlessly using Flock. Build...

+7 33 Emerging
58 maximkm/DLA_ASR_HW

ASR pytorch project

+7 19 Experimental
59 ShreyJaiswal1/aichatbot

This is a Simple AI chatbot website ;) still learning to make it better

+7 21 Experimental
60 georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

+7 46 Emerging
61 HEMANGANI/LLM-Recommendation-Systems

This project fine-tunes large language models (LLMs) for text-based...

+7 21 Experimental
62 eriknovak/LM-EMD

Interpretable cross-lingual document ranking using a multilingual language...

+7 17 Experimental
63 X-rayLaser/DistributedLLM

Run LLM inference by spliting models into parts and hosting each part on a...

+7 20 Experimental
64 KishanBagaria/dAbot

🤖 CLI tool to automate stuff on DeviantArt.com

+7 38 Emerging
65 NS027/medical_chatbot_project_genAI

Multimodal AI-powered medical assistant with LLMs, speech, and image understanding.

+7 24 Experimental
66 AbhinavGH/AI-Chatbot-Bol-Bhai

This is an AI chatbot that uses Google's SpeechRecognition API and...

+7 22 Experimental
67 SMMousaviSP/huggingface_transformers_tutorial

How to fine-tune transformer models for text classification using Hugging...

+7 17 Experimental
68 CtrlAltFly/AIML-Projects

these are my projects that i submitted for AIML course with great lakes &...

+7 17 Experimental
69 rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice

AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and...

+7 33 Emerging
70 ariya/query-llm

Query LLM with Chain-of-Tought

+7 36 Emerging
71 Quotify-Bot/quotify-frontend

AI-powered inspirational quote generator

+7 27 Experimental
72 gokul-pv/PanopticSegmentation

Panoptic segmentation on custom construction objects using DETR

+7 25 Experimental
73 eljandoubi/PaliGemma

Coding PaliGemma from scratch using pytorch for inference.

+7 17 Experimental
74 Ahwar/NER-NLP-with-ONNX-Java

A Java NLP application that identifies names, organizations, and locations...

+7 25 Experimental
75 nlpaueb/greek-bert

A Greek edition of BERT pre-trained language model

+7 38 Emerging
76 vijaydwivedi75/gnn-lspe

Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural...

+7 44 Emerging
77 nerve-sparks/iris_android

IRIS is an android app for interfacing with GGUF / llama.cpp models locally.

+7 43 Emerging
78 NTT123/sketch-transformer

Modeling Draw, Quick! dataset using transformers

+7 29 Experimental
79 sak96/rust_llama_app

Chat bot (llama) written in rust using Yew and Tauri.

+7 21 Experimental
80 AnkitNayak-eth/Llama-AI

Powered by the Llama 3.3 70B API, it delivers advanced, context-aware, and...

+7 23 Experimental
81 gmongaras/Wizard_QLoRA_Finetuning

Finetuning Some Wizard Models With QLoRA

+7 29 Experimental
82 abhimishra91/jarvis-service

NLP Service to perform text classification. This is the first part of...

+7 17 Experimental
83 Hexastack/hexabot-template-starter

Hexabot Project Starter Template, fork this project to create you own...

+7 28 Experimental
84 bytedance/SALMONN

SALMONN family: A suite of advanced multi-modal LLMs

+7 54 Established
85 mujaffarbhati/AI-Chatbot-End-to-End-via-Flask

Chatbot made via NLP for Question - Answering purposes as of a support...

+7 21 Experimental
86 KasperGroesLudvigsen/influenza_transformer

PyTorch implementation of Transformer model used in "Deep Transformer Models...

+7 41 Emerging
87 alexrozanski/LlamaChat

Chat with your favourite LLaMA models in a native macOS app

+7 40 Emerging
88 IanConceicao/Com2Sense-Challenge

Applying natural language processing for common sense evaluation.

+7 17 Experimental
89 tasketh/tasketh

tasketh is a simple discord bot that lets moderators assign, and users claim tasks.

+7 32 Emerging
90 RohitMurali18/Music-Generation-Emotion-Adaptive

This project implements an Emotion-Aware Music Generator (EAMG) that turns...

+7 11 Experimental
91 OpenMOSS/CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

+7 45 Emerging
92 jwbay/misc-ts-transformers

Miscellaneous TypeScript transformers

+7 17 Experimental
93 huggingface/llm_training_handbook

An open collection of methodologies to help with successful training of...

+7 41 Emerging
94 claw1200/llama-cord

Discord App for Interacting with local Ollama Models. Multiple Agents Supported!

+7 19 Experimental
95 HandsOnLLM/Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

+7 57 Established
96 hexuandeng/DRPruning

Implementation for our paper “DRPruning: Efficient Large Language Model...

+7 30 Emerging
97 spongedsc/pathways

Pathways: multi-modal AI/ML models on discord

+7 19 Experimental
98 atomlayer/llamachan

llamachan is a project that realises the idea of a dead internet for an imageboard

+7 19 Experimental
99 Spectrewolf8/PHi-3-SQL-generation-fine-tune-experiment

A fine-tuned version of Phi-3-mini-4k-instruct for generating SQL queries...

+7 20 Experimental
100 JaspreetSingh-exe/Music-Genre-Classification

This project builds a Music Genre Classification System using SVM, CNN,...

+7 17 Experimental