All Transformer Models
7,795 models ranked by quality score · Page 25 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2401 | deterministic-algorithms-lab/NLP-Journey<br>This repository provides a selection of very basic and minimal notebooks for... | | Emerging |
| 2402 | SORRY-Bench/sorry-bench<br>Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large... | | Emerging |
| 2403 | augustwester/transformer-xl<br>A lightweight PyTorch implementation of the Transformer-XL architecture... | | Emerging |
| 2404 | benct/kotlin-cheat-sheet<br>:star: Kotlin <3 Cheat Sheet, Collection Extension Functions and General Examples | | Emerging |
| 2405 | akanyaani/miniLLAMA<br>A simplified LLAMA implementation for training and inference tasks. | | Emerging |
| 2406 | AndrewBoessen/PerfectRep<br>PerfectRep is a 3D pose estimation model tailored specifically for... | | Emerging |
| 2407 | UBC-NLP/marbert<br>UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic | | Emerging |
| 2408 | allenai/x-lxmert<br>PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer... | | Emerging |
| 2409 | YJiangcm/LTE<br>[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing | | Emerging |
| 2410 | maifeeulasad/LocalLLaMA<br>📚 LocalLLaMA Archive: Community-powered static archive for r/LocalLLaMA | | Emerging |
| 2411 | Uralstech/vid-orca<br>Deploy LLaMA-2 Chat on Google Cloud. | | Emerging |
| 2412 | detsutut/ama-bot<br>A modern and lightweight NLP interface for Question-Answering systems and... | | Emerging |
| 2413 | kurakurai/Luth<br>Luth is a state-of-the-art series of fine-tuned LLMs for French | | Emerging |
| 2414 | asaddi/YALLM-LlamaVision<br>A set of nodes for basic Llama 3.2 Vision support in ComfyUI | | Emerging |
| 2415 | loretoparisi/bert_text_classifier<br>Text Classification with BERT | | Emerging |
| 2416 | datawhalechina/unlock-hf<br>Unlock the many uses of the HuggingFace ecosystem | | Emerging |
| 2417 | SalesforceAIResearch/Elastic-Reasoning<br>Make reasoning models scalable | | Emerging |
| 2418 | CogitatorTech/zigformer<br>An educational transformer-based LLM in pure Zig | | Emerging |
| 2419 | peacelwh/VT-FSL<br>[NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning | | Emerging |
| 2420 | luffycodes/Tutorbot-Spock<br>An Education Tutoring Chatbot based on Learning Science Principles powered... | | Emerging |
| 2421 | liuyang-ict/SAP-DETR<br>[CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between... | | Emerging |
| 2422 | SapienzaNLP/ita-bench<br>A collection of Italian benchmarks for LLM evaluation | | Emerging |
| 2423 | fajri91/sum_liputan6<br>The first large-scale summarization corpus for the Indonesian language. AACL 2020. | | Emerging |
| 2424 | nsi319/Finetune-Transformers<br>Abstractive text summarization by fine-tuning seq2seq models. | | Emerging |
| 2425 | remixer-dec/botality-ii<br>telegram bot for self-hosted local inference of stable diffusion,... | | Emerging |
| 2426 | waltonfuture/InstructionGPT-4<br>InstructionGPT-4 | | Emerging |
| 2427 | LorenzoAgnolucci/BERT_for_ABSA<br>In this work (Targeted) Aspect-Based Sentiment Analysis task is converted to... | | Emerging |
| 2428 | sayakpaul/deploy-hf-tf-vision-models<br>This repository shows various ways of deploying a vision model (TensorFlow)... | | Emerging |
| 2429 | seongminp/transformers-into-vaes<br>Code for "Finetuning Pretrained Transformers into Variational Autoencoders" | | Emerging |
| 2430 | FareedKhan-dev/Understanding-Transformers-Step-by-Step-math-example<br>Understanding Large Language Transformer Architecture like a child | | Emerging |
| 2431 | babycommando/neuralgraffiti<br>Live-bending a foundation model’s output at neural network level. | | Emerging |
| 2432 | andreiramani/jadi4llamacpp<br>Just another drop in for llama.cpp | | Emerging |
| 2433 | ejurasek00/Hashing_LLM_Debiasing<br>Repository consisting of the files used in the experiments + brief... | | Emerging |
| 2434 | shamspias/Transformers-and-Large-Language-Models-From-Basics-to-Frontier-Research<br>Dive into the transformative world of NLP with this guide on Transformers.... | | Emerging |
| 2435 | ramalamadingdong/onnx-rubikpi<br>ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B | | Emerging |
| 2436 | apple/ml-interspeech2022-phi_rtn<br>Repository accompanying the Interspeech 2022 publication titled... | | Emerging |
| 2437 | templetwo/PhaseGPT<br>Kuramoto Phase-Coupled Oscillator Attention in Transformers | | Emerging |
| 2438 | codyjk/ChessGPT<br>♟️ A transformer that plays chess 🤖 | | Emerging |
| 2439 | tmcarmichael/fabricai-inference-server<br>A hackable, modular, containerized inference server for deploying large... | | Emerging |
| 2440 | skjp/spout<br>Workspace Repo for Synergistic Plugins Optimizing Usability of Transformers (Spout) | | Emerging |
| 2441 | BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models<br>This repository contains the source code and synthetic datasets used in the... | | Emerging |
| 2442 | kiyoshisasano/llm-failure-atlas<br>A graph-based failure modeling and deterministic detection system for LLM... | | Emerging |
| 2443 | StarxSky/ANE-GPT-New<br>New ANE GPT | | Emerging |
| 2444 | BrightBlueCheese/transformers_and_chemistry<br>The Role of Model Architecture and Scale in Predicting Molecular Properties:... | | Emerging |
| 2445 | zzteam-rccup-2024/aurora-echo<br>We propose a new feedback system, named Aurora Echo, which provides... | | Emerging |
| 2446 | chris-santiago/met<br>Reproducing the MET framework with PyTorch | | Emerging |
| 2447 | SapienzaNLP/MaTESe<br>MaTESe: Machine Translation Evaluation as a Sequence Tagging Problem | | Emerging |
| 2448 | codewithdark-git/llama-3-Hackathon<br>LLaMA Genius is an AI-powered research assistant designed to help users... | | Emerging |
| 2449 | winstxnhdw/llm-api<br>A fast CPU-based API for Qwen 2.5 using CTranslate2, hosted on Hugging Face Spaces. | | Emerging |
| 2450 | Bengal1/Simple-Transformer<br>An introductory guide and practical showcase of the Transformer model. | | Emerging |
| 2451 | hqhq1025/ai-course-notes<br>📚 220+ Chinese-language lecture-note PDFs for open AI/LLM courses \| Stanford CS336·CS224R·CS25·CS231N \| Berkeley... | | Emerging |
| 2452 | DrejcPesjak/scaling-monosemanticity-llama<br>Reproducing Scaling Monosemanticity: Extracting Interpretable Features from... | | Emerging |
| 2453 | NotYuSheng/DialogSmith<br>Fine-tune an LLM on your Telegram chats to replicate your writing style... | | Emerging |
| 2454 | VisioSphereAI/labelvim<br>This is a python based standalone image annotation tool designed for tasks... | | Emerging |
| 2455 | tsinghua-fib-lab/UniST<br>Official implementation for "UniST: A Prompt-Empowered Universal Model for... | | Emerging |
| 2456 | EdvardOlsen/Horoscope_generator<br>This is a horoscope generating code | | Emerging |
| 2457 | liux2/Langchain-LLM-Config<br>Langchain LLM config adapters | | Emerging |
| 2458 | ppijbb/NaturalLanguageProcessing<br>natural language processing notebooks | | Emerging |
| 2459 | CYFARE/PDXTRACT<br>Extract From PDFs Using Ollama Local LLM | | Emerging |
| 2460 | KCLabMTU/LMCrot<br>Protein Language Model (pLM) Powered Protein Crotonylation (Kcr) Modified... | | Emerging |
| 2461 | TamSiuhin/P2P<br>source code for "Instant Personalized Large Language Model Adaptation via... | | Emerging |
| 2462 | jrazi/llm-assignments-fall-2023<br>My assignment submissions for the Fall 2023 Large Language Models course at... | | Emerging |
| 2463 | MichiganNLP/Scalable-VLM-Probing<br>Probe Vision-Language Models | | Emerging |
| 2464 | ES7/Introduction-to-LLMs<br>In this repository I have explained the application of Large Language Models... | | Emerging |
| 2465 | cgxjdzz/FeatureForge-LLM<br>FeatureForge LLM is a Python package that leverages large language models... | | Emerging |
| 2466 | zealscott/AutoProfiler<br>Source code for Automated Profile Inference with Language Model Agents | | Emerging |
| 2467 | mrkorzun/Multi-AI-Telegram-Bot<br>Multi-model Telegram bot (aiogram v3) with OpenRouter model picker (Llama,... | | Emerging |
| 2468 | yangjianxin1/Firefly-LLaMA2-Chinese<br>Firefly Chinese LLaMA-2 large language model, supporting continued pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, Intern... | | Emerging |
| 2469 | mary-lev/llm-ocr<br>LLM-powered OCR evaluation and correction package that supports multiple... | | Emerging |
| 2470 | KhoiDOO/vitvqganvae<br>Benchmark for Evaluating Data Reconstruction using Vector Quantization | | Emerging |
| 2471 | fualsan/TransformerFromScratch<br>PyTorch Implementation of Transformer Deep Learning Model | | Emerging |
| 2472 | askblocks/askblocks-core<br>LLM API backend for Askblocks Q&A widget system. | | Emerging |
| 2473 | alphasecio/groq<br>A Streamlit chatbot with memory for running open-source text models on Groq. | | Emerging |
| 2474 | Arezkiiiii/mini_llm<br>🚀 Build and understand a Large Language Model from scratch using PyTorch... | | Emerging |
| 2475 | huangjia2019/llm-in-action<br>LLM Examples | | Emerging |
| 2476 | OpenMLRL/LLM_Collab_Code_Generation<br>LLM Collaboration for Code Generation | | Emerging |
| 2477 | Alba-Intelligence/GraphMERT.jl<br>Julia implementation of the GraphMERT algorithm (Arxiv 2510.09580) | | Emerging |
| 2478 | EKarton/English-French-Translator<br>A web application that translates sentences between English and French | | Emerging |
| 2479 | robert-mcdermott/ollama-batch-cluster<br>Large Scale Batch Processing with Ollama | | Emerging |
| 2480 | wangxiao5791509/MultiModal_BigModels_Survey<br>[MIR-2023-Survey] A continuously updated paper list for multi-modal... | | Emerging |
| 2481 | Flagro/OmniModKit<br>Multimodal LLM toolkit | | Emerging |
| 2482 | pmady/llmops<br>🚀 The Ultimate Curated List of LLMOps Tools, Frameworks, and Resources - A... | | Emerging |
| 2483 | jpwahle/emnlp23-paraphrase-types<br>The official implementation of the EMNLP 2023 paper "Paraphrase Types for... | | Emerging |
| 2484 | anshumansinha3301/Article-on-Large-Language-Models<br>This article provides an in-depth examination of LLMs, explaining... | | Emerging |
| 2485 | Abhishek6353/AllMiniLML6V2-coreml<br>CoreML conversion of all-MiniLM-L6-v2 with a full SwiftUI demo, tokenizer... | | Emerging |
| 2486 | Shivanshu-Gupta/in-context-learning<br>Easy in-context learning experiments with a variety of datasets, LLMs, and... | | Emerging |
| 2487 | yueliu1999/FlipAttack<br>[ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs... | | Emerging |
| 2488 | paulalesius/llmath<br>Large Language Math - The Mathematics of LLM Foundational Models - For Beginners | | Emerging |
| 2489 | dengls24/LLM-para<br>Analyze LLM inference: FLOPs, memory, Roofline model. Supports GQA, MoE,... | | Emerging |
| 2490 | yao8839836/kg-llm<br>Exploring large language models for knowledge graph completion. ICASSP 2025 | | Emerging |
| 2491 | RJain12/choformer<br>Cho codon optimization WIP | | Emerging |
| 2492 | ola-krutrim/Chitrarth<br>Chitrarth: Bridging Vision and Language for a Billion People | | Emerging |
| 2493 | mikecvet/beam<br>LLM Beam Search Example Implementation | | Emerging |
| 2494 | CharlesYuan02/eve-bot<br>A Discord bot I created in Python. Her name is Eve. | | Emerging |
| 2495 | Living-with-machines/genre-classification<br>Jupyter book showing how to build an ML powered book genre classifier | | Emerging |
| 2496 | kazemihabib/Mitigating-Reasoning-LLM-Social-Bias<br>A novel approach to mitigating social bias in Large Language Models through... | | Emerging |
| 2497 | tasketh/tasketh<br>tasketh is a simple discord bot that lets moderators assign, and users claim tasks. | | Emerging |
| 2498 | pittisl/GreenTrainer<br>Code for paper "Towards Green AI in Fine-tuning Large Language Models via... | | Emerging |
| 2499 | gsarti/lcl23-xnlm-lab<br>Materials for the Lab "Explaining Neural Language Models from Internal... | | Emerging |
| 2500 | kyegomez/Qwen-VL<br>My personal implementation of the model from "Qwen-VL: A Frontier Large... | | Emerging |