All Transformer Models
7,795 models ranked by quality score · Page 27 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2601 |
arshadshk/Last_Query_Transformer_RNN-PyTorch
Implementation of the paper "Last Query Transformer RNN for knowledge... |
|
Emerging |
| 2602 |
HiThink-Research/BizFinBench
A Business-Driven Real-World Financial Benchmark for Evaluating LLMs |
|
Emerging |
| 2603 |
katha-ai/EmoTx-CVPR2023
[CVPR 2023] Official code repository for "How you feelin'? Learning Emotions... |
|
Emerging |
| 2604 |
Mmorgan-ML/Phase-Slip-Sampler
Phase-Slip is a stochastic intervention architecture that operates on the... |
|
Emerging |
| 2605 |
varchasvee108/vision-transformer-maze-agent
Vision Transformer agent that learns to navigate mazes while visualizing... |
|
Emerging |
| 2606 |
asiff00/Bangla-Llama
Fine tuned llama 3 models for context based question answering in bengali language. |
|
Emerging |
| 2607 |
ai-art-dev99/llm-from-scratch
Build a Large Language Model From Scratch |
|
Emerging |
| 2608 |
xiaoachen98/Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT. |
|
Emerging |
| 2609 |
catherinesyeh/story-viz
Reimagining storyline visualizations with LLMs (VIS 2025) |
|
Emerging |
| 2610 |
prateekralhan/Deep-Question-Answering-System
A deep learning based Q&A system built using RoBerTa model from huggingface... |
|
Emerging |
| 2611 |
laclouis5/uform-coreml-converters
CLI for converting UForm models to CoreML. |
|
Emerging |
| 2612 |
conceptofmind/PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling... |
|
Emerging |
| 2613 |
patricia-pereira/cd-erc
Code for the paper: Context-Dependent Embedding Utterance Representations... |
|
Emerging |
| 2614 |
john-osborne-j/quantized-clinicalbert
This repository contains a 4-bit quantized ClinicalBERT model for disease... |
|
Emerging |
| 2615 |
Katashynskyi/Voice_assistant_UA_EN
No api-keys | local | llama3.1 For language studying and live translation |
|
Emerging |
| 2616 |
maxxxzdn/erwin
Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical... |
|
Emerging |
| 2617 |
ropensci/pangoling
An R package for estimating the log-probabilities of words in a given... |
|
Emerging |
| 2618 |
NC0DER/GreekT5
A series of Greek News Summarization Sequence-to-Sequence Models built with... |
|
Emerging |
| 2619 |
ASK-03/Reverse-Chain
Implementation of paper - Reverse Chain: A Generic-Rule for LLMs to Master... |
|
Emerging |
| 2620 |
vcanchik/robotmem
Robot memory |
|
Emerging |
| 2621 |
asiff00/Bengali-Sentence-Error-Correction
Fine-tune mBart 50 for Bengali Sentence Error Correction |
|
Emerging |
| 2622 |
dsdanielpark/hf-transllm
LLMtranslator translates and generates text in multiple languages. |
|
Emerging |
| 2623 |
RaptorMai/MLLM-CompBench
[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs... |
|
Emerging |
| 2624 |
PRITHIVSAKTHIUR/Nvidia-Cosmos-Reason1-Demo
Physical AI models understand physical common sense and generate appropriate... |
|
Emerging |
| 2625 |
Merterm/Modeling-Intensification-for-SLG
Public repo for the paper: "Modeling Intensification for Sign Language... |
|
Emerging |
| 2626 |
SCRN-VRC/Language-Translation-with-Fragment-Shaders
EN to JP and JP to EN with transformer models |
|
Emerging |
| 2627 |
Qwen-Applications/CLIPO
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR |
|
Emerging |
| 2628 |
Curated-Awesome-Lists/Awesome-Llama3
A curated, awesome list of resources, tools, and projects for the AI Large... |
|
Emerging |
| 2629 |
bobazooba/shurale
Conversation AI model for open domain dialogs |
|
Emerging |
| 2630 |
ryokamoi/llm-self-correction-papers
List of papers on Self-Correction of LLMs. |
|
Emerging |
| 2631 |
KhaledSharif/robot-transformers
Train and evaluate an Action Chunking Transformer (ACT) to perform... |
|
Emerging |
| 2632 |
curtisgray/wingman
Wingman is the fastest and easiest way to run Llama models on your PC or Mac. |
|
Emerging |
| 2633 |
ItzDerock/llama-playground
A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM. |
|
Emerging |
| 2634 |
avatsaev/av-local-llm-api
Allows to easily run local REST API with a custom LLM, running locally or... |
|
Emerging |
| 2635 |
baldoarbol/BodyShapeGPT
Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions... |
|
Emerging |
| 2636 |
akshat0123/GPT-1
Pytorch implementation of GPT-1 |
|
Emerging |
| 2637 |
azzeddineCH/flash-nanoGPT
Jax/Flax re-write of @karpathy 🐐 NanoGPT using some of the common Jax... |
|
Emerging |
| 2638 |
longyuewangdcu/Chinese-Llama-2
improve Llama-2's proficiency in comprehension, generation, and translation... |
|
Emerging |
| 2639 |
tongnie/ImputeFormer
[KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for... |
|
Emerging |
| 2640 |
bentoml/transformers-nlp-service
Online Inference API for NLP Transformer models - summarization, text... |
|
Emerging |
| 2641 |
JinXins/Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. |
|
Emerging |
| 2642 |
ntropy-network/enrichment_models
This repository benchmark Ntropy API against different Large Language Models... |
|
Emerging |
| 2643 |
AhmetZamanis/DeepLearningEnergyForecasting
Time series forecasting on an hourly energy dataset, with LSTM & Transformer... |
|
Emerging |
| 2644 |
codeastra2/llm-feat
Automated feature engineering using Large Language Models (LLMs) for tabular data |
|
Emerging |
| 2645 |
naity/finetune-esm
Scalable Protein Language Model Finetuning with Distributed Learning and... |
|
Emerging |
| 2646 |
vmarinowski/infini-attention
An unofficial pytorch implementation of 'Efficient Infinite Context... |
|
Emerging |
| 2647 |
ImplicitLayer/multiagent_environments
Envirionments for NLP multiagent tasks |
|
Emerging |
| 2648 |
liziniu/policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) |
|
Emerging |
| 2649 |
JiauZhang/nnm
Neural Network Models |
|
Emerging |
| 2650 |
Relaxed-System-Lab/HexGen
[ICML 2024] Serving LLMs on heterogeneous decentralized clusters. |
|
Emerging |
| 2651 |
Koziev/LM-pretrain
Char-level language model pretraining code and scripts |
|
Emerging |
| 2652 |
Utshav-paudel/LLM-Zero-to-Hero
This repo contains the resources, projects and documentation of mine while... |
|
Emerging |
| 2653 |
prajjwal1/generalize_lm_nli
Code for the paper EMNLP 2021 workshop paper "Generalization in NLI: Ways... |
|
Emerging |
| 2654 |
crscardellino/argumentation-mining-transformers
Argumentation Mining Transformers Module (AMTM) implementation. |
|
Emerging |
| 2655 |
Basel-anaya/LoreWeaver
LoreWeaver is a Novel Generation Multimodal LLM based on Mistral 7B LLM |
|
Emerging |
| 2656 |
yuchen0515/2022-Competition-CUDAOutOfMemory
Our team placed 6th out of 119 teams in E.SUN AI Open Competition Summer... |
|
Emerging |
| 2657 |
lazy-guy/chess-llama
Tiny Llama model trained to play chess |
|
Emerging |
| 2658 |
yyDing1/GNER
[ACL 2024 Findings] Code implementation of Paper "Rethinking Negative... |
|
Emerging |
| 2659 |
misko/spf
Signal Processing Fun (in the sun) |
|
Emerging |
| 2660 |
j-webtek/Local-LLM_FineTune
Finetune Your Local LLM |
|
Emerging |
| 2661 |
muna-ai/muna-predictors
Interesting Python functions compiled to run anywhere with Muna. |
|
Emerging |
| 2662 |
jshuadvd/LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2... |
|
Emerging |
| 2663 |
jordddan/Pruning-LLMs
The framework to prune LLMs to any size and any config. |
|
Emerging |
| 2664 |
makllama/makllama
MaK(Mac+Kubernetes)llama - Running and orchestrating large language models... |
|
Emerging |
| 2665 |
SciCrunch/bio_electra
Bio-Electra - Small and efficient discriminatively pre-trained language... |
|
Emerging |
| 2666 |
Giyanellow/llama-chatbot-with-ui
This project provides a comprehensive template for self-hosting a Large... |
|
Emerging |
| 2667 |
Aradhye2002/selective-peft-toolkit
Official implementation of the paper "Step-by-Step Unmasking for... |
|
Emerging |
| 2668 |
shinomakoi/magi_llm_gui
A Qt GUI for large language models |
|
Emerging |
| 2669 |
wassemgtk/llm.scala
Extensible implementation of a Language Model (LLM) training framework in Scala. |
|
Emerging |
| 2670 |
koudounasalkis/CLUES
This repo contains the code for "A Contrastive Learning Approach to Mitigate... |
|
Emerging |
| 2671 |
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models... |
|
Emerging |
| 2672 |
tripathiarpan20/self-improvement-4all
Private self-improvement coaching with open-source LLMs |
|
Emerging |
| 2673 |
tenghuilee/ScalingCapFusedVisionLM
number of tokens <=> performance to a vision language model |
|
Emerging |
| 2674 |
swapUniba/LaikaLLM
A hub for training and evaluating LLMs, following the multitask paradigm, in... |
|
Emerging |
| 2675 |
xmed-lab/TAM
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs |
|
Emerging |
| 2676 |
cui-shaobo/defeasibility-in-causality
exploring the defeasibility inside causality |
|
Emerging |
| 2677 |
qiqiApink/MotionGPT
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs... |
|
Emerging |
| 2678 |
just-ctrlC-ctrlV/Mechanical-Assistant
Imagine a world where your mechanical tasks are streamlined and optimized by... |
|
Emerging |
| 2679 |
alan-turing-institute/prompto
An open source library for asynchronous querying of LLM endpoints |
|
Emerging |
| 2680 |
ai4sd/multiscale-byte-lm
A hierarchical LM that scales to training on context windows of +5M tokens |
|
Emerging |
| 2681 |
cleopatra-itn/claim_detection
Code for tasks in the paper "Check\_square at CheckThat! 2020: Claim... |
|
Emerging |
| 2682 |
kyegomez/Open-NAMM
An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY" |
|
Emerging |
| 2683 |
VidhyaVarshanyJS/EnsembleX
EnsembleX utilizes the Knapsack algorithm to optimize Large Language Model... |
|
Emerging |
| 2684 |
ziansu/codeart
Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention... |
|
Emerging |
| 2685 |
lrusso/llama3pure
Three inference engines for Llama 3: pure C for desktop systems, pure... |
|
Emerging |
| 2686 |
IParraMartin/An-Explanation-Is-All-You-Need
The original transformer implementation from scratch. It contains... |
|
Emerging |
| 2687 |
nlp-uoregon/Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with... |
|
Emerging |
| 2688 |
hplt-project/monolingual-multilingual-instruction-tuning
Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca |
|
Emerging |
| 2689 |
codefuse-ai/GALLa
[ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding |
|
Emerging |
| 2690 |
Orion-AI-Lab/televit
Teleconnection-driven vision transformers for improved long-term forecasting |
|
Emerging |
| 2691 |
HenryCai11/LLM-Self-Control
The official repo of paper "Self-Control of LLM Behaviors by Compressing... |
|
Emerging |
| 2692 |
M4TH1EU/llama-assist
Manage your smart home in Home Assistant with local LLMs running with llama.cpp |
|
Emerging |
| 2693 |
AshutoshDongare/softskill-NER
Fine tuning 🤗 transformer model for softskill NER task |
|
Emerging |
| 2694 |
camelop/NLP-Robustness
OOD Generalization and Detection (ACL 2020) |
|
Emerging |
| 2695 |
zeroxt32/Forex-Expert-Advisor-Python
Forex Bot Agents Using Machine Learning Implementations. Custom Forex Environments |
|
Emerging |
| 2696 |
nghiempt/llm-analysis-privacy-policy
Unveiling Discrepancies in Android App Data Safety Declarations and Privacy... |
|
Emerging |
| 2697 |
vipulraheja/coedit
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific... |
|
Emerging |
| 2698 |
yfedoseev/llmkit
Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,... |
|
Emerging |
| 2699 |
ivanovitchm/PPGEEC2318
Repository for EEC2318, a graduate course on PPgEEC about Machine Learning |
|
Emerging |
| 2700 |
TamSiuhin/LLM-UM-Reading
A list of large language models for user modeling (LLM-UM) papers, based on... |
|
Emerging |