All Transformer Models
7,795 models ranked by quality score · Page 28 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 2701 |
aigc-apps/PertEval
[NeurIPS '24 Spotlight] PertEval: Unveiling Real Knowledge Capacity of LLMs... |
|
Emerging |
| 2702 |
DomHudson/bert-in-production
A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )... |
|
Emerging |
| 2703 |
discountry/forever-chat
chatgpt with forever memory! |
|
Emerging |
| 2704 |
titanml/takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment... |
|
Emerging |
| 2705 |
Agora-Lab-AI/HydraNet
HydraNet is a state-of-the-art transformer architecture that combines... |
|
Emerging |
| 2706 |
yangjianxin1/LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently |
|
Emerging |
| 2707 |
zzz47zzz/codebase-for-incremental-learning-with-llm
[ACL2024] A Codebase for Incremental Learning with Large Language Models;... |
|
Emerging |
| 2708 |
elijahnzeli1/CausalTorch
CausalTorch is a PyTorch library for building generative models with... |
|
Emerging |
| 2709 |
ryoungj/ObsScaling
[NeurIPS'24 Spotlight] Observational Scaling Laws |
|
Emerging |
| 2710 |
ant-louis/belgpt2
🇧🇪 BelGPT-2: the 1st GPT model pretrained in French. |
|
Emerging |
| 2711 |
Yifan-Song793/ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents... |
|
Emerging |
| 2712 |
euclaise/SlimTrainer
Full finetuning of large language models without large memory requirements |
|
Emerging |
| 2713 |
hao-ai-lab/d3LLM
d3LLM: Ultra-Fast Diffusion LLM 🚀 |
|
Emerging |
| 2714 |
Agora-Lab-AI/OmniByteGPT
An implementation of an all-new foundation model architecture that trains on... |
|
Emerging |
| 2715 |
w1bb/ATE
A server application that provides the user answers to trivia-like questions. |
|
Emerging |
| 2716 |
Shaurya-Sethi/transqlate
End-to-end natural language to SQL system: schema-aware model fine-tuning,... |
|
Emerging |
| 2717 |
ChaitanyaK77/Optimal-Detection-of-Diabetic-Retinopathy-Severity-Using-Attention-Based-CNN-and-Vision-Transformers
This repository contains the implementation of a hybrid model combining... |
|
Emerging |
| 2718 |
Iteranya/AktivaAI
Local LLM Discord Bot |
|
Emerging |
| 2719 |
JunyiYe/FaultyMathProblem
From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity... |
|
Emerging |
| 2720 |
NiuTrans/Introduction-to-Transformers
An introduction to basic concepts of Transformers and key techniques of... |
|
Emerging |
| 2721 |
abdur75648/MedicalGPT
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality) |
|
Emerging |
| 2722 |
bpevangelista/vfastml
Inference and Training Engine for LLMs, Image2Image and Other Models |
|
Emerging |
| 2723 |
py-lama/weblama
A web-based Markdown editor with syntax highlighting, Mermaid diagram... |
|
Emerging |
| 2724 |
jiaowoguanren0615/DLinear
This is a warehouse for DLinear-Pytorch-model, can be used to train your... |
|
Emerging |
| 2725 |
EternityYW/RUPBench
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness... |
|
Emerging |
| 2726 |
serp-ai/LLaMA-8bit-LoRA
Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on... |
|
Emerging |
| 2727 |
VirtualRoyalty/gan-plus-nlp
Generative adversarial approach to most popular NLP tasks |
|
Emerging |
| 2728 |
FudanDISC/ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs) |
|
Emerging |
| 2729 |
KevinLee1110/dynamic-batching
The official repo for the paper "Optimizing LLM Inference Throughput via... |
|
Emerging |
| 2730 |
ApocryphalEditor/SRM-mapping-framework
A framework for mapping the internal geometry of transformer representations... |
|
Emerging |
| 2731 |
LMOS-IO/ALMoAPI
ALMoAPI, Agentic Language Model API, is a fork of tabbyAPI, designed to... |
|
Emerging |
| 2732 |
danieloquelis/natural-language-git
Offline LLM-powered Git CLI tool. NLGit interprets your natural language... |
|
Emerging |
| 2733 |
NeurAI-Lab/MT-SfMLearner
Official code for 'Transformers in Unsupervised Structure-from-Motion' and... |
|
Emerging |
| 2734 |
shikiw/Modality-Integration-Rate
[ICCV 2025] The official code of the paper "Deciphering Cross-Modal... |
|
Emerging |
| 2735 |
ImplicitLayer/agents_nlp
Agents for solving NLP problems |
|
Emerging |
| 2736 |
AJAkil/LLMalMorph
This repository contain the tool LLMalMorph, a semi automated tool that... |
|
Emerging |
| 2737 |
kyegomez/MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper:... |
|
Emerging |
| 2738 |
Roboflow-Universe/finetune-RF-DETR
Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on... |
|
Emerging |
| 2739 |
LikithMeruvu/Gemma2B_Finetuning_Medium
This Repo contains How to Finetune Google's New Gemma LLm model using your... |
|
Emerging |
| 2740 |
JarvisPei/FuseGPT
The implementation for the paper, FuseGPT: Learnable Layers Fusion of... |
|
Emerging |
| 2741 |
graphcore-research/jax-scalify
JAX Scalify: end-to-end scaled arithmetics |
|
Emerging |
| 2742 |
XCollab/HuggingFace
This repository provides an overview of Hugging Face's Transformers library,... |
|
Emerging |
| 2743 |
MartinaHutter/yaskawa-voice-commands
NLP for yaskawa robot |
|
Emerging |
| 2744 |
Pranav-here/agentic-ai-chatbot
This project is a modular AI chatbot framework that allows dynamic... |
|
Emerging |
| 2745 |
surrey-nlp/LLM4MT_eval
This repository is for our paper "What do large language model need for... |
|
Emerging |
| 2746 |
FranxYao/FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models... |
|
Emerging |
| 2747 |
smpanaro/coreml-llm-cli
CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. |
|
Emerging |
| 2748 |
xmindflow/deformableLKA
[WACV 2024] Beyond Self-Attention: Deformable Large Kernel Attention for... |
|
Emerging |
| 2749 |
nrl-ai/CustomChar
Your customized AI assistant - Personal assistants on any hardware! With... |
|
Emerging |
| 2750 |
yihong1120/Llama2-Telegram-Bot
Integration of the advanced llama2 AI model with Telegram to provide... |
|
Emerging |
| 2751 |
Arman176001/Oxidize
⚙️ Oxidize: A Python-to-Rust code translator to boost performance, safety,... |
|
Emerging |
| 2752 |
techthoughts2/pwshBedrock
pwshBedrock is a PowerShell module designed to simplify interaction with... |
|
Emerging |
| 2753 |
marqinhos/MedicalLiverSegmentationToolKit
Medical Toolkit for Liver Volume Segmentation |
|
Emerging |
| 2754 |
jaabmar/cp_fuse
Implementation for the paper "Copyright-Protected Language Generation via... |
|
Emerging |
| 2755 |
QwenLM/PolyMath
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:... |
|
Emerging |
| 2756 |
OpenMOSS/LongLLaDA
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs |
|
Emerging |
| 2757 |
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for... |
|
Emerging |
| 2758 |
kyegomez/primus
A multimodal foundation model for humanoid robotics that integrates multiple... |
|
Emerging |
| 2759 |
mrcabbage972/simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers |
|
Emerging |
| 2760 |
AGI-Edgerunners/LLM-Optimizers-Papers
Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic... |
|
Emerging |
| 2761 |
jwergieluk/revllm
RevLLM -- Reverse Engineering Tools for Large Language Models |
|
Emerging |
| 2762 |
harleyszhang/llm_counts
llm theoretical performance analysis tools and support params, flops, memory... |
|
Emerging |
| 2763 |
pdaicode/awesome-LLMs-finetuning
Collection of resources for finetuning Large Language Models (LLMs). |
|
Emerging |
| 2764 |
dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving
Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... |
|
Emerging |
| 2765 |
Joyce94/LLM-RLHF-Tuning
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA) |
|
Emerging |
| 2766 |
kryptomrx/tonl-mcp-bridge
Reduce LLM token costs by 30-60% with TONL format. TypeScript library & CLI... |
|
Emerging |
| 2767 |
RahulSChand/llama2.c-for-dummies
Step by step explanation/tutorial of llama2.c |
|
Emerging |
| 2768 |
cutec-chris/matrix-llm-bot
An Bot wich can use most of Large Language Models |
|
Emerging |
| 2769 |
hem9984/Dataset-label
This will allow you to choose your labels, and then label every image in a... |
|
Emerging |
| 2770 |
iboing/CorDA
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models... |
|
Emerging |
| 2771 |
apollosoldier/Advanced-Classifier
The Advanced Classification Model is a deep learning-based approach for... |
|
Emerging |
| 2772 |
ynes99/BraTS_Segmentation
Segmentation of brain tumors (Glioma) in MRIs using Meta's model SAM... |
|
Emerging |
| 2773 |
yinizhilian/ICLR2025-Papers-with-Code
历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025. |
|
Emerging |
| 2774 |
dmis-lab/Outlier-Safe-Pre-Training
[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large... |
|
Emerging |
| 2775 |
rajatrayaraddi/rul-prediction-bilstm-cnn
A BiLSTM-CNN hybrid model with attention for predicting remaining useful life (RUL) |
|
Emerging |
| 2776 |
waltonfuture/Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models |
|
Emerging |
| 2777 |
MoleculeTransformers/moleculenet-smiles-bert-mixup
Training pre-trained BERT language model on molecular SMILES from the... |
|
Emerging |
| 2778 |
bhanuprathap2000/sign-language-recognition
This repo contains the code for sign-language-recognition as part of our... |
|
Emerging |
| 2779 |
cosmic-heart/Benetech-Chart-Derendering
Benetech Kaggle Competition Work. Fine Tuning Matcha (Multi Modal... |
|
Emerging |
| 2780 |
garyb9/pytorch-transformers
Transformers architecture code playground repository in python using PyTorch. |
|
Emerging |
| 2781 |
sitammeur/qwen2.5-web
Qwen2.5 Instruct, large language model, operates within web browsers via 🤗... |
|
Emerging |
| 2782 |
telekom/llm_evaluation_results
LLM evaluation results |
|
Emerging |
| 2783 |
shhossain/BanglaTranslationKit
BanglaTranslationKit is a open-source translation package for offline... |
|
Emerging |
| 2784 |
Nikshaan/llm-from-scratch
Implementation of build a LLM from scratch by Sebastian Raschka. |
|
Emerging |
| 2785 |
fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI
This research examines the performance of Large Language Models (GPT-3.5... |
|
Emerging |
| 2786 |
D-Roberts/transformers-retrieval-ranking-nli-ECIR2021
Multilingual retrieval, ranking and natural language inference with... |
|
Emerging |
| 2787 |
upunaprosk/quantized-lm-confidence
Code for NAACL paper When Quantization Affects Confidence of Large Language Models? |
|
Emerging |
| 2788 |
ilias-ant/toxic-spans-detection
An attempt at SemEval 2021 Task 5: Toxic Spans Detection. |
|
Emerging |
| 2789 |
matteomedioli/BERT-KG
Enriching Language Models Representations via Knowledge Graphs Regularisation |
|
Emerging |
| 2790 |
Nickil21/weakly-supervised-parsing
Official Code for our Findings of ACL 2022 paper: Co-training an... |
|
Emerging |
| 2791 |
toriving/haafor-challenge-2020
The project for HAAFOR CHALLENGE 2020 |
|
Emerging |
| 2792 |
stevezheng23/fewshot_nlp_pt
Few-shot NLP in PyTorch |
|
Emerging |
| 2793 |
HLTCHKUST/VG-GPLMs
The code repository for EMNLP 2021 paper "Vision Guided Generative... |
|
Emerging |
| 2794 |
mlane/llm-getting-started
Practical, beginner-friendly LLM projects using Python, LangChain, and... |
|
Emerging |
| 2795 |
mtanghu/LEAP
LEAP: Linear Explainable Attention in Parallel for causal language modeling... |
|
Emerging |
| 2796 |
ParadoxZW/LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from... |
|
Emerging |
| 2797 |
cambridgeltl/sail-bli
Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL... |
|
Emerging |
| 2798 |
seonghyeonye/Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models... |
|
Emerging |
| 2799 |
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs) |
|
Emerging |
| 2800 |
Skyline-9/Visionary-Vids
Multi-modal transformer approach for natural language query based joint... |
|
Emerging |