All Transformer Models
7,795 models ranked by quality score · Page 37 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3601 |
lwch/llama2.go
Port of Facebook's LLaMA 2 model in pure go and use little memory |
|
Experimental |
| 3602 |
kuiperzone/Marklet-AI
Open Source AI Model Client |
|
Experimental |
| 3603 |
SkillichSE/Lumi-bot
A Telegram bot powered by aiogram integrated with a local LLM (LM Studio).... |
|
Experimental |
| 3604 |
cvcio/rtaa-classifier
Comments & Twitter accounts gRPC classification service. |
|
Experimental |
| 3605 |
olliverc1985/AXIOM
Lightweight Rust ML framework for training and deploying small transformer... |
|
Experimental |
| 3606 |
Lumi-node/model-garage
Open the hood on neural networks. Component-level model surgery, analysis,... |
|
Experimental |
| 3607 |
Abhinand20/MathFormer
MathFormer - Solve math equations using NLP and transformers! |
|
Experimental |
| 3608 |
neural-processing-lab/MEG-XL
Code for "MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training" |
|
Experimental |
| 3609 |
imsigma1/AI-Knowledge-Creativity
🧠 Power AI-driven tools for creative exploration and knowledge retrieval,... |
|
Experimental |
| 3610 |
xingbpshen/medical-calibration-fairness-mllm
[MICCAI 2025] The official implementation of the paper "Exposing and... |
|
Experimental |
| 3611 |
OpenDFM/HeadsUp
[ICML 2025] Codes for the paper "Heads up! Large Language Models Can Perform... |
|
Experimental |
| 3612 |
KrishnanJothi/MT5_Language_identification_NLP
MT5-small is fine-tuned on the downstream task of Natural Language... |
|
Experimental |
| 3613 |
theboringhumane/echoOLlama
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features... |
|
Experimental |
| 3614 |
bgreenwell/statlingua
Explain Statistical Output with Large Language Models |
|
Experimental |
| 3615 |
mahsaama/ViT3D-BrainTumorSegmentation
Segmentation of Brain Tumors using Vision Transformer |
|
Experimental |
| 3616 |
SreeEswaran/Train-your-LLM
This repository contains code and resources for training, fine-tuning, and... |
|
Experimental |
| 3617 |
marvelefe/vit-brain-tumor
Vision Transformer (ViT) model for brain tumour detection and classification |
|
Experimental |
| 3618 |
Kareem404/hyper-connections
A minimal implementation of Manifold-Constrained Hyper-Connections (mHC)... |
|
Experimental |
| 3619 |
LimDoHyeon/EEG-LLM
Fine-tuned LLM for electroencephalography(EEG) data classification |
|
Experimental |
| 3620 |
friendshipkim/overfill
Code for OverFill: Two-Stage Models for Efficient Language Model Decoding |
|
Experimental |
| 3621 |
mkagenius/llm-token-visualizer
See How Big Exactly A 128k Token Text Is |
|
Experimental |
| 3622 |
Curtis-Wu/Equivariant-Graph-Transformer
A deep neural network with hybrid architecture (EGNN + Transformer) for... |
|
Experimental |
| 3623 |
ys-zong/VLGuard
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision... |
|
Experimental |
| 3624 |
luo-junyu/Awesome-Data-Efficient-LLM
A list of data-efficient and data-centric LLM (Large Language Model) papers.... |
|
Experimental |
| 3625 |
gabriellst/paraphrase.ia
paraphrase.ia is a Chrome extension that let's you make paraphrases of a... |
|
Experimental |
| 3626 |
kyegomez/Open-Olmo
Unofficial open-source PyTorch implementation of the OLMo Hybrid... |
|
Experimental |
| 3627 |
Hashmat02/Fine-Tuning-LLaMA-2-for-Toxicity-Classification
Fine-tuning LLaMA 2 for toxicity classification using a balanced Kaggle... |
|
Experimental |
| 3628 |
nlkli/lachat
minimal CLI client for llama-server |
|
Experimental |
| 3629 |
elinx/safe-view
A terminal-based application for visualizing and analyzing safetensors files. |
|
Experimental |
| 3630 |
itsShnik/adaptively-finetuning-transformers
Adaptively fine tuning transformer based models for multiple domains and... |
|
Experimental |
| 3631 |
machinelearningzuu/experiments-on-large-language-models
This Repository Contains Different Experiments on LLMs with Hugging Face,... |
|
Experimental |
| 3632 |
shreydan/masked-language-modeling
Transformers Pre-Training with MLM objective — implemented encoder-only... |
|
Experimental |
| 3633 |
o-messai/fastVLM
An implementation of FastVLM/LLaVA or any llm/vlm model using FastAPI... |
|
Experimental |
| 3634 |
yejoon-lee/kr3
KR3: Korean Restaurant Review with Ratings / Experiments on... |
|
Experimental |
| 3635 |
subhasisj/llm-product-insights
Extracting Product Insights from Unstructured text data using LLMs with LangChain |
|
Experimental |
| 3636 |
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models |
|
Experimental |
| 3637 |
s4um1l/aya-cross-lingual-probe
Mechanistic interpretability of cross-lingual concept representations in... |
|
Experimental |
| 3638 |
ambideXtrous9/Finetune-Qwen3-using-Unsloth
Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset |
|
Experimental |
| 3639 |
rafaelvp-db/langchain-sql-databricks
Simple examples of using LLMs and Langchain on Databricks, |
|
Experimental |
| 3640 |
caesarnine/llm-experiments
Playing around with LLMs |
|
Experimental |
| 3641 |
sastpg/RFTT
RFTT: Reasoning with Reinforced Functional Token Tuning |
|
Experimental |
| 3642 |
Pomilon/LEMA
LEMA (Layer-wise Efficient Memory Abstraction): A hardware-aware framework... |
|
Experimental |
| 3643 |
nercone-dev/zeta-llm-dataset
Public Datasets for Zeta-Tool |
|
Experimental |
| 3644 |
Mr-TalhaIlyas/segformer
PyTorch Implementation of SegFormer: Simple and Efficient Design for... |
|
Experimental |
| 3645 |
taishan1994/qlora-chinese-LLM
使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE |
|
Experimental |
| 3646 |
dilbersha/llm-inference-benchmarking-3080
A production-grade telemetry-aware suite for benchmarking LLM inference... |
|
Experimental |
| 3647 |
joshxfi/bumblebee
🐝 Run on-device models directly from your browser via Transformers. |
|
Experimental |
| 3648 |
JIA-Lab-research/Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration... |
|
Experimental |
| 3649 |
exitudio/GaitMixer
Official repository for "GaitMixer: Skeleton-based Gait Representation... |
|
Experimental |
| 3650 |
Brazilian-willametteriver232/llama.swift
🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled... |
|
Experimental |
| 3651 |
fshnkarimi/train_scheduling_assistant
This project utilizes a fine-tuned Large Language Model (LLM) to generate... |
|
Experimental |
| 3652 |
aditeyabaral/maple
Implementation of the paper, MAPLE - MAsking words to generate blackout... |
|
Experimental |
| 3653 |
inuwamobarak/nougat
Nougat is a Meta AI's revolutionary OCR model designed to transcribe... |
|
Experimental |
| 3654 |
Bhargav1144/Mental_Health_Chatbot
A Streamlit-based AI chatbot offering compassionate mental health support... |
|
Experimental |
| 3655 |
mytechnotalent/MicroGPT
MicroGPT is a clean, educational implementation of the GPT (Generative... |
|
Experimental |
| 3656 |
SharathHebbar/ML-Project-list
List of all ML projects |
|
Experimental |
| 3657 |
Adora-Foundation/llm-energy-lab
Web application for benchmarking and comparing LLM behaviour, energy and... |
|
Experimental |
| 3658 |
cja5553/LLMs_in_perioperative_care
Codes for: Alba, C., Xue, B., Abraham, J. et al. The foundational... |
|
Experimental |
| 3659 |
mahadi-nahid/TabSQLify
[NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through... |
|
Experimental |
| 3660 |
ai-center-kth/cuBERT-source-code-clustering
Fine-tuning cuBERT embeddings for clustering source code by functionality |
|
Experimental |
| 3661 |
datvodinh/serve-llm
Serve high throughput and scalable LLM using Ray and vLLM |
|
Experimental |
| 3662 |
hpfield/Text2Touch
CoRL 2025 - Tactile In-Hand Manipulation with LLM-Designed Reward Functions |
|
Experimental |
| 3663 |
TristanLecourtois/NL2SQL
Text2SQL project comparing different LLM models |
|
Experimental |
| 3664 |
IsaacRodgz/Multimodal-Adapters
Adapter modules with support for multimodal fusion of information (text,... |
|
Experimental |
| 3665 |
DaemonLoki/MyAppleIntelligence
Custom implementation of Apple Intelligence features |
|
Experimental |
| 3666 |
DRSY/EasyKV
Easy control for Key-Value Constrained Generative LLM... |
|
Experimental |
| 3667 |
FredyRivera-dev/Flux2-from-scratch
This repo proposes to implement the Flux2 model from scratch |
|
Experimental |
| 3668 |
dwain-barnes/LLM-GGUF-Auto-Converter
Automated Jupyter notebook solution for batch converting Large Language... |
|
Experimental |
| 3669 |
Ahwar/NER-NLP-with-ONNX-Java
A Java NLP application that identifies names, organizations, and locations... |
|
Experimental |
| 3670 |
gesis24csspy/analyzing-text-data
Course materials on computational text analysis. John McLevey. 2024.... |
|
Experimental |
| 3671 |
talmago/spacy_coref
Lightweight cross-lingual coreference resolution with spaCy using ONNX... |
|
Experimental |
| 3672 |
theonesud/embedia
Create LLM-powered webapps with ease |
|
Experimental |
| 3673 |
UCSC-VLAA/vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in... |
|
Experimental |
| 3674 |
IbrahimSobh/askdoc
In this tutorial we will see 💡 How to get answers from documents using... |
|
Experimental |
| 3675 |
Yuan-ManX/infera
Infera — A High-Performance Inference Engine for Large Language Models. |
|
Experimental |
| 3676 |
zerob13/modelinfo-cli
A CLI to query AI model capabilities, context limits, and pricing from... |
|
Experimental |
| 3677 |
Omid-Nejati/Locality-iN-Locality
Robust Transformer with Locality Inductive Bias and Feature Normalization... |
|
Experimental |
| 3678 |
Arunkumar2510/LLM-Interview-Questions-and-Answers-Hub
🧠 Discover and prepare with 100+ LLM interview questions and answers to... |
|
Experimental |
| 3679 |
LennartKeller/roberta2longformer
Convert pretrained RoBerta models to various long-document transformer models |
|
Experimental |
| 3680 |
x-zheng16/CALM
[AAAI 25] CALM: Curiosity-Driven Auditing for LLMs |
|
Experimental |
| 3681 |
BenChaliah/Superposition-Transformer
a novel architecture that leverages Autoencoders to superimpose the hidden... |
|
Experimental |
| 3682 |
len-sla/NLP_mBART_mT5_translation
Polyglot Power: mBART & mT5 Translation Toolkit ... |
|
Experimental |
| 3683 |
rafaelvp-db/db-ancient-code-translation
Simple repo showing code-to-code and code-to-text capabilities using LLMs on... |
|
Experimental |
| 3684 |
designer-coderajay/logit-lens-explorer
Mechanistic interpretability tool visualizing GPT-2's layer-by-layer... |
|
Experimental |
| 3685 |
HenryNdubuaku/super-lazy-autograd
Hand-derived memory-efficient VJPs for tuning LLMs on laptops. |
|
Experimental |
| 3686 |
scienceetonnante/eiffel-tower-llama
Let's try to reproduce the Golden Gate Claude demo, but using open-source... |
|
Experimental |
| 3687 |
chrisliu298/llm-unlearn-eco
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts |
|
Experimental |
| 3688 |
gs-ai/mlm-memory
A functionally operational, mathematically unhinged system for achieving 10×... |
|
Experimental |
| 3689 |
Jiacheng-Zhu-AIML/AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models |
|
Experimental |
| 3690 |
riccardodm97/QA-QG
Question Answering and Question Generation NLP tasks on the SQuAD v1.1 dataset |
|
Experimental |
| 3691 |
Mattbusel/llm-wasm
LLM inference primitives for WebAssembly — cache, retry, routing, guards,... |
|
Experimental |
| 3692 |
Kovelja009/handwriting-recognition
Benchmark of different network architectures for handwritten text recognition. |
|
Experimental |
| 3693 |
tetratensor/Stock-Market-News-Sentiment-Analysis
A Python-based news sentiment analysis using Hugging Face Sentiment Analysis... |
|
Experimental |
| 3694 |
HelpingAI/inferno
Run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1, and other... |
|
Experimental |
| 3695 |
AbdBarho/transformers-stack
A full stack solution for deploying a transformers model from HuggingFace |
|
Experimental |
| 3696 |
yashjakhotiya/Adversarial-Attacks-On-Transformers
Exploring vulnerabilities of Transformers-based Malware Detectors to... |
|
Experimental |
| 3697 |
AdamCoscia/iScore
Upload, score, and visually compare multiple LLM-graded summaries simultaneously! |
|
Experimental |
| 3698 |
Aaronhuang-778/SliM-LLM
[ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large... |
|
Experimental |
| 3699 |
gokul-pv/PanopticSegmentation
Panoptic segmentation on custom construction objects using DETR |
|
Experimental |
| 3700 |
Andras7/gpt2-pytorch
Extremely simple and understandable GPT2 implementation with minor tweaks |
|
Experimental |