All Transformer Models
7,795 models ranked by quality score · Page 39 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3801 | KevinBian107/MOSAIC: Motif-preserving Graph Tokenization for Biological Structure Generation... | | Experimental |
| 3802 | jaslatendresse/llm-demo: This repository demonstrates how to do inference using llama.cpp on a... | | Experimental |
| 3803 | DigitalHarborFoundation/FlexEval: FlexEval is an LLM evaluation tool designed for practical quantitative analysis. | | Experimental |
| 3804 | dkopi/Bitune: Implementation of Bitune: Bidirectional Instruction-Tuning | | Experimental |
| 3805 | SupratikB23/HarmonyRL: Deep Learning framework for generating symbolic music (MIDI) using... | | Experimental |
| 3806 | sambitbhaumik/siamese-nn-sts: Project files contain PyTorch implementations for Siamese BiLSTM models for... | | Experimental |
| 3807 | Uranarc/Disentanglement: Comparative NLP study: BERTopic vs. Llama 3 for conversation... | | Experimental |
| 3808 | Josephrp/SmolFactory: Fine-tune gpt-oss and smollm3 on your data easily and cheaply | | Experimental |
| 3809 | hppRC/simple-simcse-ja: Exploring Japanese SimCSE | | Experimental |
| 3810 | ToluClassics/LowResourceOCR: This work is an adaptation of CNN+Transformer architecture to training text... | | Experimental |
| 3811 | abgache/NanoGPL: Small test generative pre-trained LAM (Linear Attention Mechanism). | | Experimental |
| 3812 | longday1102/VietAI-experiment-LLaMA2: ⚡ LLaMA-2 model experiment | | Experimental |
| 3813 | Blinorot/ALARM: Official Implementation of "ALARM: Audio–Language Alignment for Reasoning Models" | | Experimental |
| 3814 | SIC98/GPT2-python-code-generator: GPT2 finetuning with transformers 🤗 | | Experimental |
| 3815 | strickvl/isafpr_finetune: Finetuning an LLM for structured data extraction from press releases | | Experimental |
| 3816 | ia-labo/French-News-Clustering: Text classification and clustering using transformers and Denstream. | | Experimental |
| 3817 | saloni-1919/biosum-reliable: AI-powered biomedical text summarization using extractive NLP, biomedical... | | Experimental |
| 3818 | LennartKeller/DeepTextClustering: Deep text clustering with language models | | Experimental |
| 3819 | Hi-archers/MLaKE: COLING 2025: MLaKE: Multilingual Knowledge Editing Benchmark for Large... | | Experimental |
| 3820 | icon-lab/TranSMS: Official Implementation of Transformers for System Matrix Super-resolution (TranSMS) | | Experimental |
| 3821 | madara88645/VibeGraph: Turn any Python codebase into an interactive call graph with AI-powered... | | Experimental |
| 3822 | AmbiTyga/Automated-Medical-Assistance: Paper: https://openreview.net/forum?id=jYV4ZXy0L5 | | Experimental |
| 3823 | thinkwee/NOVER: [EMNLP-2025] R1-Zero on ANY TASK | | Experimental |
| 3824 | AlirezaSalehy/Tipsomaly: This is an extended version of the paper “TIPS Over Tricks: Simple Prompts... | | Experimental |
| 3825 | Human-Centric-Machine-Learning/counterfactual-llms: Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024. | | Experimental |
| 3826 | aastroza/llm-teaching: Teaching materials on Large Language Models (LLMs) | | Experimental |
| 3827 | zhengyima/knowqa: Baseline for a competition on measuring the knowledge of pretrained models (F1 0.35, BERTForMaskedLM) | | Experimental |
| 3828 | cvedix/omnisdk: On-device AI developer platform | | Experimental |
| 3829 | pszemraj/decoder-pytorch-template: Hackable PyTorch template for decoder-only transformer architecture... | | Experimental |
| 3830 | TarekkMU1911/AI-Agent-Diabetes-Diagnosis: This project builds an AI-powered agent to support diabetes patients using... | | Experimental |
| 3831 | artpli/CodeIE: [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot... | | Experimental |
| 3832 | krishnaplwl/Homework_Solver_LLM: A fine-tuned LLM to solve homework questions ranging from maths to science... | | Experimental |
| 3833 | ArpitKadam/Attention-Is-All-You-Code: From Attention Mechanisms to Large Language Models, built from scratch. | | Experimental |
| 3834 | NikolaOgnjenovic/WebWise: Full stack web app which lets users upload & browse videos in order to... | | Experimental |
| 3835 | Talnz007/VulkanIlm: GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems a... | | Experimental |
| 3836 | HKUNLP/multilingual-transfer: Code for paper “Language Versatilists vs. Specialists: An Empirical... | | Experimental |
| 3837 | tensorchord/inference-benchmark: Benchmark for machine learning model online serving (LLM, embedding,... | | Experimental |
| 3838 | tatwan/mastering_llm_deployments: This is based on my comprehensive course on deploying Large Language Models... | | Experimental |
| 3839 | nnilayy/BioCore: A comprehensive bioinformatics platform/suite for molecular biology research... | | Experimental |
| 3840 | asokraju/LangChainDatasetForge: Generating artificial datasets using langchain and finetuning the LLMs on... | | Experimental |
| 3841 | loryanstrant/ha-transformers-theme: A Transformers theme for Home Assistant | | Experimental |
| 3842 | abhayra12/StudentLife-Phenotyping: End-to-end behavioral prediction system using digital phenotyping. PyTorch... | | Experimental |
| 3843 | Yash-Kavaiya/30-Days-LLM-Mastery-Course: 30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep... | | Experimental |
| 3844 | schmijul/TransformerForSignalPredicition: This is a private learning project to play around with Transformers | | Experimental |
| 3845 | avijit-jana/huggingface-nlp-image-tool: An end-to-end application leveraging Hugging Face pretrained models for... | | Experimental |
| 3846 | ovshake/rat: Reverse Attention Tracer: A lightweight API to visualize which words... | | Experimental |
| 3847 | mtkaya/transformer-edge-optimization: Optimize Transformer models for edge devices | | Experimental |
| 3848 | BLCK-B/Moerkepub: Local EPUB translation using multilingual Transformer models on GPU. | | Experimental |
| 3849 | sergio11/llm_finetuning_and_evaluation: The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for... | | Experimental |
| 3850 | ayaka14732/TrAVis: TrAVis: Visualise BERT attention in your browser | | Experimental |
| 3851 | mosh98/MMBT: Multimodal BiTransformer [Reimplementation] in PyTorch That Actually Works! | | Experimental |
| 3852 | madeburo/GEO-AI-Shopify: AI Search Optimization for Shopify. Generate llms.txt, AI crawler rules and... | | Experimental |
| 3853 | tianzhaotju/LEAM: We propose a novel DL-based mutation technique (LEAM), which adapts the... | | Experimental |
| 3854 | wondergo2017/LLM4DyG: Implementation code for KDD24 paper "LLM4DyG: Can Large Language Models... | | Experimental |
| 3855 | Eric2i/LLM-MindMap: EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning... | | Experimental |
| 3856 | InternRobotics/Grounded_3D-LLM: Code & Data for Grounded 3D-LLM with Referent Tokens | | Experimental |
| 3857 | DresOperatingSystems/Dresguardian: Privacy-first Telegram group management bot with built-in AI and DuckDuckGo... | | Experimental |
| 3858 | inuwamobarak/Meta-Llama-3-8B: Experiments with the Meta-Llama-3-8B | | Experimental |
| 3859 | byroneverson/Mia: A simple Swift app for macOS/iOS to test large language models (LLMs) | | Experimental |
| 3860 | Fromsko/neural_friend_kit: Train a neural network on WeChat chat logs to replicate a friend's speaking style | | Experimental |
| 3861 | lakshyaag/Deep-Learning-From-Scratch: Implementing popular deep learning papers in PyTorch | | Experimental |
| 3862 | shreydan/scratchformers: Building various transformer model architectures and their modules from scratch. | | Experimental |
| 3863 | timvvvht/HKEX-Announcement-Classifier: A project on data exploration, analysis and using a neural network to... | | Experimental |
| 3864 | hsj576/GTO: Official Implementation of "Bridging Draft Policy Misalignment: Group Tree... | | Experimental |
| 3865 | TheDarkchip/nfp: Lean 4 library + CLI for rigorous bounds in transformer computations... | | Experimental |
| 3866 | mirzayasirabdullahbaig07/Fine-Tuning-LLaMA-3.2-3B-Using-PEFT-LoRA: This project showcases parameter-efficient fine-tuning of the LLaMA 3.2 (3B)... | | Experimental |
| 3867 | mhajder/llama.cpp-updater: A shell script to automatically update or build llama.cpp with optimal GPU... | | Experimental |
| 3868 | MingSun-Tse/Awesome-Efficient-ViT: Recent Advances on Efficient Vision Transformers | | Experimental |
| 3869 | saichandrapandraju/TabQGen: This repository hosts the code for the paper "Answer-Aware Question... | | Experimental |
| 3870 | Ebimsv/LLM-Lab: Pretraining and Finetuning Language Model | | Experimental |
| 3871 | amanongithub7/classical-music-generation: Comparing LSTM and Transformer-based deep learning approaches for classical... | | Experimental |
| 3872 | bassrehab/credit_risk: Forecast long sequence default/downgrade of corporate entities and financial... | | Experimental |
| 3873 | afspies/attention-tutorial: Jupyter Notebook tutorial on Attention Mechanisms, Position Embeddings and... | | Experimental |
| 3874 | danelpeng/Awesome-Continual-Leaning-with-PTMs: This is a curated list of "Continual Learning with Pretrained Models" research. | | Experimental |
| 3875 | Devnetly/image-captioning: Image captioning model & application based on transformers. | | Experimental |
| 3876 | a1exus/koda: Local LLM orchestration: run GGUF models via llama.cpp with one command | | Experimental |
| 3877 | MouxiaoHuang/PPE: [ICLR 2026] Official code of PPE: Positional Preservation Embedding for... | | Experimental |
| 3878 | yuval6957/SIIM-Transformer: Yuval and nosound models and write-up for Kaggle's competition "SIIM-ISIC... | | Experimental |
| 3879 | ExposedCat/tg-local-llm: Run local LLMs powered up by tools in Telegram Messenger | | Experimental |
| 3880 | pablo-reyes8/implementing-gpt: Clean-room GPT-2/GPT-3 implementation: tokenizers, architecture blocks,... | | Experimental |
| 3881 | Stoksweet/modlable: A platform for building, training and running inference on TensorflowJS... | | Experimental |
| 3882 | DoctorLai/SimilarString: Compute the score of similarity between two strings | | Experimental |
| 3883 | JexanJoel/VoiceIQ-Backend: AI engine for VoiceIQ - transcribes Hinglish & Tanglish call recordings via... | | Experimental |
| 3884 | RitoCryo/DeepRWKV-Reasoning: 🔍 Enhance reasoning in Large Language Models with DeepRWKV-Reasoning, using... | | Experimental |
| 3885 | Andrew2077/Alpaca: Simple Q/A app where I created a UI for the Alpaca (fine-tuned LLaMA) model... | | Experimental |
| 3886 | Technolog796/image_captioning: Building a Russian-language model for image captioning | | Experimental |
| 3887 | RUCKBReasoning/CodeRM: Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of... | | Experimental |
| 3888 | farukalamai/background-removal-birefnet: Background Removal Application using BiRefNet | | Experimental |
| 3889 | yubainu/sibainu-engine: Real-time hallucination detection for LLMs via Geometric Drift Analysis in... | | Experimental |
| 3890 | showlab/VisInContext: Official implementation of Leveraging Visual Tokens for Extended Text... | | Experimental |
| 3891 | GoWtEm/llm-model-selector: A high-performance Rust utility that analyzes your system hardware to... | | Experimental |
| 3892 | AI-14/pkatransnet: [IVC 2025] [Official code] - Enhancing radiology report generation: A prior... | | Experimental |
| 3893 | shreyansh26/LLM-Sampling: A collection of various LLM sampling methods implemented in pure PyTorch | | Experimental |
| 3894 | NachoPeinador/FRUGAL_AI_CHIP: FrugalAI Chip: Modular silicon architecture for disposable AI. Achieves... | | Experimental |
| 3895 | Assaoka/Guide-to-Advanced-LLM-Techniques: This repository is a complete, hands-on tutorial that explores methodologies... | | Experimental |
| 3896 | harshpimpale/LegalMind: A project that uses Large Language Models (LLMs) to assist users with legal... | | Experimental |
| 3897 | byramsubramanian/yt-video-summarizer: Video Summarization Experiments with Open LLMs | | Experimental |
| 3898 | MiuLab/InstUPR: Source code of our paper "InstUPR: Instruction-based Unsupervised Passage... | | Experimental |
| 3899 | IonutIga/LLMs-for-KGC: Repository for experiments regarding the assessment of the suitability of... | | Experimental |
| 3900 | sayhitosandy/Mamba_SSM: Mamba: Linear-Time Sequence Modeling with Selective State Spaces | | Experimental |