All Transformer Models
7,795 models ranked by quality score · Page 38 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 3701 |
Anu0408/Language_Translation_GenAI_App
Language Translator is an AI-powered tool for text and voice translation... |
|
Experimental |
| 3702 |
DrRuin/Lightweight-Fine-Tuning
Lightweight fine-tuning is one of the most important techniques for adapting... |
|
Experimental |
| 3703 |
yuki-2025/llama3-8b-fine-tuning-math
Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b... |
|
Experimental |
| 3704 |
lucataco/cog-llama-3-vision-alpha
Cog wrapper for qresearch/llama-3-vision-alpha |
|
Experimental |
| 3705 |
ZhouYuxuanYX/Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs
This is the official implementation of our ACL 2025 Main paper "Balancing... |
|
Experimental |
| 3706 |
nguynking/CS330
Assignment solutions for CS330: Deep Multi-Task and Meta Learning, Fall 2023... |
|
Experimental |
| 3707 |
deadlykitten4/ERC-SVD
ERC-SVD: Error-Controlled SVD for Large Language Model Compression |
|
Experimental |
| 3708 |
iiis-ai/TemplateMath
[ICLR 2025 DATA-FM] Training and Evaluating Language Models with... |
|
Experimental |
| 3709 |
Shreyas-Bhat/LMLF
Code for "Generating Novel Leads for Drug Discovery Using LLMs with Logical... |
|
Experimental |
| 3710 |
webnizam/alpaca-telegram-bot
Simplest way to host a local ChatGPT like model for Telegram. |
|
Experimental |
| 3711 |
osiriszjq/structured_init
Structured Initialization for Attention in Vision Transformers |
|
Experimental |
| 3712 |
VinniLP/Document-Similarity-Finding-using-BERT
Document-Similarity-Finding-using-BERT |
|
Experimental |
| 3713 |
tuan3w/llama-raycast
Chat with LLaMa in Raycast |
|
Experimental |
| 3714 |
s-omranpour/Shirin-Sokhan
A Persian Poet Transformer! (finetuned GPT2 on Ganjoor data) |
|
Experimental |
| 3715 |
alphadl/OOP-eval
The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs |
|
Experimental |
| 3716 |
thunlp/cost-optimal-gqa
The code for the paper "Cost-Optimal Grouped-Query Attention for... |
|
Experimental |
| 3717 |
Trustworthy-ML-Lab/ThinkEdit
[EMNLP 25] An effective and interpretable weight-editing method for... |
|
Experimental |
| 3718 |
joshvoigts/llmctx
LLM context builder |
|
Experimental |
| 3719 |
raghavagps/il2pred
Prediction of IL2 inducing peptides |
|
Experimental |
| 3720 |
open-compass/Ada-LEval
The official implementation of "Ada-LEval: Evaluating long-context LLMs with... |
|
Experimental |
| 3721 |
TirendazAcademy/Bert-Text-Classification-Gradio-App
End-to-end text classification project with Transformers, Comet ML, and Gradio |
|
Experimental |
| 3722 |
SrikarVeluvali/Astor-AI
AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented... |
|
Experimental |
| 3723 |
didar00/Final-Project
SELFIES-Transformer: Learning the Representation of Chemical Space for... |
|
Experimental |
| 3724 |
mala-lab/HaMI
[NeurIPS 2025] Official implementation for ''Robust Hallucination Detection... |
|
Experimental |
| 3725 |
horenbergerb/llamagotchi
A bunch of LLaMa model investigations, including recreating generative... |
|
Experimental |
| 3726 |
IbrahimSobh/askpdf
In this tutorial we will see 💡 How to get answers from a PDF file using... |
|
Experimental |
| 3727 |
PCfVW/plip-rs
Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of... |
|
Experimental |
| 3728 |
KimDaeUng/PLM-Implementation
NLP Pretrained Language Models Implementation Study |
|
Experimental |
| 3729 |
oneonlee/KoAirBERT
🤗 항공 안전 도메인에 특화된 한국어 BERT 모델 ✈️ |
|
Experimental |
| 3730 |
basaanithanaveenkumar/HaloBlocks
Python library designed to make model experimentation seamless and fast. The... |
|
Experimental |
| 3731 |
RLHF-V/RLHF-V
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from... |
|
Experimental |
| 3732 |
cattolatte/reflective-reasoning-transformer
🧠 R2T Prototype: An LLM pre-trained on causal graphs (not just text) to... |
|
Experimental |
| 3733 |
arcxteam/gguf-convert-model
Auto GGUF Converter for HuggingFace Hub Models with Multiple Quantizations... |
|
Experimental |
| 3734 |
Mattbusel/llm-cpp
The C++ LLM toolkit. 26 single-header libraries for streaming, caching, cost... |
|
Experimental |
| 3735 |
balaji1233/AI-Radiology-Reporting
Using MAIRA-2 multimodal transformer designed for the generation of... |
|
Experimental |
| 3736 |
minjiyoon/MMGL
Multimodal Graph Learning: how to encode multiple multimodal neighbors with... |
|
Experimental |
| 3737 |
EmbeddedLLM/embeddedllm
EmbeddedLLM: API server for Embedded Device Deployment. Currently support... |
|
Experimental |
| 3738 |
blayyyyyk/cs478
For the duration of my Independent Study course, I have been tasked with... |
|
Experimental |
| 3739 |
sajidkhan2067/LLMOnAWS
Deploy smaller LLM on AWS Lambda: Phi-2, cost-effective language model |
|
Experimental |
| 3740 |
DFKI-NLP/gevalm
Code and data for the paper "Evaluating German Transformer Language Models... |
|
Experimental |
| 3741 |
Khaeldur/NeuralForge
On-device LLM fine-tuning for Apple Silicon (ANE) |
|
Experimental |
| 3742 |
godofpdog/ViT_PyTorch
This is a simple PyTorch implementation of Vision Transformer (ViT)... |
|
Experimental |
| 3743 |
jranaraki/vllm-fit
A CLI tool designed to simply recommend (conservative), and/or profile (to... |
|
Experimental |
| 3744 |
VincLee8188/Spatio-temporal-forecasting-PyTorch
Leverage on recent advances in graph convolution and sequence modeling to... |
|
Experimental |
| 3745 |
cronenberg64/SciBERT-CTFT
SciBERT-based scientific abstract classification using SetFit framework with... |
|
Experimental |
| 3746 |
sc0v0ne/udemy_course_mastering_ollama_build_private_local_llm_apps_with_python
Udemy Course Mastering Ollama Build Private Local LLM Apps with Python |
|
Experimental |
| 3747 |
ffreemt/convbot
A conversational bot based on huggingface transformers |
|
Experimental |
| 3748 |
OFA-Sys/DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning |
|
Experimental |
| 3749 |
princeton-nlp/dyck-transformer
[ACL 2021] Self-Attention Networks Can Process Bounded Hierarchical Languages |
|
Experimental |
| 3750 |
UNITES-Lab/HEXA-MoE
Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE... |
|
Experimental |
| 3751 |
PRITHIVSAKTHIUR/Qwen-Image-LoRA-DLC
Qwen-Image model with various LoRA (Low-Rank Adaptation) styles. This tool... |
|
Experimental |
| 3752 |
ansh-info/Titans-Learning-to-Memorize-at-Test-Time-with-Manim
Visual animated walkthroughs of the DeepMind "Titans: Learning to Memorize... |
|
Experimental |
| 3753 |
MBadriNarayanan/ClickbaitClassification
Classifying clickbaits: articles with potentially misleading titles, using a... |
|
Experimental |
| 3754 |
MMStar-Benchmark/MMStar
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on... |
|
Experimental |
| 3755 |
RufelleEmmanuelPactol/Mixture-of-Experts-Transcript-Evaluator
A mixture of experts inspired transcript evaluator using LLM fine-tuning.... |
|
Experimental |
| 3756 |
symfony/ai-transformers-php-platform
TransformersPhp platform bridge for Symfony AI |
|
Experimental |
| 3757 |
hululuzhu/llama-lora-chinese-couplet
llama-lora e2e example to demo a Chinese Couplet AI in 10 mins. some... |
|
Experimental |
| 3758 |
Navya0203/Abstractive-Text-Summarization-Using-RNN-and-Transformers
This repository contains implementations of abstractive text summarization... |
|
Experimental |
| 3759 |
daniau23/LoRAfrica
LoRAfrica: Scaling LLM Fine Tuning for African History |
|
Experimental |
| 3760 |
zixi-liu/Transformers-Learning
Stanford CS25 - Transformer United and CS224n learning notes and code dump. |
|
Experimental |
| 3761 |
OpenNLPLab/Tnn
[ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper -... |
|
Experimental |
| 3762 |
Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages
This project presents a multilingual pipeline for both speech-to-text and... |
|
Experimental |
| 3763 |
nateraw/discord-image-captioning-bot
A Discord bot for captioning images |
|
Experimental |
| 3764 |
ahmed19999520-alt/Veronica-X-Pro-open-source-code-2.0
Advanced AI system with real quantum computing integration, sophisticated... |
|
Experimental |
| 3765 |
codiceSpaghetti/numpyGPT
A from-scratch GPT built with NumPy and Python’s standard library. No... |
|
Experimental |
| 3766 |
nsourlos/LLM_evaluation_framework
Evaluate performance of LLM models for Q&A in any domain |
|
Experimental |
| 3767 |
ertugrulakben/NEURON
Hybrid memory architecture combining exact recall with infinite-capacity... |
|
Experimental |
| 3768 |
Nutanpatil06/Fine-Tuning-LLM-with-LLaMA-Factory
Complete LoRA/QLoRA implementation using LLaMA Factory. Fine-tune models... |
|
Experimental |
| 3769 |
opencodeiiita/Finetuning_Llama
Fine-Tuning LLaMA for Indian Laws |
|
Experimental |
| 3770 |
happydasch/llm_advisory
Modular framework for building topic-specific advisors powered by large... |
|
Experimental |
| 3771 |
1tangerine1day/chinese-QA-chatbot
A simple chinese QA chatbot implement with pytorch and transformer trained... |
|
Experimental |
| 3772 |
styfeng/SMERTI
Code for SMERTI for Semantic Text Exchange. |
|
Experimental |
| 3773 |
Bradley-Butcher/Conformers
Unofficial implementation of Conformal Language Modeling by Quach et al |
|
Experimental |
| 3774 |
harshpimpale/AyurvedaGPT
A Streamlit-based platform offering Ayurvedic remedies. Users can ask... |
|
Experimental |
| 3775 |
waybarrios/dgx-spark-finetune-llm
LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10) |
|
Experimental |
| 3776 |
xufangzhi/Symbol-LLM
[ACL 2024] The project of Symbol-LLM |
|
Experimental |
| 3777 |
ArturPen/ab-transformers-timeskip-exploit
Python + ADB automation script for the Time Skip exploit in Angry Birds Transformers. |
|
Experimental |
| 3778 |
AstraBert/DebateLLM-Championship
5 LLMs, 1vs1 matches to produce the most convincing argumentation in favor... |
|
Experimental |
| 3779 |
rashomon-gh/attention-visualiser
a module to visualise attention layer activations from transformer based... |
|
Experimental |
| 3780 |
smsnobin77/Awesome-Multimodal-Unlearning
This repo presents a survey of multimodal unlearning across vision,... |
|
Experimental |
| 3781 |
Comrade-1729/lex-brief-ai
Safety-first legal NLP system with hierarchical long-document processing,... |
|
Experimental |
| 3782 |
LGDiMaggio/few-shot-fault-diagnosis-multimodal-LLM
Few-shot bearing fault diagnosis using multimodal LLMs and prototypical networks |
|
Experimental |
| 3783 |
ilanaliouchouche/KANBert
Implementation of an Encoder only MoE usable as an Embedding Model,... |
|
Experimental |
| 3784 |
SauravP97/toy-transformer
A decoder only Transformer implementing masked attention |
|
Experimental |
| 3785 |
m-rishab/Research-Paper-Recommendation
This project aims to build a research paper recommendation system. Given a... |
|
Experimental |
| 3786 |
nitrictech/pycasts
A text to Podcast inference API |
|
Experimental |
| 3787 |
Dim10p/relation-extraction-on-financial-documents
This repository contains all the scripts and methodology for the Relations... |
|
Experimental |
| 3788 |
Strong-AI-Lab/Explanation-Generation
We introduce "ILearner-LLM" a framework that uses iterative enhancement with... |
|
Experimental |
| 3789 |
hank0316/AdaSearch
This includes the original implementation of "AdaSearch: Balancing... |
|
Experimental |
| 3790 |
YASSER-27/LLMs
A high-performance, cross-platform desktop application for chatting with... |
|
Experimental |
| 3791 |
Johandaonis1/OMG-Agent
🤖 Automate Android operations with OMG-Agent, an open-source Mobile GUI... |
|
Experimental |
| 3792 |
horde-research/horde-common
Shared scripts for offline Kazakh LLM eval—run inference, auto-score, and... |
|
Experimental |
| 3793 |
NS027/medical_chatbot_project_genAI
Multimodal AI-powered medical assistant with LLMs, speech, and image understanding. |
|
Experimental |
| 3794 |
MDalamin5/Build-and-Finetune-LLM-From-Scratch-Deploy-via-vLLM-AWS-GCP
A complete end-to-end learning repo covering everything from building Large... |
|
Experimental |
| 3795 |
bendsouza2/yt-translator
This project aims to provide free and accessible language learning resources... |
|
Experimental |
| 3796 |
Rin313/StegLLM
离线的LLM文本隐写程序。Offline LLM text steganography program. |
|
Experimental |
| 3797 |
mourga/transformer-uncertainty
Code for evaluating uncertainty estimation methods for Transformer-based... |
|
Experimental |
| 3798 |
gxcsoccer/alloy
Hybrid SSM-Attention language model on Apple Silicon with MLX — interleaving... |
|
Experimental |
| 3799 |
korovod/kenotron
Experimental fork of Nanotron, a minimalistic large language model... |
|
Experimental |
| 3800 |
frikishaan/glama-124m
GLaMA is a small-scale autoregressive transformer model inspired by... |
|
Experimental |