All Transformer Models
7,795 models ranked by quality score · Page 63 of 78
| # | Model | Score | Tier |
|---|---|---|---|
| 6201 |
Papasmurf79/RestaurantRecommenderLLM
With this project, I created an end-to-end AI pipeline that transforms a... |
|
Experimental |
| 6202 |
craigrmc/alignment
An essay to help facility the transition to AGI and then ASI. A traditional... |
|
Experimental |
| 6203 |
sitta07/LLM-Regex-DataPrep
A data preprocessing pipeline for TCAS admission data. This project... |
|
Experimental |
| 6204 |
wiedersehne/Paramixer
Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product... |
|
Experimental |
| 6205 |
madalinioana/intent-qualification
Hybrid company qualification pipeline using LLM intent parsing, vector... |
|
Experimental |
| 6206 |
lucasfrag/automated-dataset-translator
Automatically translate structured datasets (CSV, JSON, JSONL, TSV, Parquet)... |
|
Experimental |
| 6207 |
Gyldenn/storywriter
Fine-tuning Mistral 7B with LoRA (QLoRA 4-bit) to generate Shakespearean... |
|
Experimental |
| 6208 |
nmn-pandey/brain-tumour-segmentation
Code for automated brain tumor segmentation from MRI scans using CNNs with... |
|
Experimental |
| 6209 |
saadtariq-learning/llms_with_google_cloud
Leveraged the power of Google Cloud's Vertex AI platform to develop advanced... |
|
Experimental |
| 6210 |
G-B-KEVIN-ARJUN/size-precision-slm-bench
is it better to run a Tiny Model (2B-4B) at High Precision (FP16/INT8), or a... |
|
Experimental |
| 6211 |
Sparsh-2007/GPT-From-Scratch
Implementation of a GPT-style LLM from scratch, following "Build a Large ... |
|
Experimental |
| 6212 |
Rekhii/Machine-Learning
Daily ML practice notebooks covering tabular data, deep learning, and... |
|
Experimental |
| 6213 |
buhsnn/Vision-Language-Model
Vision-language model combining a ResNet18 vision encoder with a GPT-2... |
|
Experimental |
| 6214 |
likitha-am/my_newcityfrnd
MyNewFrnd is a web-based platform that helps people relocating to a new city... |
|
Experimental |
| 6215 |
kashan-alam/ai-backend-fastapi
AI-powered backend API built with FastAPI, JWT authentication, rate... |
|
Experimental |
| 6216 |
Mainframework/Quanta
Convert and quantize llm models |
|
Experimental |
| 6217 |
navanodonavan/Transformer_Based_Follow_Through_Prediction_ML_MSAAI590
Transformer based follow through prediction for intraday trading |
|
Experimental |
| 6218 |
Adwerse/Mini_LLM
π§ Transformer built from scratch β RoPE, SwiGLU, KV-Cache, Flash Attention.... |
|
Experimental |
| 6219 |
LinukPerera/JEPA-Explainer
A practical explainer of JEPA, Meta AIβs Joint Embedding Predictive... |
|
Experimental |
| 6220 |
Datta0/nanoformer
A small repo to experiment with Transformer (and more) architectures. |
|
Experimental |
| 6221 |
CosmonautCode/Tiny-Local-LLM-System-Expanded
A lightweight, self-contained Python project for running local LLM... |
|
Experimental |
| 6222 |
Eddie-oss-369/AI_Emotional_Mirror
π¬ Reflect on your emotions with AI Emotional Mirror, a tool that clarifies... |
|
Experimental |
| 6223 |
movieonlyemail4/vscode-local-llm-
Run local AI models in VS Code with automatic model detection, server start,... |
|
Experimental |
| 6224 |
rudyon/pipeline
Training pipeline for LLMs in PyTorch. |
|
Experimental |
| 6225 |
mlengineershub/LazyTeacher
End-To-End Machine Learning Project for Automatic Grades for Student Essays |
|
Experimental |
| 6226 |
garimamittal13/csai_S26
Neuroimaging preprocessing, brain decoding, and visual brain encoding using... |
|
Experimental |
| 6227 |
sahilkulkarni07/Merchant-Risk-ML-Pipeline
End-to-end BNPL merchant risk assessment pipeline with portfolio aggregation... |
|
Experimental |
| 6228 |
Root1V/llm-security
JWT-based authentication and authorization gateway for locally deployed LLM... |
|
Experimental |
| 6229 |
LazaUK/HuggingFace-BAAI-BGERerankerv2m3
BGE Reranker v2 m3 demo with Hugging Face transformers for local and Azure cloud use. |
|
Experimental |
| 6230 |
rinoScremin/Open_Cluster_AI_Station_beta
High-performance distributed matrix computation for AI workloads. Supports... |
|
Experimental |
| 6231 |
kantkrishan0206-crypto/AlignGPT
βThis project implements a mini LLM alignment pipeline using Reinforcement... |
|
Experimental |
| 6232 |
ikun-llm/ikun-V
ε€ζ¨‘ζθ§θ§θ―θ¨ζ¨‘ε | Vision-Language Model ποΈ |
|
Experimental |
| 6233 |
RAHB-REALTORS-Association/transcriber-describer
Transcribes videos and describes them with OpenAI APIs or local models. |
|
Experimental |
| 6234 |
JIA-Lab-research/TGDPO
[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing... |
|
Experimental |
| 6235 |
itxmjr/LLM-From-Scratch
A step-by-step Guide: Build a GPT-like LLM From Scratch using PyTorch |
|
Experimental |
| 6236 |
GAIR-Lab/LLMPopcorn
LLM-assisted Popular Micro-video Creation |
|
Experimental |
| 6237 |
Jake1402/Torch-GPTs
A way for users to train, and interact with their own mini language models... |
|
Experimental |
| 6238 |
Sheryar-bit/Lang-Chain
LangChain / Machine Learning / NLP, practice repository |
|
Experimental |
| 6239 |
SkywardAI/kimchima
Customise neural nets, knowledge distillation and transfer learning collection |
|
Experimental |
| 6240 |
kazuki-irie/hybrid-memory
Official repository for the paper "Blending Complementary Memory Systems in... |
|
Experimental |
| 6241 |
perlathebian/multilingual-ai-news-summarizer
AI-powered web app for multilingual news summarization. Processes... |
|
Experimental |
| 6242 |
plaban/fllm-aut25
This is the webpage repository of Foundation of LLM course offered by... |
|
Experimental |
| 6243 |
M3-IT/YING-VLM
Vision Large Language Models trained on M3IT instruction tuning dataset |
|
Experimental |
| 6244 |
dlukeh/transformer-deep-dive
A deep descent into the neural abyss β understanding transformers through... |
|
Experimental |
| 6245 |
arvind207kumar/Time-Cross-Adaptive-Self-Attention-TCSA-based-Imputation-model-
Time-Cross Adaptive Self-Attention (TCSA) model for multivariate Time... |
|
Experimental |
| 6246 |
sghosh-04/notes_generator
End-to-end AI pipeline β speech-to-text, NLP summarization, topic... |
|
Experimental |
| 6247 |
RealTapeL/Xiao_i_Chat
η¨δΊθδΈζθ²ι’εηε€§θ―θ¨ζ¨‘ε |
|
Experimental |
| 6248 |
Thatimmorsit/llm-document-summarizer
A Python-based tool for summarizing long documents using state-of-the-art... |
|
Experimental |
| 6249 |
nehamaheshh/Reasoning-style-fine-tuning-PEFT
LoRA vs QLoRA fine-tuning on CommonsenseQA measuring accuracy, GPU memory,... |
|
Experimental |
| 6250 |
emnaxbenjazia/Text-Summarization-NLP-Project
This is a learning project. It follows the same general steps and structure... |
|
Experimental |
| 6251 |
StyrbjornKall/TRIDENT_application
Source code for the web application associated with "Transformers enable... |
|
Experimental |
| 6252 |
arjunravi26/rag_news_extractor
A rag project that aims to summarize the news and also clarify questions... |
|
Experimental |
| 6253 |
sharpsalt/Captionforge-Multimodal-Image-Captioning-System
This PyTorch-based image captioning model uses ResNet-50 encoder and... |
|
Experimental |
| 6254 |
sitammeur/readerlm-litserve
Leverage Reader-LM's capabilities using LitServe. |
|
Experimental |
| 6255 |
Shravani018/interpreting-transformer-hallucinations
Mechanistic interpretability of transformer hallucinations via attention... |
|
Experimental |
| 6256 |
curiousbrutus/fNIRS-Vise
NIRS-VIS is a Master Thesis Project for decoding visual stimuli from fNIRS... |
|
Experimental |
| 6257 |
Jac-Zac/Thesis
Repository thesis on HTR on museum's biological artifacts labels |
|
Experimental |
| 6258 |
shreydan/simpleVLM
building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2... |
|
Experimental |
| 6259 |
miriusz6/Masters-Thesis
Code for my Master's Thesis: Enhancing Fossil Identification: A Study on... |
|
Experimental |
| 6260 |
alantomanu/Autograde
AutoGrade is a modern web application that automates the grading of... |
|
Experimental |
| 6261 |
Abhinaykotla/news-summarization-T5-Transformer
The project's goal is to use different Deep Learning techniques - T5... |
|
Experimental |
| 6262 |
UtpaL2102/Dark-Pattern-detector
DPdetector is a browser extension that detects dark patterns in ads without... |
|
Experimental |
| 6263 |
gazelle93/llm-fine-tuning-sft-lora-qlora
Practical examples for fine-tuning large language models (LLMs) with SFT,... |
|
Experimental |
| 6264 |
luhuim/Transformers_project
This is my ongoing master thesis project. It is a transformers model. |
|
Experimental |
| 6265 |
stelaras36/OCRfixer
Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model |
|
Experimental |
| 6266 |
Parth844/AI_pdf_to_Epub
AI-powered PDF to EPUB conversion engine with LLM-based chapter detection... |
|
Experimental |
| 6267 |
omer-gulsoy/ML-ClassicalMusicEra
π» AI project classifying Classical Music eras (Baroque, Classical, Romantic,... |
|
Experimental |
| 6268 |
milsab/TETUP
TETUP: Code for "Towards Explainable Temporal User Profiling with LLMs"... |
|
Experimental |
| 6269 |
samratrajsharma/LLMs
Experimental implementations of core Large Language Model components... |
|
Experimental |
| 6270 |
Prince445-hub/mlx-drifting-model
π Evolve generative models with a drifting field for efficient single-step... |
|
Experimental |
| 6271 |
joetansey1/grafana-anomaly-detector
LLM-Powered SLO Anomaly Detector |
|
Experimental |
| 6272 |
gayatriiv/novelty-detector
transformer-based NLP system to detect semantic contradictions and novelty... |
|
Experimental |
| 6273 |
OleksandrZadvornyi/marketcap-forecasting
End-to-end market capitalization forecasting system using... |
|
Experimental |
| 6274 |
RWKV-Wiki/rwkv-wiki.github.io
RWKV Wiki website (archived, please visit official wiki) |
|
Experimental |
| 6275 |
j-f1/LLM-Playground
Play with LLaMA & GPT-3! |
|
Experimental |
| 6276 |
StarDust130/kaizen_ai
β‘π€ AI-Powered LinkedIn Post Generator That Slaps ππ₯β¨ |
|
Experimental |
| 6277 |
xlisp/learn-langgraph
learn langgraph step by step |
|
Experimental |
| 6278 |
gideon-ogunbanjo/WikiMindAI
WikiMindAI - Wikipedia-based Mindful AI Search Engine |
|
Experimental |
| 6279 |
hendrik-spl/sustainable-llm-knowledge-distillation
Resource-efficient LLM distillation: Improving sustainability and reducing... |
|
Experimental |
| 6280 |
VITA-Group/Data-Efficient-Scaling
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao... |
|
Experimental |
| 6281 |
detail-novelist/novelist-triton-server
Deploy KoGPT with Triton Inference Server |
|
Experimental |
| 6282 |
robflynnyh/hydra-linear-attention
Implementation of: Hydra Attention: Efficient Attention with Many Heads... |
|
Experimental |
| 6283 |
genglinliu/UnknownBench
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions... |
|
Experimental |
| 6284 |
priyansh4320/Abstractive-Text-Summarization-Enhancing-Sequence-to-Sequence-Models-Using-Word-Sense-Disambiguatio
This repository contains code and resources for abstractive text... |
|
Experimental |
| 6285 |
Ojas025/almostGPT
A GPT implementation for training and generating text on custom datasets |
|
Experimental |
| 6286 |
madboy482/FakeNewsDetection
A deep learning-based fake news detection system leveraging BERT and... |
|
Experimental |
| 6287 |
yulo-dev/cf_ai_baseball
AI-powered baseball statistics assistant built with Cloudflare Workers AI... |
|
Experimental |
| 6288 |
RaphaelMouravieff/TabStruct
Code for ACL 2025 paper: "Structural Deep Encoding for Table Question Answering" |
|
Experimental |
| 6289 |
m15kh/Transformer_From_Scratch_Pytorch
Implementation of Transformer from scratch in PyTorch, covering full... |
|
Experimental |
| 6290 |
SaaranshDx/TinyLM
TinyML v1 β 3.1M-parameter AI, no frameworks, fully from scratch. |
|
Experimental |
| 6291 |
sakhileln/rope-pytorch
RoPE Playground β Rotary Positional Embeddings in PyTorch |
|
Experimental |
| 6292 |
diixo/build-gpt
A PyTorch library with educational re-implementation of GPT-models: GPT2, LLaMA |
|
Experimental |
| 6293 |
sumony2j/SeedGPT-22M
SeedGPT is a lightweight, 22M-parameter Transformer LLM for efficient text... |
|
Experimental |
| 6294 |
khaHesham/My-Machine-Learning-Journey
my legacy in machine learning including important summaries that I have made... |
|
Experimental |
| 6295 |
DimasDMM/transformers
Bunch of experiments with Transformers, BERT and GPT-2. Experiments include... |
|
Experimental |
| 6296 |
Prinxe304/AIChatModelWithStoringHistory
π€ Spring Boot AI chatbot backend with multi-session chat history using... |
|
Experimental |
| 6297 |
qthuy2k1/audio-instrument-classification
A program for training audio classification model |
|
Experimental |
| 6298 |
Rayyan9477/OCR-Image-to-text
Developed an OCR Image-to-Text application using Python and Streamlit,... |
|
Experimental |
| 6299 |
kldzj/vllm-transformers5
This repository provides a Docker image for vLLM with transformers>=5.0.0rc0... |
|
Experimental |
| 6300 |
Brahmendra-Ramoju/TrustLayer_AI
AI-powered content moderation API with toxicity detection and trust scoring... |
|
Experimental |