All Transformer Models

7,795 models ranked by quality score · Page 63 of 78

Showing 6201–6300 of 7,795
# Model Score Tier
6201 Papasmurf79/RestaurantRecommenderLLM

With this project, I created an end-to-end AI pipeline that transforms a...

14
Experimental
6202 craigrmc/alignment

An essay to help facility the transition to AGI and then ASI. A traditional...

14
Experimental
6203 sitta07/LLM-Regex-DataPrep

A data preprocessing pipeline for TCAS admission data. This project...

14
Experimental
6204 wiedersehne/Paramixer

Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product...

14
Experimental
6205 madalinioana/intent-qualification

Hybrid company qualification pipeline using LLM intent parsing, vector...

14
Experimental
6206 lucasfrag/automated-dataset-translator

Automatically translate structured datasets (CSV, JSON, JSONL, TSV, Parquet)...

14
Experimental
6207 Gyldenn/storywriter

Fine-tuning Mistral 7B with LoRA (QLoRA 4-bit) to generate Shakespearean...

14
Experimental
6208 nmn-pandey/brain-tumour-segmentation

Code for automated brain tumor segmentation from MRI scans using CNNs with...

14
Experimental
6209 saadtariq-learning/llms_with_google_cloud

Leveraged the power of Google Cloud's Vertex AI platform to develop advanced...

14
Experimental
6210 G-B-KEVIN-ARJUN/size-precision-slm-bench

is it better to run a Tiny Model (2B-4B) at High Precision (FP16/INT8), or a...

14
Experimental
6211 Sparsh-2007/GPT-From-Scratch

Implementation of a GPT-style LLM from scratch, following "Build a Large ...

14
Experimental
6212 Rekhii/Machine-Learning

Daily ML practice notebooks covering tabular data, deep learning, and...

14
Experimental
6213 buhsnn/Vision-Language-Model

Vision-language model combining a ResNet18 vision encoder with a GPT-2...

14
Experimental
6214 likitha-am/my_newcityfrnd

MyNewFrnd is a web-based platform that helps people relocating to a new city...

14
Experimental
6215 kashan-alam/ai-backend-fastapi

AI-powered backend API built with FastAPI, JWT authentication, rate...

14
Experimental
6216 Mainframework/Quanta

Convert and quantize llm models

14
Experimental
6217 navanodonavan/Transformer_Based_Follow_Through_Prediction_ML_MSAAI590

Transformer based follow through prediction for intraday trading

14
Experimental
6218 Adwerse/Mini_LLM

🧠 Transformer built from scratch β€” RoPE, SwiGLU, KV-Cache, Flash Attention....

14
Experimental
6219 LinukPerera/JEPA-Explainer

A practical explainer of JEPA, Meta AI’s Joint Embedding Predictive...

14
Experimental
6220 Datta0/nanoformer

A small repo to experiment with Transformer (and more) architectures.

14
Experimental
6221 CosmonautCode/Tiny-Local-LLM-System-Expanded

A lightweight, self-contained Python project for running local LLM...

14
Experimental
6222 Eddie-oss-369/AI_Emotional_Mirror

πŸ’¬ Reflect on your emotions with AI Emotional Mirror, a tool that clarifies...

14
Experimental
6223 movieonlyemail4/vscode-local-llm-

Run local AI models in VS Code with automatic model detection, server start,...

14
Experimental
6224 rudyon/pipeline

Training pipeline for LLMs in PyTorch.

14
Experimental
6225 mlengineershub/LazyTeacher

End-To-End Machine Learning Project for Automatic Grades for Student Essays

14
Experimental
6226 garimamittal13/csai_S26

Neuroimaging preprocessing, brain decoding, and visual brain encoding using...

14
Experimental
6227 sahilkulkarni07/Merchant-Risk-ML-Pipeline

End-to-end BNPL merchant risk assessment pipeline with portfolio aggregation...

14
Experimental
6228 Root1V/llm-security

JWT-based authentication and authorization gateway for locally deployed LLM...

14
Experimental
6229 LazaUK/HuggingFace-BAAI-BGERerankerv2m3

BGE Reranker v2 m3 demo with Hugging Face transformers for local and Azure cloud use.

14
Experimental
6230 rinoScremin/Open_Cluster_AI_Station_beta

High-performance distributed matrix computation for AI workloads. Supports...

14
Experimental
6231 kantkrishan0206-crypto/AlignGPT

β€œThis project implements a mini LLM alignment pipeline using Reinforcement...

14
Experimental
6232 ikun-llm/ikun-V

ε€šζ¨‘ζ€θ§†θ§‰θ―­θ¨€ζ¨‘εž‹ | Vision-Language Model πŸ‘οΈ

14
Experimental
6233 RAHB-REALTORS-Association/transcriber-describer

Transcribes videos and describes them with OpenAI APIs or local models.

14
Experimental
6234 JIA-Lab-research/TGDPO

[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing...

14
Experimental
6235 itxmjr/LLM-From-Scratch

A step-by-step Guide: Build a GPT-like LLM From Scratch using PyTorch

14
Experimental
6236 GAIR-Lab/LLMPopcorn

LLM-assisted Popular Micro-video Creation

14
Experimental
6237 Jake1402/Torch-GPTs

A way for users to train, and interact with their own mini language models...

14
Experimental
6238 Sheryar-bit/Lang-Chain

LangChain / Machine Learning / NLP, practice repository

14
Experimental
6239 SkywardAI/kimchima

Customise neural nets, knowledge distillation and transfer learning collection

14
Experimental
6240 kazuki-irie/hybrid-memory

Official repository for the paper "Blending Complementary Memory Systems in...

14
Experimental
6241 perlathebian/multilingual-ai-news-summarizer

AI-powered web app for multilingual news summarization. Processes...

14
Experimental
6242 plaban/fllm-aut25

This is the webpage repository of Foundation of LLM course offered by...

14
Experimental
6243 M3-IT/YING-VLM

Vision Large Language Models trained on M3IT instruction tuning dataset

14
Experimental
6244 dlukeh/transformer-deep-dive

A deep descent into the neural abyss β€” understanding transformers through...

14
Experimental
6245 arvind207kumar/Time-Cross-Adaptive-Self-Attention-TCSA-based-Imputation-model-

Time-Cross Adaptive Self-Attention (TCSA) model for multivariate Time...

14
Experimental
6246 sghosh-04/notes_generator

End-to-end AI pipeline β€” speech-to-text, NLP summarization, topic...

13
Experimental
6247 RealTapeL/Xiao_i_Chat

η”¨δΊŽθŒδΈšζ•™θ‚²ι’†εŸŸηš„ε€§θ―­θ¨€ζ¨‘εž‹

13
Experimental
6248 Thatimmorsit/llm-document-summarizer

A Python-based tool for summarizing long documents using state-of-the-art...

13
Experimental
6249 nehamaheshh/Reasoning-style-fine-tuning-PEFT

LoRA vs QLoRA fine-tuning on CommonsenseQA measuring accuracy, GPU memory,...

13
Experimental
6250 emnaxbenjazia/Text-Summarization-NLP-Project

This is a learning project. It follows the same general steps and structure...

13
Experimental
6251 StyrbjornKall/TRIDENT_application

Source code for the web application associated with "Transformers enable...

13
Experimental
6252 arjunravi26/rag_news_extractor

A rag project that aims to summarize the news and also clarify questions...

13
Experimental
6253 sharpsalt/Captionforge-Multimodal-Image-Captioning-System

This PyTorch-based image captioning model uses ResNet-50 encoder and...

13
Experimental
6254 sitammeur/readerlm-litserve

Leverage Reader-LM's capabilities using LitServe.

13
Experimental
6255 Shravani018/interpreting-transformer-hallucinations

Mechanistic interpretability of transformer hallucinations via attention...

13
Experimental
6256 curiousbrutus/fNIRS-Vise

NIRS-VIS is a Master Thesis Project for decoding visual stimuli from fNIRS...

13
Experimental
6257 Jac-Zac/Thesis

Repository thesis on HTR on museum's biological artifacts labels

13
Experimental
6258 shreydan/simpleVLM

building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2...

13
Experimental
6259 miriusz6/Masters-Thesis

Code for my Master's Thesis: Enhancing Fossil Identification: A Study on...

13
Experimental
6260 alantomanu/Autograde

AutoGrade is a modern web application that automates the grading of...

13
Experimental
6261 Abhinaykotla/news-summarization-T5-Transformer

The project's goal is to use different Deep Learning techniques - T5...

13
Experimental
6262 UtpaL2102/Dark-Pattern-detector

DPdetector is a browser extension that detects dark patterns in ads without...

13
Experimental
6263 gazelle93/llm-fine-tuning-sft-lora-qlora

Practical examples for fine-tuning large language models (LLMs) with SFT,...

13
Experimental
6264 luhuim/Transformers_project

This is my ongoing master thesis project. It is a transformers model.

13
Experimental
6265 stelaras36/OCRfixer

Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model

13
Experimental
6266 Parth844/AI_pdf_to_Epub

AI-powered PDF to EPUB conversion engine with LLM-based chapter detection...

13
Experimental
6267 omer-gulsoy/ML-ClassicalMusicEra

🎻 AI project classifying Classical Music eras (Baroque, Classical, Romantic,...

13
Experimental
6268 milsab/TETUP

TETUP: Code for "Towards Explainable Temporal User Profiling with LLMs"...

13
Experimental
6269 samratrajsharma/LLMs

Experimental implementations of core Large Language Model components...

13
Experimental
6270 Prince445-hub/mlx-drifting-model

πŸŒ€ Evolve generative models with a drifting field for efficient single-step...

13
Experimental
6271 joetansey1/grafana-anomaly-detector

LLM-Powered SLO Anomaly Detector

13
Experimental
6272 gayatriiv/novelty-detector

transformer-based NLP system to detect semantic contradictions and novelty...

13
Experimental
6273 OleksandrZadvornyi/marketcap-forecasting

End-to-end market capitalization forecasting system using...

13
Experimental
6274 RWKV-Wiki/rwkv-wiki.github.io

RWKV Wiki website (archived, please visit official wiki)

13
Experimental
6275 j-f1/LLM-Playground

Play with LLaMA & GPT-3!

13
Experimental
6276 StarDust130/kaizen_ai

βš‘πŸ€– AI-Powered LinkedIn Post Generator That Slaps πŸš€πŸ”₯✨

13
Experimental
6277 xlisp/learn-langgraph

learn langgraph step by step

13
Experimental
6278 gideon-ogunbanjo/WikiMindAI

WikiMindAI - Wikipedia-based Mindful AI Search Engine

13
Experimental
6279 hendrik-spl/sustainable-llm-knowledge-distillation

Resource-efficient LLM distillation: Improving sustainability and reducing...

13
Experimental
6280 VITA-Group/Data-Efficient-Scaling

[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao...

13
Experimental
6281 detail-novelist/novelist-triton-server

Deploy KoGPT with Triton Inference Server

13
Experimental
6282 robflynnyh/hydra-linear-attention

Implementation of: Hydra Attention: Efficient Attention with Many Heads...

13
Experimental
6283 genglinliu/UnknownBench

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions...

13
Experimental
6284 priyansh4320/Abstractive-Text-Summarization-Enhancing-Sequence-to-Sequence-Models-Using-Word-Sense-Disambiguatio

This repository contains code and resources for abstractive text...

13
Experimental
6285 Ojas025/almostGPT

A GPT implementation for training and generating text on custom datasets

13
Experimental
6286 madboy482/FakeNewsDetection

A deep learning-based fake news detection system leveraging BERT and...

13
Experimental
6287 yulo-dev/cf_ai_baseball

AI-powered baseball statistics assistant built with Cloudflare Workers AI...

13
Experimental
6288 RaphaelMouravieff/TabStruct

Code for ACL 2025 paper: "Structural Deep Encoding for Table Question Answering"

13
Experimental
6289 m15kh/Transformer_From_Scratch_Pytorch

Implementation of Transformer from scratch in PyTorch, covering full...

13
Experimental
6290 SaaranshDx/TinyLM

TinyML v1 β€” 3.1M-parameter AI, no frameworks, fully from scratch.

13
Experimental
6291 sakhileln/rope-pytorch

RoPE Playground – Rotary Positional Embeddings in PyTorch

13
Experimental
6292 diixo/build-gpt

A PyTorch library with educational re-implementation of GPT-models: GPT2, LLaMA

13
Experimental
6293 sumony2j/SeedGPT-22M

SeedGPT is a lightweight, 22M-parameter Transformer LLM for efficient text...

13
Experimental
6294 khaHesham/My-Machine-Learning-Journey

my legacy in machine learning including important summaries that I have made...

13
Experimental
6295 DimasDMM/transformers

Bunch of experiments with Transformers, BERT and GPT-2. Experiments include...

13
Experimental
6296 Prinxe304/AIChatModelWithStoringHistory

πŸ€– Spring Boot AI chatbot backend with multi-session chat history using...

13
Experimental
6297 qthuy2k1/audio-instrument-classification

A program for training audio classification model

13
Experimental
6298 Rayyan9477/OCR-Image-to-text

Developed an OCR Image-to-Text application using Python and Streamlit,...

13
Experimental
6299 kldzj/vllm-transformers5

This repository provides a Docker image for vLLM with transformers>=5.0.0rc0...

13
Experimental
6300 Brahmendra-Ramoju/TrustLayer_AI

AI-powered content moderation API with toxicity detection and trust scoring...

13
Experimental
« Prev 1 2 3 61 62 63 64 65 76 77 78 Next »