All Transformer Models

7,795 models ranked by quality score · Page 54 of 78

Showing 5301–5400 of 7,795
# Model Score Tier
5301 LessUp/llm-speed

CUDA Kernel Library for LLM Inference: FlashAttention, HGEMM, Tensor Core...

19
Experimental
5302 hadirsa/spring-data-llm-adapter

A Spring Boot library that extracts metadata from JPA entities (@Entity,...

19
Experimental
5303 Sairamg18814/GUBBALA-V3-TRUE

Revolutionary Self-Evolving Language Model - 100% self-contained AI trained...

19
Experimental
5304 Charlotte322/amazon-translator-extension

一款专为亚马逊购物场景深度优化的浏览器插件,全程调用本地微调大模型,实现中英实时自动互译。核心聚焦亚马逊买家对话框全场景——覆盖客服选项、商品卡片信息、买...

19
Experimental
5305 telota/imagines-nummorum-vlm-data-extraction

A computer vision system for automated analysis of index cards from a...

19
Experimental
5306 CastorYu/train-hybrid-llm-from-scratch

A simplistic script for training your own hybrid llm (using autoregressive...

19
Experimental
5307 ethicalabs-ai/BlossomTuneLLM

Federated Supervised Fine-Tuning for Small Language Models (SLMs)

19
Experimental
5308 xndien2004/ViGSA

ViGSA: A Multi-Task Aspect-Based Sentiment Analysis Model with Auxiliary...

19
Experimental
5309 code-vygr/local-llm-ocr-ollama

🖼️ Extract text from images locally using Ollama's LLMs—100% free, offline,...

19
Experimental
5310 sumanthsaivenkat1113/inspiregen-ai-content-generator

A modular, production-ready AI content generation app using React, Vite, and...

19
Experimental
5311 b14ucky/Taco-LLMingway

Custom GPT Transformer architecture built from scratch in PyTorch. Trained...

19
Experimental
5312 Omikrone/Mnemos

Mnemos is a mini-LLM based on Transformers, designed for training and...

19
Experimental
5313 ShunyaAI/huggingface-advanced-search

A self-hostable UI for an enhanced Hugging Face Hub experience. Provides...

19
Experimental
5314 zTgx/DeepText

A GPT Model To Generate Text

19
Experimental
5315 ambv231/tinyllama-coreml-ios18-quantization

Quantize TinyLlama-1.1B-Chat from PyTorch to CoreML (float16, int8, int4)...

19
Experimental
5316 michael-borck/loco-bench

Systematic benchmarks of quantized small language models on consumer hardware

19
Experimental
5317 MirrorDNA-Reflection-Protocol/SCD-Protocol

Structured Contextual Distillation — compress AI memory without losing...

19
Experimental
5318 SaraVaez/Transformer-From-Scratch

Educational PyTorch implementation of Transformer architecture with...

19
Experimental
5319 thejatingupta7/LLMCA

🤖 Large Language Models Acing Chartered Accountancy: Introduces CA‑Ben 📈, a...

19
Experimental
5320 JennEYoon/ECG-transform

Customizing AI models for wearable ECG prototype - quantified health.

19
Experimental
5321 antononcube/Raku-WWW-LLaMA

Raku package that provides access to the algorithms/models of (the...

19
Experimental
5322 PeymanKh/delisio_mvp

CNN food recognition combined with LLM recipe generation using LangGraph...

19
Experimental
5323 arq105/llm-speech-summarization

📘 Summarize speeches and documents swiftly with advanced techniques using...

19
Experimental
5324 kuloud/DFN-next-client

A web application that analyzes the semantic similarity between text and...

19
Experimental
5325 Training-Datasmith/olmo3-code-150m-pretrain

Pre-training a ~150M parameter code-specialized language model using OLMo 3...

19
Experimental
5326 k4yt3x/ollamem

Accurately estimate the memory required to run GGUF models and the maximum...

19
Experimental
5327 mie-lab/mobility_generation

MobilityGen: DDPM for human mobility behavior

19
Experimental
5328 MaoJianwei/llama.cpp-arm-armv7l-Raspberry-Pi-Release-Prebuild

On the Releases page, you can download pre-built binaries for arm, armv7l...

19
Experimental
5329 rafiq15/health-ai-assistant

🏥 AI-powered health assistant using fine-tuned LLaMA 3.2 model with Spring...

19
Experimental
5330 nehalvaghasiya/RecipeBot

AI chatbot that provides recipe suggestions and cooking instructions based...

19
Experimental
5331 duck4i/retro-ui

Retro Llama

19
Experimental
5332 gkietle/sequolkit

Enhancing Text-to-SQL Capabilities of Open-Source Small Language Models via...

19
Experimental
5333 KazKozDev/dataset-creator

Generating high-quality datasets for model distillation.

19
Experimental
5334 kbzh2558/dialogue_emotion_classification_via_multimodality

Multimodal Emotion Classification

19
Experimental
5335 ankraj1234/MediGuide

Comparing QLoRA, Prompt & Prefix Tuning on Mistral-7B for medical...

19
Experimental
5336 tzhengtek/saute

SAUTE is a lightweight transformer-based architecture adapted for dialog modeling

19
Experimental
5337 zzmtsvv/ad-gta

Grouped-Tied Attention by Zadouri, Strauss, Dao (2025).

19
Experimental
5338 Sid7on1/ViT-Vision-Transformer

ViT-ClassiPy is a lightweight Vision Transformer built from scratch using...

18
Experimental
5339 daniel-was-taken/AI-Powered-Academic-Research-Assistant

An AI-Powered Academic Research Assistant (apara.) with document scraping...

18
Experimental
5340 PRITHIVSAKTHIUR/bellatrix-tiny3-1b-webgpu

webgpu based llm chatbot, try on chrome browsers

18
Experimental
5341 minorprojects/Stable-CAT

Stable Causal Attention Transformer(StableCAT) is a tiny, minimal modern ...

18
Experimental
5342 andredisa/AI_WebScraper

🕵️‍♂️ Welcome to AI Web Scraper Agent, a powerful and user-friendly app to...

18
Experimental
5343 kikirizki/transformer

Minimalistic PyTorch implementation of transformer

18
Experimental
5344 pedrocurvo/HAET

HAET: Hierarchical Attention Erwin Transolver is a hybrid neural...

18
Experimental
5345 R2D2-08/turmachpy

A python package for simulating a variety of Turing machines.

18
Experimental
5346 anto18671/image-to-dense-caption

Generate vivid, human-like captions for portrait images using the...

18
Experimental
5347 daspartho/DistillClassifier

Easily generate synthetic data for classification tasks using LLMs

18
Experimental
5348 giankev/Ancient-to-Modern-Italian-Automatic-Translation

Finetuning and evaluating LLMs on Ancient-to-Modern Italian translation task.

18
Experimental
5349 sukrucildirr/factqa-atropos

A factual question-answering environment designed to work both standalone...

18
Experimental
5350 CESOIA/transformer-surgeon

Transformer models library with compression options

18
Experimental
5351 maettuu/Thesis-on-Test-Generation-Using-LLMs

Repository for Master's Thesis 2025

18
Experimental
5352 Aditya22b0053/Text-Summarizer

Dual-mode text summarizer using BART (abstractive) and a hand-built TF-IDF +...

18
Experimental
5353 Jourdelune/Transformer

My implementation of the transformer architecture from the paper "Attention...

18
Experimental
5354 Arpangpta/AlgoD-CodeStructure-Identifier

Identify algorithmic structures in source code using Abstract Syntax Trees...

18
Experimental
5355 hermanpetrov/KeyBERT-Estonian-setup

This is setup for Estonian text use of keyword extraction with KeyBERT. The...

18
Experimental
5356 alemoraru/exceed-project-overview

Reproduction package for a framework that uses LLMs to generate tailored,...

18
Experimental
5357 CAI991108/Machine-Learning-and-Language-Model

This project explores GPT-2 and Llama models through pre-training,...

18
Experimental
5358 RobinSmits/Schaapje

Schaapje - A Dutch Small Language Model

18
Experimental
5359 mdxabu/nl2sql

Leveraging Open-Source AI Models for Natural Language to SQL Query...

18
Experimental
5360 atasoglu/turkish-llava-notebooks

A useful collection of notebooks for quantization, fine-tuning, and...

18
Experimental
5361 BramVanroy/lt3-2019-transformer-trainer

Transformer trainer for variety of classification problems that has been...

18
Experimental
5362 socialx-analytics/transformerX

Transformer Library Helper by SocialX

18
Experimental
5363 jElhamm/Article-Mathematical-Modeling-of-the-Short-Circuit-Mode-of-a-Voltage-Transformer

"Simulations for the paper Mathematical Modeling of the Short Circuit Mode...

18
Experimental
5364 Jayesh-Dev21/FinanceWise

AI web application, using custom data-set to train and build a LLM from...

18
Experimental
5365 Anonym0usWork1221/JaraConverse-TransformersBased

This JaraConverse model is a cutting-edge Transformer-based supervised...

18
Experimental
5366 h3nock/ai-deep-dive

An open-source interactive learning platform for understanding LLMs through...

18
Experimental
5367 istat-methodology/GenAI-OS-2025

Material for our "Transformer-based Models for Official Statistics: A...

18
Experimental
5368 AInnovateLab/watermark-collision

[NAACL'25 Findings] Lost in Overlap: Exploring Logit-based Watermark...

18
Experimental
5369 graylan0/Aeternitas

Intercommunication loop between Llama model and GPT-Neo.

18
Experimental
5370 hewei2001/ReachQA

[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs

18
Experimental
5371 nath54/ChunkedDiffusion_LLM

Chunked Diffusion LLM is an innovative machine learning project exploring a...

18
Experimental
5372 MdAliAhnaf/Bengali-Sentiment-Analysis-ML_Fine-Tune-Llama-3.1

Trained and evaluated traditional ML models, fine-tuned Dolphin 2.9.4 based...

18
Experimental
5373 zsychina/verl-python

Support `muliti-step` python call in verl. formation:...

18
Experimental
5374 BlocTheWorker/llm-modding-guide

A general guide for modders who want to use LLM to mod their favorite games

18
Experimental
5375 tdawe1/multi-llm-translator

An automated Python translation assistant that monitors for jobs, processes...

18
Experimental
5376 EnricoBenedetti/GarfieldRetrieve

Garfield Strip retrieval system

18
Experimental
5377 RahulSaini02/sentiment-analysis-with-transformers

A sentiment analysis project using Hugging Face transformers, fine-tuned on...

18
Experimental
5378 dan0nchik/llm-attack-kit

A collection of LLM attacks

18
Experimental
5379 ParthaPRay/Sycophancy_in_LLM_model

This repo shows the coding of sycophancy in LLMs as Bayesian-Latent model

18
Experimental
5380 johntsi/preast_qa

Complex Question Answering by Pairwise Passage Ranking and Answer Style...

18
Experimental
5381 erraji-jo/LLM-Finutune-based-on-customData

The project aims to showcase the process of fine-tuning LLMs on...

18
Experimental
5382 acaklovic/Thesis-Comparison-of-feature-extractors-using-ASReview

Project to examine if state-of-the-art feature extractors (i.e.,...

18
Experimental
5383 nexageapps/LLM

Hands-on notebooks to understand and build Large Language Models (LLMs) from...

18
Experimental
5384 agustinbrusco/tokens-thorugh-lang

Analysis of LLM token representation of texts in different languages

18
Experimental
5385 samujjwaal/multilingual-chatbot

A Multilingual Chatbot for Pizza Ordering

18
Experimental
5386 sirmammingtonham/data2text

Code for IJCoL 7 Special Issue Paper - Improving Data-to-Text Generation via...

18
Experimental
5387 siraben/llama-bot

Discord bot for interacting with the LLaMA language model

18
Experimental
5388 FunnySaltyFish/best_llm

Vote the Best LLM by yourself! 票选你最喜欢的大语言模型

18
Experimental
5389 Shengwei-Peng/Chinese-News-Summarization

A project for Chinese news summarization using state-of-the-art pre-trained...

18
Experimental
5390 Rohan-Thoma/Coding-attention-from-scratch

This repository consists code for executing attention mechanism from scratch...

18
Experimental
5391 Pavansomisetty21/LlamaNLP-Unsloth-Next-Gen-Text-Processing-with-Llama-and-Unsloth

In this we generate NER ,Question Answering and text generation using...

18
Experimental
5392 amai-gsu/LM-Meter

Official code repo of SEC'25 paper: lm-Meter: Unveiling Runtime Inference...

18
Experimental
5393 Nolhan42789/AI-Studio

🤖 Enhance your workflow with AI-Studio, a Streamlit app that offers tools...

18
Experimental
5394 Daddy-Myth/Fine-tuning-Flan-T5-RLHF

Aligning FLAN-T5 with Reinforcement Learning from Human Feedback (RLHF) for...

18
Experimental
5395 Devanik21/HAG-MoE

HAG-MoE introduces a revolutionary approach to artificial intelligence by...

18
Experimental
5396 tahmidmir/Graph-RAG

Fine-tuning GPT-2 on domain-specific articles related to skin cancer, using...

18
Experimental
5397 plandes/lmtask

Inferencing and Training Large Language Model Tasks

18
Experimental
5398 kaustpradalab/LLM-sycophancy

[AAAI'26 Main🎉] Official code of "When Truth Is Overridden: Uncovering the...

18
Experimental
5399 chrislemke/deep-martin

Text simplification for a better world: Deep-Martin Transformer 🤗

18
Experimental
5400 mahadi-nahid/NormTab

[EMNLP 2024] NormTab: Improving Symbolic Reasoning in LLMs Through Tabular...

18
Experimental
« Prev 1 2 3 52 53 54 55 56 76 77 78 Next »