All Transformer Models

7,795 models ranked by quality score · Page 61 of 78

Showing 6001–6100 of 7,795
# Model Score Tier
6001 Sarhamam/ZetaFormer

Curriculum learning framework that uses geometrically structured datasets...

16
Experimental
6002 AndreaLolli2912/SemEval2026-EmoVA

SemEval-2026 Task 2: EmoVA. A Transformer-LSTM architecture with Set...

16
Experimental
6003 yass-ML/slm-few-shot-optimization

An empirical investigation into optimizing few-shot prompting strategies for...

16
Experimental
6004 shrutikakapade/Building-LLM-Pipelines-with-Hugging-Face-LangChain

An end-to-end guide to building robust LLM pipelines with Hugging Face and...

16
Experimental
6005 Ruiyang-061X/Awesome-MLLM-Uncertainty

✨A curated list of papers on the uncertainty in multi-modal large language...

16
Experimental
6006 viktor-shcherb/qk-sniffer

Capture sampled Q/K attention vectors from HF transformers into per-branch...

16
Experimental
6007 Scottcjn/pse-vcipher-collapse

Non-bijunctive attention collapse for LLM inference — POWER8 hardware AES...

16
Experimental
6008 chen-hao-chao/mdm-prime-v2

MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal...

16
Experimental
6009 kevincojean/llama-vim-adapter

Extends the llama.vim plugin to enable LLM autocompletion from third party...

16
Experimental
6010 yos-r/go_emotions

Multi-Label Emotion Classification from Text Using Deep Learning :...

16
Experimental
6011 HTLinh0604/invoice_ocr_craft_llama3

This CRAFT + Llama 3.1 pipeline automates invoice semantic extraction,...

16
Experimental
6012 GoJo-Rika/Text-Summarizer-Using-HuggingFace-Transformers

An end-to-end MLOps project for text summarization using the HuggingFace...

16
Experimental
6013 Franekskc/gemma3-qa-finetuning

Comparing Full Fine-Tuning, LoRA, and Layer Freezing for extractive QA on...

16
Experimental
6014 k-siddhartha-ai/multilingual-sentiment-analysis

Multilingual Sentiment Analysis using Hugging Face Transformers and Gradio

16
Experimental
6015 dineshsoudagar/llm-lab-from-scratch-to-fine-tuning

Comprehensive resources and scripts for training and fine-tuning Large...

15
Experimental
6016 tsvlgd/gpt-from-scratch

Decoder-only Transformer (GPT) language model coded from scratch in PyTorch

15
Experimental
6017 FreezB11/PsyDuck

A 60M-parameter LLM from scratch

15
Experimental
6018 yhinsson/airllm

🚀 Optimize memory for large language models, enabling 70B models on a 4GB...

15
Experimental
6019 stefanpietrusky/IEC

Repository for the article in the online magazine Level Up Coding

15
Experimental
6020 liyucheng09/llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

15
Experimental
6021 Asimo-o/blipren_release

🚀 Train any LLM with BLIPren, a flexible architecture that adapts to your...

15
Experimental
6022 zchoi/Multi-Modal-Large-Language-Learning

Awesome multi-modal large language paper/project, collections of popular...

15
Experimental
6023 sastpg/CoVo

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for...

15
Experimental
6024 germain-hug/NeurHal

Visual Correspondence Hallucination: Towards Geometric Reasoning (Under Review)

15
Experimental
6025 GodreignElgin/llm-comparision

Jupyter Notebook for LLM compression via quantization (INT8, INT4, FP16) and...

15
Experimental
6026 Samarth2001/LLM-Fine-tuning

Parameter-efficient fine-tuning experiments for 7B LLMs on consumer...

15
Experimental
6027 Vujavujavuja/Vsearcher

A sequential Large Language Model (LLM) agent system designed for automated...

15
Experimental
6028 tsinghua-fib-lab/PIGEON

[ACL 2025 Findings] Open-Set Living Need Prediction with Large Language Models

15
Experimental
6029 Ankur-krGarg/ChatBot

Transformer-based chatbot demo using Hugging Face's conversational models

15
Experimental
6030 sparkup/medical-llm-finetuning-alignment

Medical LLM fine-tuning and preference alignment using SFT and DPO, with...

15
Experimental
6031 Keytoyze/JumpCoder

Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via...

15
Experimental
6032 king/transformer-pooling

Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models

15
Experimental
6033 samkibe/Basics-of-model-development-with-Lightning-PyTorch-

One of a kind, hectic

15
Experimental
6034 Sammy-Lastre/BigChat

BigChat is a WinUI 3 chat application built for chatting with large language...

15
Experimental
6035 E1ims/math-vlm-finetune-pipeline

📐 Transcribe handwritten math into accurate LaTeX using a modular...

15
Experimental
6036 nikelborm/amd-amdgpu-rocm-ollama-gfx90c-ati-radeon-vega-ryzen7-5800H-arch-linux

Run Ollama on AMD Ryzen 7 5800H CPU with integrated GPU AMD ATI Radeon Vega...

15
Experimental
6037 murapadev/Phinetuning

A repository dedicated to finetuning phi2 models using advanced machine...

15
Experimental
6038 ensarakbas77/LIFT-UP-Project-Similarity-Analysis

A system that compares newly submitted projects with previously completed...

15
Experimental
6039 aarnetalman/nli-with-transformers

Fine-tune transformers with NLI data

15
Experimental
6040 dhruvjverma/NanoLanguageModel

A minimalist, high-performance GPT implementation in PyTorch, optimized for...

15
Experimental
6041 PurCL/muke

[COLM 2025] Official implementation of μKE - edit LLM knowledge while...

15
Experimental
6042 su-mana-s/Semantic-Communication

Semantic Message Extraction for Text Based Data With Deep Neural Nets

15
Experimental
6043 duoan/ReplicateAI

Recreating every milestone in Machine Learning and Artificial Intelligence

15
Experimental
6044 namjoo2006/Langchain-fundamental-in-model-component-access-data-using-api-keys

LangChain fundamentals for model components: learn to access language and...

15
Experimental
6045 quocnhut134/Finetuning-LLM-Model-for-Intent-Classification-in-Banking

Fine-tuning Large Language Models (LLMs) for precise customer intent...

15
Experimental
6046 SunayHegde2006/Air.rs

Air.rs 70B+ inference on consumer GPU, LLM inference in Rust

15
Experimental
6047 duongkstn/durationqa-vlsp-solution

VLSP 2025 Vietnamese temporalQA - DurationQA. First Rank Solution.

15
Experimental
6048 Nazmul0005/Nazmul0005

AI/ML Engineer | Published Researcher (MDPI 2024) | Building intelligent...

15
Experimental
6049 d-senyaka/letter-forge

From-scratch Transformer implementation for character-level understanding...

15
Experimental
6050 North-Shore-AI/crucible_ensemble

Multi-model ensemble voting strategies for LLM reliability

15
Experimental
6051 theSohamTUmbare/CLIP-model

Reimplementation of the CLIP model

15
Experimental
6052 amoghj98/neuroLIFT

This repository contains code associated with Neuro-LIFT: A Neuromorphic,...

15
Experimental
6053 leonhard-leung/IlokoFusionMT

Bidirectional Iloko ↔ English neural machine translation system using a T5...

15
Experimental
6054 shalakapadalkar16/viral-genome-classifier

Production-ready ML pipeline for viral genome classification from NCBI...

15
Experimental
6055 edersoncorbari/fine-tune-llm

Demonstrate how to fine-tune a pre-trained LLM

15
Experimental
6056 GabMartino/TransformerForDummies

Annotated implementation of vanilla Transformers to guide through all the...

15
Experimental
6057 david-xander/measuring-llm-knowledge

How much does an LLM know about my programming language?

15
Experimental
6058 zjysteven/Awesome-Byte-LLM

A curated list of papers and resources on byte-based large language models...

15
Experimental
6059 AlexeyMalafeev/ruformers

"Ruformers" - a list of popular transformer-based base models for...

15
Experimental
6060 JoyousJohn/deeply-researched

Open-source clone of OpenAI's Deep Research. Works with any transformer,...

15
Experimental
6061 Ultron09/Numpy-Transformer

A pure NumPy implementation of GPT built from scratch for educational...

15
Experimental
6062 VincenzoManto/llmtrim

A library for trimming tokens in encoding and decoding in LLM (Large...

15
Experimental
6063 1337hero/rx7900xtx-llama-bench-rocm

Benchmark script for llama.cpp & results for AMD RX 7900 XTX

15
Experimental
6064 lennor-tan/openrouter-free-model

🌐 Explore and manage free models on OpenRouter effortlessly with our web...

15
Experimental
6065 Thopterek/ChessBenchmark

Aleph Alpha and LEVEL3, LLM benchmark

15
Experimental
6066 SyedAkramaIrshad/transformer-grokking-lab

Tiny Transformer grokking experiment with live notebook visualizations.

15
Experimental
6067 caiomadeira/llama2-psp

Llama 2 inference in C on the PlayStation Portable (PSP).

15
Experimental
6068 harpertoken/memoraxx

LLaMA-style models with memory persistence.

15
Experimental
6069 spignelon/TrustLink_CyberHackathon

TrustLink: Detect and safeguard against deceptive URLs. Real-time threat...

15
Experimental
6070 dineshkgn/deep-learning-lab

Reproducible deep learning experiments: tabular transformers, optimization,...

15
Experimental
6071 DolbyUUU/DeepEnlighten

Pure RL to post-train base models for social reasoning capabilities....

15
Experimental
6072 ShraddhaSharma24/Natural-Language-Processing

A comprehensive NLP repository covering fundamentals, preprocessing,...

15
Experimental
6073 1337hero/rx7900xtx-llama-bench-vulcan

Benchmark script for llama.cpp & results for AMD RX 7900 XTX - using Vulkan

15
Experimental
6074 Reason-Wang/NAT

[NAACL 2025] The official implementation of paper "Learning From Failure:...

15
Experimental
6075 nsarrazin/chessformer

Experiments in chess & transformers

14
Experimental
6076 SafeRL-Lab/TeaMs-RL

[TMLR] TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via...

14
Experimental
6077 ainize-team/free-llama-api

Run the Meta Llama 3.2 API for free without your own GPU. We always support the latest model 🧡

14
Experimental
6078 Yousifus/rlhf_loop_humain

RLHF Loop System - Learning project with monitoring dashboard, drift...

14
Experimental
6079 viktor-shcherb/qk-pca-analysis

PCA analysis of Q/K attention vectors to discover position-correlated...

14
Experimental
6080 lakshayGoyal1188/text_to_sql

A schema-aware Text-to-SQL system using a locally hosted Mistral LLM...

14
Experimental
6081 spatialft/spatialft.github.io

LoRA fine-tuning of LFM2.5-1.2B to improve spatial reasoning on StepGame —...

14
Experimental
6082 krishnakoushik225/ecg-peft-benchmark

Benchmarking PEFT (LoRA vs adapters) for ECG segment classification using...

14
Experimental
6083 yamanobora/Android-Offline-Meeting-Recorder

Android app for offline speech recognition and AI meeting summarization...

14
Experimental
6084 Sachin-0001/ChatCut

ChatCut is a text summarizing tool built on Bidirectional Auto Regressive...

14
Experimental
6085 xkiwilabs/llm-inference-hub

A reproducible LLM inference stack built on vLLM + LiteLLM, designed for...

14
Experimental
6086 FromZeroToFanatic/LLM_Practical_Implementation_Demo1

Hands-on large-model learning roadmap, stage 1: overview of large-model technology (essential fundamentals) and practice

14
Experimental
6087 MuthusaravananS/PINPOINT

Pipeline for discovering novel protease inhibitors at the plant-pathogen interface.

14
Experimental
6088 Kaden-Schutt/hipfire

RDNA-native LLM inference engine in Rust. 59 tok/s Qwen3-8B on RX 5700 XT —...

14
Experimental
6089 Shreya831/multimodal-ai-visual-analyzer

Multimodal AI system that detects objects in images and answers questions...

14
Experimental
6090 gatorduck/Creating_Custom_Decoder_Transformer

Custom decoder Transformer that treats a patient's medical journey like a...

14
Experimental
6091 NKU-MetautoAI/awesome-large-vision-language-models

Advances in recent large vision language models (LVLMs)

14
Experimental
6092 fake-it0628/jailbreak-defense

Jailbreak Defense System based on Hidden State Causal Monitoring for LLMs

14
Experimental
6093 fajieyuan/recommendation_transfer_learning_pretraining

Pre-training and Transfer learning papers for recommendation

14
Experimental
6094 vishvaRam/Data-Prep-for-LLM-fine-tuning

This repository helps prepare datasets for fine-tuning Large Language Models...

14
Experimental
6095 rajatady/Inference-Stack

Production-grade LLM inference API built from scratch. NestJS gateway +...

14
Experimental
6096 Gaolingx/llama.cpp-Launcher

Run llama.cpp quickly and conveniently.

14
Experimental
6097 YahiaGrdh/vibe-agents

Coordinate AI agents to break down tasks, plan workflows, and delegate...

14
Experimental
6098 PratapShashwat/End-to-End-LLM-Fine-Tuning

Train Gemma to summarize documents.

14
Experimental
6099 Baci-Ak/book-recommender

LLM - Book Recommendation system with LLM

14
Experimental
6100 btboilerplate/sms-spam-classification-transformer

SMS spam classification using a Transformer-based model built with...

14
Experimental