All Transformer Models

7,795 models ranked by quality score · Page 41 of 78

Showing 4001–4100 of 7,795
# Model Score Tier
4001 eladwf/adaptive-multirate-transformers

DSP-inspired multirate wrappers for GPT with adaptive hyperparameters and...

23
Experimental
4002 naveenkumar123/llm-training-project

Different LLM model usage projects

23
Experimental
4003 Jorffy/NoteMR

[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with...

23
Experimental
4004 visresearch/SDMPrune

The official implementation of "SDMPrune: Self-Distillation MLP Pruning for...

23
Experimental
4005 inuwamobarak/stable-diffusion

Implementing a diffusion framework with Hugging Face. Stable diffusion...

23
Experimental
4006 AIDC-AI/Wings

The code repository for "Wings: Learning Multimodal LLMs without Text-only...

23
Experimental
4007 okefemi12/ecommerce-recommender-system

Production-grade Hybrid Recommender System combining Sequential Transformers...

23
Experimental
4008 Kyle1668/LLM-TTA

Code for the paper: Improving Black-box Robustness with In-Context Rewriting

23
Experimental
4009 koc-lab/graph-teacher

The repository for the "GraphTeacher: Transductive Fine-Tuning of Encoders...

23
Experimental
4010 Mattbusel/LLMTokenStreamQuantEngine

A low-latency, C++-based simulation engine that ingests token streams from...

23
Experimental
4011 baohuyvanba/Vision-Zephyr

Vision-Zephyr: a multimodal LLM for Visual Commonsense Reasoning—CLIP-ViT +...

23
Experimental
4012 insoochung/transformer_bcq

BCQ tutorial for transformers

23
Experimental
4013 U4RASD/dalla-model-training

Dalla training recipe using Huggingface SFT trainer

23
Experimental
4014 iiis-ai/IterativeQuestionComposing

[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing...

23
Experimental
4015 The-Swarm-Corporation/MoF

This work introduces Flow Matching Mixture of Experts (FM-MoE), a framework...

23
Experimental
4016 SatvikPraveen/PyTroch-Mastery-Hub

A comprehensive PyTorch portfolio demonstrating advanced ML implementations...

23
Experimental
4017 rdemarqui/llm_complaint_management

Multi-label Classification of Complaints with LLM

23
Experimental
4018 albertan017/LLM4Binary

Collect thoughts, suggestions and resources to build an LLM model for binary

23
Experimental
4019 vizzies/NASA-Semantic-Search-Engine-for-Scientific-Literature

Intelligent scientific literature search and analysis application that uses...

23
Experimental
4020 llap4585/T5-Refiner-DomainFocus

Derived from Medical Literature Development: Injecting domain expertise into...

23
Experimental
4021 ictnlp/SiLLM

SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a...

23
Experimental
4022 sukrazit/nlp-playground-notebooks

🚀 Explore NLP with hands-on notebooks for RNNs and Transformers, featuring...

23
Experimental
4023 AlinaMustaqeem/open-LLM

Kickstart with LLMs

23
Experimental
4024 tushar2704/LLM_ChatBot_streamlit

Streamlit application called T-BOT, using HugChat

23
Experimental
4025 sovit-123/lm_sft

Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised...

23
Experimental
4026 piratheon/LB-llm_training_scripts

A bunch of script to train your own offsec LLM

23
Experimental
4027 AKC23/Harnessing-LLMs-over-transformer-models-for-detecting-Bengali-depressive-text-A-comprehensive-study

Harnessing large language models over transformer models for detecting...

23
Experimental
4028 kyegomez/ai-reading-list

This collection brings together the highest-signal research papers in modern...

23
Experimental
4029 NLPForUA/ZNO

Structured test tasks and model tuning scripts for multiple subjects from...

23
Experimental
4030 Brokttv/Transformers-on-Majority

An empirical study that builds on Merrill et al. theoretical construction to...

23
Experimental
4031 MuhammadTahaNasir/llm-learning-hub

A hands-on collection of practical notebooks for learning and building with...

23
Experimental
4032 NamrataThakur/Large_Language_Model_From_Scratch_Implementation

Implementing an LLM from scratch block-by-block using PyTorch

23
Experimental
4033 eduardoleao052/Transformer-from-scratch

Educational Transformer from scratch (no autograd), with forward and backprop.

23
Experimental
4034 seonglae/llama2gptq

Chat to LLaMa 2 that also provides responses with reference documents over...

23
Experimental
4035 piratheon/LiquidBunny-llm

A bunch of script to train your own offsec LLM

23
Experimental
4036 SwethaMagesh/sankshepika-mlpro

NLP based Legal document summariser. Takes large documents with complex...

23
Experimental
4037 AswaniSahoo/weather-transformer-scratch

Physics-aware Vision Transformer for weather forecasting built from scratch...

23
Experimental
4038 Anmol25/NewsDigest

An AI-powered full-stack news platform delivering personalized...

23
Experimental
4039 ossirytk/llm_resources

Information and resources on everything related about running large language...

23
Experimental
4040 shreydan/VisionGPT2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model...

23
Experimental
4041 vitality-vis/vitality-vis.github.io

Promoting Serendipitous Discovery of Academic Literature with Transformers &...

23
Experimental
4042 Dangocan/comfyui_glm_ocr

ComfyUI custom node to run GLM-OCR locally — text, formula, and table...

23
Experimental
4043 cmavro/PackLLM

Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization

23
Experimental
4044 Letian2003/MM_INF

An efficient multi-modal instruction-following data synthesis tool and the...

23
Experimental
4045 elixpo/emoji_transnetv1

A Machine Learning Initiative Taken to fine tune MT5_SMALL to contextually...

23
Experimental
4046 shreyansh26/Red-Teaming-Language-Models-with-Language-Models

A re-implementation of the "Red Teaming Language Models with Language...

23
Experimental
4047 Spico197/MoE-SFT

🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction...

23
Experimental
4048 sarnikowski/danish_transformers

A collection of Danish Transformers

23
Experimental
4049 InternLM/Visual-ERM

Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"

23
Experimental
4050 lenticularis39/llama2.inferno

Inference Llama 2 in one file of pure Limbo

23
Experimental
4051 HKUSTDial/megatran

[VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with...

23
Experimental
4052 davor10105/relative-absolute-magnitude-propagation

Explain the outputs of your Vision Transformers, Residual Networks and...

23
Experimental
4053 juancmacias/Small_Lenguage_Model

Píldora formativa sobre SLM (Small Lenguage Model)

23
Experimental
4054 LuDanielPoyu/Cowrie-Log-Helper

Multi-task NLP on Cowrie honeypot attacker-session logs—classification, QA,...

23
Experimental
4055 research-outcome/llm-langchain-examples

Examples of llm apps developed using langchain opensource framework

23
Experimental
4056 tmro98/financial-news-classification

binary topic and 3-class sentiment classification of full-length news articles

23
Experimental
4057 PRITHIVSAKTHIUR/Florence-2-Image-Caption

This application utilizes the powerful Florence-2 vision-language model from...

23
Experimental
4058 jsuryanm/text-summarization-system

End-to-End text summarization system built with bart-base using HuggingFace...

23
Experimental
4059 zizisaigao/ePQA-NLP-modeling

This project builds nlp models on ePQA dataset. It covers LM trained for...

23
Experimental
4060 RhythrosaLabs/game-maker-public-dev

Game Maker is a Streamlit app that uses AI to accelerate game development by...

23
Experimental
4061 Scr44gr/elelems

A very, very, very simple SLM. I'm just learning.

23
Experimental
4062 ranzeet013/LLM_Notebooks

Exploring LLMs with interactive notebooks.

23
Experimental
4063 AI-14/micar-vl-moe

[IJCNN 2025] [Official code] - MicarVLMoE: A modern gated cross-aligned...

23
Experimental
4064 benisalla/Tiny-ViT-Transformer-from-scratch

This repository offers a straightforward implementation of Vision...

23
Experimental
4065 Rahulkumar010/microDPO

microDPO: A minimalist, pure PyTorch implementation of Direct Preference...

23
Experimental
4066 MU-Enigma/BotForge

Welcome to BotForge, an open-source project dedicated to advancing NLP-based...

23
Experimental
4067 subhasis-ai/Hindi-ASR-Wav2Vec2

This repository demonstrates development of Hindi ASR model using transformers.

23
Experimental
4068 FusionSid/Rick-AI

An AI chatbot made using DialoGPT in python | Join why discord to try it out:

23
Experimental
4069 5663015/LLMs_train

一套代码指令微调大模型

23
Experimental
4070 anar-rzayev/Empathetic-Dialogue-Generation

Open-Domain Dialogue model which produces empathetic responses when trained...

23
Experimental
4071 jacksonchen1998/LLaMA-Paper-List

Collection of papers using LLaMA as backbone model

23
Experimental
4072 elip06/covid19-fact-checking

A fact-checking system of short to medium-sized documents on the topic of COVID-19

23
Experimental
4073 yihedeng9/rlhf-summary-notes

A brief and partial summary of RLHF algorithms.

23
Experimental
4074 asigalov61/Incredible-MahlerNet

Absolutely fantastic and fully working SOTA Transformer-XL Music AI...

23
Experimental
4075 abhisheksingh-7/cotrend

Extending Decoders with an Integrated Encoder, as Part of Llama-3 Hackathon

23
Experimental
4076 Silvestre17/ChatMeter_FinalProject.inDataScience

💬 Capstone project for the Data Science bachelor's at ISCTE. "ChatMeter" is...

23
Experimental
4077 NeuralCoder3/custom_infinite_craft

A custom implementation of Infinite Craft (https://neal.fun/infinite-craft/)

23
Experimental
4078 jaepil/geometric-adam

A Ray Tracing-Inspired Approach to Neural Network Optimization

23
Experimental
4079 wambugu71/SmartAgriImage_classification_ViT

Vision Transformer trained with thousands of agricultural diseases in...

23
Experimental
4080 mattialoszach/LoRA-Agentic-Output-Format

Fine-tuning LLMs for structured agent-style outputs (e.g. JSON), built for...

23
Experimental
4081 Emart29/phi4-finance-finetuning

Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA...

23
Experimental
4082 EdVince/whisper-trtllm

Whisper in TensorRT-LLM

23
Experimental
4083 MakazhanAlpamys/Soup

Soup turns the pain of LLM fine-tuning into a simple workflow. One config,...

23
Experimental
4084 sahilichake/Document-Summarization-App-using-LLM

Document Summarization App using large language model (LLM) and Langchain...

23
Experimental
4085 himanshuvnm/Foundation-Model-Large-Language-Model-FM-LLM

This repository was commited under the action of executing important tasks...

23
Experimental
4086 SharathHebbar/Model-Sharding

Sharding Large Language Models for loading them efficiently in lesser RAM

23
Experimental
4087 therrshan/image-captioning

Comparitive analysis of image captioning model using RNN, BiLSTM and...

23
Experimental
4088 rese1f/STEVE

[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in...

23
Experimental
4089 ZhiningLiu1998/SelfElicit

[ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the...

23
Experimental
4090 waqasm86/Ubuntu-Cuda-Llama.cpp-Executable

Pre-built llama.cpp CUDA binary for Ubuntu 22.04. No compilation required -...

22
Experimental
4091 taranjeet/llmformatter

Get deterministic output in any format like json from any LLM.

22
Experimental
4092 Chirayu-Tripathi/Paper-Implementations

My implementation of Machine Learning and Deep Learning papers from scratch.

22
Experimental
4093 mbeps/llama3.1_fine-tuning_mult-it

Fine-tuning various Llama 3.1 family of models on the Mult-It dataset

22
Experimental
4094 Pro-GenAI/ShortLang

Compressed Text for efficient LLMs

22
Experimental
4095 Arneunalarming861/Laminae

Bridge raw large language models to production-ready AI with a lightweight...

22
Experimental
4096 uhasker/large-language-models

Files for the book "Large Language Models"

22
Experimental
4097 mcd-unison/llm

Material sobre Grandes Modelos de Lenguajes (LLM) realizado en forma...

22
Experimental
4098 K2-BoundaryArchitect/Reflex-Motor-DB

A deterministic reflex memory layer for stabilizing LLM execution.

22
Experimental
4099 akunba3970/llm-cost-calculator

Estimate token usage and API costs for large language models to help...

22
Experimental
4100 YY0649/ICE-PIXIU

ICE-PIXIU:A Cross-Language Financial Megamodeling Framework

22
Experimental
« Prev 1 2 3 39 40 41 42 43 76 77 78 Next »