All Transformer Models

7,795 models ranked by quality score · Page 34 of 78

Showing 3301–3400 of 7,795
# Model Score Tier
3301 LaMP-Benchmark/LaMP

Codes for papers on Large Language Models Personalization (LaMP)

28
Experimental
3302 kaalen/tiny-assistant

Experimenting with smaller LLMs that can run on commodity hardware like...

28
Experimental
3303 DanHrmti/SenTransformer-VAE-pytorch

Sentence VAE using the Transformer encoder-decoder architecture.

27
Experimental
3304 jasonacox/ProtosAI

A Study in Artificial Intelligence - Simple scripts that explore...

27
Experimental
3305 sabrinaherbst/distilbert_question_answering

Implements a Q&A ML model usuing DistilBERT.

27
Experimental
3306 Yusuf270200101/DeepAnalyze

🔍 Empower data scientists with DeepAnalyze, a tool that leverages large...

27
Experimental
3307 khairulislam/Timeseries-Explained

Interpreting Deep Learning timeseries models using Local Interpretation methods

27
Experimental
3308 LSquaredM/mutual_info_scaling_law

(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for...

27
Experimental
3309 HROlive/Advanced-Deep-Learning-with-Transformers

Workshop that will take you from Graph Neural Networks (GNNs) to...

27
Experimental
3310 andylolu2/jax-vqvae-gpt

Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.

27
Experimental
3311 oooranz/GraDe

📐 Not All Features Deserve Attention: Graph-Guided Dependency Learning for...

27
Experimental
3312 ImMohammadHosseini/MKP-RL

:sparkles: Solve multi_dimensional multiple knapsack problem using...

27
Experimental
3313 Uokoroafor/transformer_from_scratch

This is a PyTorch implementation of the Transformer model in the paper...

27
Experimental
3314 WooooDyy/MathCritique

Implementation for the research paper "Enhancing LLM Reasoning via Critique...

27
Experimental
3315 zhuang-li/SCAR

[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response...

27
Experimental
3316 debugst1ck/tarp

🦠 Antimicrobial resistance prediction using transformer models.

27
Experimental
3317 othmanelhoufi/LM-for-FactChecking

An automated solution for fact-checking using available claims and fake-news...

27
Experimental
3318 plutonium-239/memsave_torch

Lowering PyTorch's Memory Consumption for Selective Differentiation

27
Experimental
3319 philogicae/ai-notebooks-colab

Useful colab notebooks to try out Stable Diffusion, LLM, etc.

27
Experimental
3320 nishantb06/smolLM

Reverse Engineering SmolLM2 model and training it from scratch

27
Experimental
3321 hurui200320/llama-cpp-kt

The Kotlin wrapper of llama.cpp, powered by JNA

27
Experimental
3322 LastBotInc/llama2j

Pure Java Llama2 inference with optional multi-GPU CUDA implementation

27
Experimental
3323 AmericanPresidentJimmyCarter/yal-discord-bot

Yet Another LLaMA/ALPACA Discord Bot

27
Experimental
3324 AshutoshKulkarni4998/UMWTransformer

Inference code for "Unified Multi-Weather Transformer for Multi-Weather...

27
Experimental
3325 atomlayer/llama_cute_voice_assistant

Llama cute voice assistant

27
Experimental
3326 DDDOH/LLM_News

LOLA_ LLM-Assisted Online Learning Algorithm for Content Experiments

27
Experimental
3327 MusfiqDehan/Llama2-Finetuned-for-Translation

Fine-Tuned Llama-2 For Machine Translation

27
Experimental
3328 harshtiwari01/llm-heatmap-visualizer

A set of scripts to generate full attention-head heatmaps for transformer-based LLMs

27
Experimental
3329 snexus/nlp-question-answering-system

Question answering system with transformers

27
Experimental
3330 codessian/epistemic-confidence-layer

Model-agnostic trust protocol for calibrated, auditable AI

27
Experimental
3331 snsn3/policy-LLM

Finetuning an LLM for heavy policy work

27
Experimental
3332 AnkitaMungalpara/Building-LLM-From-Scratch

This repository provides a step-by-step guide to creating your own large...

27
Experimental
3333 tobifinn/ensemble_transformer

Official PyTorch implementation of "Self-Attentive Ensemble Transformer:...

27
Experimental
3334 AdrienneDeganutti/DANTE-AD

"DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description"...

27
Experimental
3335 Scientific-Computing-Lab/Tokompiler

Scope is all you need: Transforming LLMs for HPC Code

27
Experimental
3336 isaacus-dev/terge

An easy-to-use Python library for merging PyTorch models.

27
Experimental
3337 scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

27
Experimental
3338 declare-lab/KNOT

This repository contains the implementation of the paper -- KNOT: Knowledge...

27
Experimental
3339 andyngdz/exogen_backend

ExoGen Backend

27
Experimental
3340 ictnlp/LSG

The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers...

27
Experimental
3341 gersongerardcruz/extractive_and_abstractive_text_summarization

A combination of extractive and abstractive text summarization for...

27
Experimental
3342 Guest400123064/ezgatr

Geometric Algebra Transformer Made Easy

27
Experimental
3343 cui-shaobo/causal-strength

evaluating the causal strength between cause and effect

27
Experimental
3344 richardsonlima/synapsense

SynapSense: Python In-Context Learning for Large Language Models SynapSense...

27
Experimental
3345 sparkle-reasoning/sparkle

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs...

27
Experimental
3346 JianxXiong/AAPO

Implementation of AAPO (Arxiv: 2505.14264v2) paper

27
Experimental
3347 affjljoo3581/starcoder-jax

a Jax/Flax inference code of StarCoder

27
Experimental
3348 LoupFireYT/llm.c

🛠️ Explore GPT-2 with llm.c, a lightweight C implementation that simplifies...

27
Experimental
3349 Quotify-Bot/quotify-frontend

AI-powered inspirational quote generator

27
Experimental
3350 Orfeous/llamacpp.net

C#/.NET binding of llama.cpp

27
Experimental
3351 ziansu/prorec

Official Implementation of NeurIPS 2024 paper - Source Code Foundation...

27
Experimental
3352 emagod/LLM-Forecast

🚀 Integrate ARIMA and Large Language Models for accurate forecasting with...

27
Experimental
3353 navamai/navamai

Use NavamAI to supercharge your productivity and workflow with personal,...

27
Experimental
3354 robjsliwa/llama-agent

Fun project to run your own LLM chat bot using llama.cpp

27
Experimental
3355 jesusvilela/IGBundle-LLM

IGBundle LLM is an experimental framework for adapting Large Language Models...

27
Experimental
3356 newfull5/NLLB-200-Distilled-350M-en-ko

nllb-200 distilled 350M for English to Korean translation

27
Experimental
3357 rachel-pai/T5Elasticsearch

Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers.

27
Experimental
3358 nmamie/HiveLLM

This project evaluated the collective intelligence potential of...

27
Experimental
3359 Framstag/LLMAnalysisJinni

A tool to implement complex analysis tasks using an LLM in cases where you...

27
Experimental
3360 imSanko/Image_Caption_Generator_With_Transformers

This repository contains code for generating captions for images using a...

27
Experimental
3361 IsmaelMousa/TTL

Full-stack simulator for a todo task list application using FastAPI, I built...

27
Experimental
3362 Yellow4Submarine7/LLMDoctor

🩺 Token-Level Flow-Guided Preference Optimization for Efficient Test-Time...

27
Experimental
3363 bernardoleite/question-generation-t5-pytorch-lightning

Question Generation for English and Portuguese, using the T5 model,...

27
Experimental
3364 rekalantar/MedSegmentAnything_SAM_LungCT

The code to finetune SAM with bounding box prompt for segmentation of the lungs on CT

27
Experimental
3365 sno2/bertml

Use common pre-trained ML models in Deno!

27
Experimental
3366 CeMOS-IS/GenFormer

[ICPR 2024] Official repository of the paper "GenFormer - Generated Images...

27
Experimental
3367 codepawl/turboquant-torch

Unofficial PyTorch implementation of TurboQuant (Google Research, ICLR...

27
Experimental
3368 AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna

This repository contains code to fine-tune a Llama-7B-Uncensored model using...

27
Experimental
3369 gsarti/pecore

Materials for "Quantifying the Plausibility of Context Reliance in Neural...

27
Experimental
3370 aerosta/rewardhackwatch

Runtime detector for reward hacking and misalignment in LLM agents (89.7% F1...

27
Experimental
3371 JarvisPei/MemDLM

MemDLM: Memory-enhanced Diffusion Language Model

27
Experimental
3372 shub-garg/Vision-Transformer-VIT-for-MNIST

This repository implements a Vision Transformer (ViT) to classify...

27
Experimental
3373 szheng3/Rust-server-pre-trained-models

Rust server that summarizes text with pre-trained models

27
Experimental
3374 mpociot/llamero

A GUI application to easily try out Facebook's LLaMA models.

27
Experimental
3375 Hyun-Ryu/clover

Official code for "Divide and Translate: Compositional First-Order Logic...

27
Experimental
3376 JamesVorder/python-tddpp

This LLM generates code based on tests, and makes sure they pass.

27
Experimental
3377 Wells-the-Doctor/leaxer

🌟 Build and deploy local AI models with Leaxer for real-time interaction,...

27
Experimental
3378 eshoyuan/WeChat-LLM

WeChat-LLM: Build a LLM that Mirrors Your Chat Style Using WeChat...

27
Experimental
3379 OpenNLG/OpenBA-v2

OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing...

27
Experimental
3380 maggiesong7/FullyAttentional

Fully Attentional Network for Semantic Segmentation [AAAI 2022]

27
Experimental
3381 Michael-Jackson666/Zero2Hero-AI

From first principles to state-of-the-art: A hands-on journey implementing...

27
Experimental
3382 hydropix/AutoDescribe-Images

Tool to automatically generate text descriptions for images using Ollama...

27
Experimental
3383 ibnaleem/mixtral.py

A Python module for running the Mixtral-8x7B language model with...

27
Experimental
3384 jankstar/pydocu

fastapi server for classification of documents and extraction of data

27
Experimental
3385 aidendorian/Marcella-60M-SLM

A 66M parameter decoder-only transformer language model implemented from...

27
Experimental
3386 yul091/GraphLogAD

Codebase for the ICKG 2023 paper: "GLAD: Content-aware Dynamic Graphs For...

27
Experimental
3387 muhammad-fiaz/EMSUGI

EMSUGI is a future prediction & analysis project on various factor like...

27
Experimental
3388 li-plus/nanoRLHF

Train a tiny LLaMA model from scratch to repeat your words using...

27
Experimental
3389 CanvaChen/chinese-llama-tokenizer

目标:构建一个更符合语言学的小而美的 llama 分词器,支持中英日三国语言

27
Experimental
3390 Losif01/text-preprocessing-to-transformers-NLP-notes

This repo is my personal notes from the Stanford NLP course, and i currently...

27
Experimental
3391 isaaccorley/segmenter-pytorch

PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation"...

27
Experimental
3392 UCSC-VLAA/Sight-Beyond-Text

[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal...

27
Experimental
3393 ilya16/deephumor

DeepHumor: Image-based Meme Generation using Deep Learning

27
Experimental
3394 merlerm/In-Context-Symbolic-Regression

Official code implementation for the ACL 2024 Student Research Workshop...

27
Experimental
3395 yyy01/PAC

The official implementation of the paper "Data Contamination Calibration for...

27
Experimental
3396 Y-Research-SBU/CSR

Official Repository for CSR - ICML 2025 Oral

27
Experimental
3397 Type-Here/med-vix-ray

A Knowledge-Guided Model for CXR classification

27
Experimental
3398 erfanzar/OST-OpenSourceTransformers

OST Collection: An AI-powered suite of models that predict the next word...

27
Experimental
3399 ArtificialZeng/transformers-Explained

官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。

27
Experimental
3400 januverma/transformers-for-sequential-recommendation

Notebooks on using transformers for sequential recommendation tasks

27
Experimental
« Prev 1 2 3 32 33 34 35 36 76 77 78 Next »