All Transformer Models

7,795 models ranked by quality score · Page 43 of 78

Showing 4201–4300 of 7,795
# Model Score Tier
4201 NJUxlj/llm-hub

Popular Large Language Model's modeling file and finetune+pretrain scripts,...

22
Experimental
4202 OMI-KALIX/Multi-Agent-AI-Workflow-for-Content-Creation

A fully automated multi-agent AI system that creates LinkedIn content end to...

22
Experimental
4203 pkdubey/content_moderation

An AI-powered content moderation system using Python and Hugging Face...

22
Experimental
4204 rodneylab/local-ai-llm-playground

Experiments running offline LLMs in Python and Rust locally using Ollama and...

22
Experimental
4205 fattorib/Little-GPT

GPT* - Training faster small transformers using ALiBi, Parallel Residual...

22
Experimental
4206 RichardHam-co-uk/ProjectLodestar

AI development environment with 90% cost savings. Routes between 8 LLM...

22
Experimental
4207 Vincentiv/BERT_Finetuning_from_scratch

Notebook on finetuning BERT

22
Experimental
4208 SAP-samples/acl2025-contrastive-perplexity

This repository contains the source code of the ACL'25 paper "Contrastive...

22
Experimental
4209 wklee610/VLM-Model-fastapi

A reusable FastAPI module for serving and integrating Vision-Language Models (VLM)

22
Experimental
4210 amazon-science/TSFM-Compression

Official Implementation of Understanding Transformers for Time Series: Rank...

22
Experimental
4211 DanMeon/xlstruct

LLM-powered Excel parser — define a Pydantic schema, get structured data...

22
Experimental
4212 theanasuddin/Advanced-Deep-Learning

Computer exercises for Advanced Deep Learning. Includes implementations of...

22
Experimental
4213 RahulSChand/gpt2_squad

GPT-2 training on the SQuAD dataset

22
Experimental
4214 FuxiaoLiu/DocumentCLIP

[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents

22
Experimental
4215 ihuzaifashoukat/ml-mastery-path

Advanced Machine Learning and LLM training implementations. A comprehensive...

22
Experimental
4216 awsaf49/detect-fake-text

LLM - Detect AI Generated Text || Identify which essay was written by a...

22
Experimental
4217 wesleyscholl/squish

🤖🗜️⚡️ Compress local LLMs once, run them forever at sub-second load times....

22
Experimental
4218 uakarsh/TiLT-Implementation

Implementation of the paper: Going Full-TILT Boogie on Document...

22
Experimental
4219 mytechnotalent/mechanistic_interpretability

Mechanistic Interpretability (MI) is a subfield of AI alignment and safety...

22
Experimental
4220 BlackRoad-AI/blackroad-llm-fine-tuner

blackroad llm fine tuner: Part of the BlackRoad OS ecosystem. Sovereign...

22
Experimental
4221 NAME0x0/OMNI

PERSPECTIVE v2 — A 1.05 trillion parameter sparse Mixture-of-Experts...

22
Experimental
4222 qora-protocol/QORA-LLM-3B

Pure Rust inference engine for the SmolLM3-3B language model. No Python...

22
Experimental
4223 kyegomez/TinyGPTV

Simple Implementation of TinyGPTV in super simple Zeta lego blocks

22
Experimental
4224 gurpejsingh13/punjabi-gpt-scratch-20m

Developed and pre-trained a 20.39M-parameter Punjabi GPT-style base model...

22
Experimental
4225 devbm7/QGen

Question Generator System

22
Experimental
4226 symanto-research/merge-tokenizers

Package to align tokens from different tokenizations.

22
Experimental
4227 HySonLab/Protein_Pretrain

Multimodal Pretraining for Unsupervised Protein Representation Learning

22
Experimental
4228 AswaniSahoo/llama-task-agent

Fine-tuned LLaMA-3.1-8B task agent with LoRA for reliable tool execution

22
Experimental
4229 GJ98/Megatron-LM

Megatron-LM implemented in PyTorch

22
Experimental
4230 kunjmehta/cross-modal-retrieval-food-ai

Course project for 198:536 at Rutgers University. The project is about...

22
Experimental
4231 hazdzz/converter

The official PyTorch implementation of Converter.

22
Experimental
4232 abhilashpuli98/Deep-Learning-Paper-Implementations

A collection of paper implementations using the PyTorch framework

22
Experimental
4233 mgokulkrish/LENR.ai

GitHub repo for a recommendation system using LLMs.

22
Experimental
4234 mrtrizer/UnityLlamaCpp

Llama.cpp in Unity, straightforward and clean

22
Experimental
4235 liyaooi/TAMO

TAMO: reimagine Table representation as an independent Modality for LLMs

22
Experimental
4236 jawline/Synthic

Automatically generate gameboy music using machine learning

22
Experimental
4237 sairam-s0/local_ai_automation

This project automates question solving using AI and OCR. Instead of...

22
Experimental
4238 amajji/LLM-Quantization-Techniques-Absmax-Zeropoint-GPTQ-GGUF

LLM quantization techniques: absmax, zero-point, GPTQ and GGUF

22
Experimental
4239 NamelyCorp/NamelyCorp-LLM-Studio

Local-first LoRA fine-tuning studio with web UI for document-grounded LLM training.

22
Experimental
4240 BlackRoad-OS/Modelfile

BlackRoad OS Ollama model definitions and custom models

22
Experimental
4241 PRITHIVSAKTHIUR/Molmo2-HF-Demo

A Gradio-based demonstration for the AllenAI Molmo2-8B multimodal model,...

22
Experimental
4242 Nilanshrajput/Intent_classification

Intent Classification with Hugging Face, MLflow experiment tracking,...

22
Experimental
4243 wanglne/DELMAN

[ACL 2025 Findings] DELMAN: Dynamic Defense Against Large Language Model...

22
Experimental
4244 agentdr1/LA_MIL

Implementation of LA_MIL, Local Attention Graph-based Transformer for WSIs, PyTorch

22
Experimental
4245 DNGros/lmwrapper_OLD

An object-oriented wrapper around language models. Moved to...

22
Experimental
4246 D0men1c0/Benchmark-Gemma-Models

Highly customizable Python suite for LLM evaluation (Gemma, LLaMA+). Full...

22
Experimental
4247 parham1998/Enhancing-High-Vocabulary-IA-with-a-Novel-Attention-Based-Pooling

Official PyTorch Implementation of: "Enhancing High-Vocabulary Image...

22
Experimental
4248 IAAR-Shanghai/FastMem

Fast Memorization of Prompt Improves Context Awareness of Large Language...

22
Experimental
4249 hjshah142/BERT-Fine-Tuning-Software-Requirements-Classification

Fine-tuning a pre-trained model using the Transformers library (Bert) on...

22
Experimental
4250 ccs96307/fast-llm-inference

Accelerating LLM inference with techniques like speculative decoding,...

22
Experimental
4251 mohamedshameem-dev/Review_Classification_Engine

Batch-optimized LLM-based automated customer review classification and...

22
Experimental
4252 papachristoumarios/llm-network-formation

Supplementary Code and Data for "Network Formation and Dynamics among Multi-LLMs"

22
Experimental
4253 pecharesjoselito/chuck.optimizer

Optimize neural network training by monitoring loss, gradients, and...

22
Experimental
4254 llap4585/T5-Refiner-DomainFocus-TrainOnly

This project provides code for fine-tuning T5/mT5 models on data...

22
Experimental
4255 aimagelab/JARVIS

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large...

22
Experimental
4256 jrajath94/jax-transformer-impl

JAX/XLA Transformer with MHA, MQA, GQA (Ainslie et al. 2023) — JIT, vmap, pmap

22
Experimental
4257 aliuyar1234/proberoute

Research code for ProbeRoute, a probe-initialized sparse routing method for...

22
Experimental
4258 aswinvinodd/emotion-detection-system

AI-based Emotion Detection and Sentiment Analysis System using NLP and Streamlit

22
Experimental
4259 timteh/timteh-forge

⚡ TIMTEH Model Forge — Uncensored, abliterated & reasoning-distilled GGUFs....

22
Experimental
4260 mamounyosef/commit-message-llm

Fine-tuning Qwen2.5-Coder-0.5B LLM using QLoRA (4-bit quantization + LoRA)...

22
Experimental
4261 JacobJ215/Sentiment-Analysis-with-DistilBERT

Here we leverage a subset of the amazon_polarity dataset to train two...

22
Experimental
4262 mcbal/afem

Implementation of approximate free-energy minimization in PyTorch

22
Experimental
4263 machinelearningzuu/LLM-in-Production

Welcome to the "LLM in Production" repository! This project aims to provide...

22
Experimental
4264 SharathHebbar/sft_mathgpt2

Supervised fine-tuning using the TRL library

22
Experimental
4265 utkukose/llm_persona_hallucination_study

Code for the study on persona vectors in controlling / understanding...

22
Experimental
4266 Trustworthy-ML-Lab/Efficient-LLM-automated-interpretability

[NeurIPS'23 ATTRIB] An efficient framework to generate neuron explanations for LLMs

22
Experimental
4267 Mukuta-Manit-D/AI-Mirror

AI Mirror is a smart, interactive web application that detects human...

22
Experimental
4268 0606zt/PanoLlama

[ICCV 2025 Highlight] Panorama Generation as a Next-Token Prediction Task.

22
Experimental
4269 or4k2l/enhanced-audio-anomaly-detection

Hybrid ensemble (AST + classical) for industrial anomaly detection. Pump:...

22
Experimental
4270 ArshockAbedan/Natural-Language-Processing-with-Attention-Models

Attention Models in NLP

22
Experimental
4271 EN10/BabyLlama

Train and run a small Llama 2 model from scratch on the TinyStories dataset.

22
Experimental
4272 dunktra/attention-binding-a11y

Code for tracking concept emergence via attention-head binding (EB*). Pythia...

22
Experimental
4273 mbeps/qwen3-italic-benchmark

Benchmarking Qwen3 models of various sizes on the ITALIC benchmark to evaluate...

22
Experimental
4274 termehtaheri/SAR-LM

Official implementation of “SAR-LM: Symbolic Audio Reasoning with Large...

22
Experimental
4275 anyantudre/NLP-Course-Hugging-Face

This course will teach you about Natural Language Processing (NLP) using...

22
Experimental
4276 Vext-Labs-Inc/vext-pentest-7b

Open-source 7B language model for autonomous penetration testing — parses...

22
Experimental
4277 xHarshit/Self-Healing-Classification-DAG-with-Fine-Tuned-Model

A self-healing text classification pipeline built with LangGraph and a...

22
Experimental
4278 HishamAlyahya/PyLLM

Leverage Large Language Models to generate and execute code dynamically...

22
Experimental
4279 Keyvanhardani/kvcache-autotune

Automatic KV-Cache optimization for HuggingFace Transformers. Find the...

22
Experimental
4280 LinukPerera/Physics-Constrained-Transformer-for-Cyclone-Trajectory-and-Damage-Prediction

This framework fuses satellite imagery, atmospheric data, and terrain...

22
Experimental
4281 kyegomez/open_qwen

A non-official implementation of Qwen 3.5, as there doesn’t seem to be a...

22
Experimental
4282 hereandnowai/transformers-simplified

Simplified, standalone Python scripts for transformer models, LLMs, TTS,...

22
Experimental
4283 rohanmistry231/Transformers-Hugging-Face-Interview-Preparation

A curated resource for mastering Transformers and Hugging Face libraries,...

22
Experimental
4284 M4T1SS3/DeltaLoop

Continuous fine-tuning layer that converts AI agent logs into LoRA adapters.

22
Experimental
4285 deepagency/llm-resource-planner

A simple CLI tool to fetch Hugging Face model metadata and estimate required...

22
Experimental
4286 omerfarooq223/AutoGrader-Agent

AI agent that grades student assignments from a ZIP file using LLMs —...

22
Experimental
4287 tegridydev/mechamap

MechaMap - Toolkit for Mechanistic Interpretability (MI) Research

22
Experimental
4288 orionw/MTLvsIFT

Code for the paper "When to Use Multi-Task Learning vs Intermediate...

22
Experimental
4289 nphdang/Pred-LLM

Generating tabular data via Large Language Models (LLMs)

22
Experimental
4290 himanshu231204/hk-devbrain

HK-DevBrain is a lightweight AI developer assistant built on Llama 3.2 (3B)...

22
Experimental
4291 rmovva/LLM-publication-patterns-public

[NAACL 2024] Topics, Authors, and Institutions in Large Language Model...

22
Experimental
4292 idiap/HMMGradients.jl

Enables computing the gradient of the parameters of Hidden Markov Models (HMMs)

22
Experimental
4293 HectorPulido/discord-bot-LLama

It's a chatbot made with Python that simulates natural conversation with...

22
Experimental
4294 AntonioVFranco/elamonica

Production-ready test-time compute optimization framework for LLM inference....

22
Experimental
4295 avijit-thawani/Augmented-LMs

Living Survey of Augmented LMs

22
Experimental
4296 svn05/vietnamese-nmt

Vietnamese-English-Japanese NMT with fine-tuned NLLB-200, beam search, and...

22
Experimental
4297 zalkklop/LVSM

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D...

22
Experimental
4298 SolomonB14D3/intelligent-svd

Knowledge-preserving SVD compression for large language models via...

22
Experimental
4299 pinkbanty5707/GEO-AI-Woo

Optimize WooCommerce sites for AI search engines by generating llms.txt,...

22
Experimental
4300 reyrove/Sparrow-Hawk-CodeArtGenerator

A sassy, neon-drenched AI copilot for chaotic creators—built with Groq +...

22
Experimental