All Transformer Models

7,795 models ranked by quality score · Page 62 of 78

Showing 6101–6200 of 7,795
# Model Score Tier
6101 AmitAK1/Neural-Vulnerability-Scanner

CodeBERT + LoRA fine-tuning for C/C++ vulnerability detection | F1 = 74.3% |...

14
Experimental
6102 AddictivelyRecursive/lightweight-multimodal-transformer-pipeline

Lightweight multimodal transformer pipeline comparing MobileViT and...

14
Experimental
6103 mtichikawa/llm-bias-detection

Research project detecting and quantifying demographic bias in language models

14
Experimental
6104 igna-s/QLoRA-Experiments

A collection of SFT and distillation pipelines to train specialized medical...

14
Experimental
6105 adityonugrohoid/vllm-explorer

Probes and catalogs the full vLLM server API — endpoint reference, model...

14
Experimental
6106 DzmitryPihulski/Encoder-transformer-from-scratch

Fully functional encoder transformer from tokenizer to lm-head

14
Experimental
6107 anviit/llm-inference-serving

Production LLM inference stack — 28ms TTFT, 39 tok/s, 81% cache hit rate on a 6GB GPU

14
Experimental
6108 benjaminr/gendantic

Generate synthetic data using Pydantic Models and LLMs

14
Experimental
6109 Hidde-Heijnen/audioGPT

Cambridge Master Thesis: Audio Interpretable Transformers

14
Experimental
6110 abdulrahimzulfiqar/QueryMate

A portable, offline-first CLI AI assistant built with C++ and llama.cpp....

14
Experimental
6111 philippnormann/malicious-payload-detection

🕵️‍♂️ ML project to identify malicious web payloads, aimed at boosting the...

14
Experimental
6112 taishan1994/Classification-Factory

Text classification implemented with minimal code, including binary, multi-class, and multi-label classification.

14
Experimental
6113 HubertKasperek/ai-companion-py

Python bindings for ai-companion (only backend, without WebUI)

14
Experimental
6114 andreaps96/prompt-injection-classifier

Fine-tuned mDeBERTa-v3 binary classifier for prompt injection detection on...

14
Experimental
6115 ztjhz/miniLM

Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project

14
Experimental
6116 jshwanth/image-captioning

Error-centric comparison of CNN-LSTM, attention-based, and transformer...

14
Experimental
6117 lciric/gptq-from-scratch

GPTQ post-training quantization from scratch — GPT-2, OPT, LLaMA support

14
Experimental
6118 EM7m4/Distill-R1

Combine reinforcement learning with online teacher-student distillation to...

14
Experimental
6119 RManLuo/llm-facteval

Source code of paper "Systematic Assessment of Factual Knowledge in Large...

14
Experimental
6120 rajkundalia/error-analyzer-with-baml

Analyze Java compilation and runtime errors using BAML with a local Ollama...

14
Experimental
6121 Gholamrezadar/finetuning_llm_on_letter_counting

Fine-tuning Gemma-3 4B on the letter-counting dataset

14
Experimental
6122 ghostperpper007/small_programming_model

A from-scratch Python code model with GNN-based structure encoding,...

14
Experimental
6123 AmirK-S/llm-to-production-python

Complete LangChain/LangGraph training course in Python, from basic LLMs to advanced agents

14
Experimental
6124 soy-tuber/localllama-insights

Technical insights from r/LocalLLaMA — vLLM, FP8, NVFP4, Blackwell GPU...

14
Experimental
6125 egeozsoy/ORacle

Official code of the paper ORacle: Large Vision-Language Models for...

14
Experimental
6126 shrutikakapade/Designing-Structured-Outputs-for-LLMs-TypedDict-Pydantic-Output-Parsers-with-LangChain

Design robust structured outputs for LLM applications. Learn how to enforce...

14
Experimental
6127 anna-kay/Reddit-summarization

Abstractive summarization of Reddit datasets with Transformers.

14
Experimental
6128 bywords/directed_sentiment_analysis

Dataset and code for directed sentiment analysis in news text.

14
Experimental
6129 TJ-Solergibert/transformers-in-supercomputers

Transformers training in a supercomputer with the 🤗 Stack and Slurm

14
Experimental
6130 nv78/Benchmarking-Question-Answering

Research Code for Benchmarking Question Answering Models Paper

14
Experimental
6131 sacredvoid/alignrl

LLM post-training playbook: SFT, GRPO, DPO, eval, and inference. pip install alignrl

14
Experimental
6132 zpqiu/rl-infra-notes

Personal notes about RL Infra/Algorithm and LLMs

14
Experimental
6133 liziniu/cold_start_rl

Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?

14
Experimental
6134 its-not-rocket-science/mnemosyne

An autonomous, distributed knowledge discovery agent combining LLMs and...

14
Experimental
6135 martainntwiglike586/tinyforge

Train a 0.8B parameter model to improve itself on any task with tests, using...

14
Experimental
6136 ikun-llm/ikun-Distill

Knowledge Distillation from teacher model 🎓

14
Experimental
6137 huysuy05/LLM-Bias-Benchmark

An analysis of different LLMs' performance on balanced, imbalanced data with...

14
Experimental
6138 iakashpaul/Portal

Android app for the Hole in your Palm project, making LLMs accessible on-device!

14
Experimental
6139 YounesBensafia/Algeria-2-0-FineTuning-workshop

This repository contains resources and examples used in my workshop for...

14
Experimental
6140 Pects1949/LLM-Fine-tuning-Toolkit

A comprehensive toolkit for fine-tuning and deploying Large Language Models...

14
Experimental
6141 Witurpred64/LLM-FineTuning-Toolkit

A comprehensive toolkit for fine-tuning Large Language Models (LLMs) with...

14
Experimental
6142 buhsnn/eli5-gpt2-language-model

Decoder-only Transformer (GPT-2 style) trained from scratch on the ELI5...

14
Experimental
6143 ZZZ150751/cs336_spring2025_assignment1

Implementation of a Decoder-only Transformer language model from scratch for...

14
Experimental
6144 di37/full-fine-tuning-nvidia-question-and-answering

Flan-t5-base model was fine-tuned on Nvidia Question and Answer Pair Dataset...

14
Experimental
6145 miga1999/AirClaw

Run OpenClaw locally on any GPU or CPU without API costs, supporting large...

14
Experimental
6146 Atsusheeesh/vllm-daily

📊 Summarize merged PRs daily with vLLM, ensuring you stay updated on key...

14
Experimental
6147 YUECHE77/EE641-Final-Project

Deprecated

14
Experimental
6148 botosadam/matryoshka

🚀 Build Ruby gems that utilize Rust for enhanced performance through two...

14
Experimental
6149 gayathrikumari/Enterprise-Grade-LLM-Optimization-Alignment-PEFT-RLHF-

End-to-end LLM optimization pipeline covering PEFT (LoRA), RLHF (PPO),...

14
Experimental
6150 eren23/blipren_release

BLIP-2 implementation for training vision-language models. Q-Former + frozen...

14
Experimental
6151 dakshjain-1616/nemotron3-super-vs-gpt5.4-nano

Head-to-head benchmark comparing Nemotron and GPT-5.4-nano on code generation tasks

14
Experimental
6152 ScalingOpt/SGG

[ACL 2025 Main] Taming LLMs by Scaling Learning Rates with Gradient Grouping

14
Experimental
6153 Pyrolignic-paydirt84/pse-vcipher-collapse

Accelerate LLM inference by collapsing attention paths with...

14
Experimental
6154 serhiismetanskyi/llm-output-evaluation-with-deepeval

DeepEval LLM quality evaluation tests with LLM-as-a-judge

14
Experimental
6155 yilin-geng/llm-instruction-conflicts

This repository contains the data and the code for the paper "Control...

14
Experimental
6156 FawwazAhmd/msc-group-project

MSc group project evaluating instruction-tuned LLMs for legal clause...

14
Experimental
6157 Prajit-Rahul/Transformer-Based-Multi-Modal-Stock-Market-Prediction

Multi-modal stock forecasting with transformers, sentiment signals, and...

14
Experimental
6158 Thableaddles57/neural-nexus

A modular framework for distributed LLM training and fine-tuning across...

14
Experimental
6159 mpuodziukas-labs/llm-cobol-benchmark

Systematic benchmark: top LLMs produce broken COBOL. 5 programs, 3 models,...

14
Experimental
6160 SolomonB14D3/confidence-cartography

Teacher-forced confidence as a false-belief sensor for language models.

14
Experimental
6161 webml-community/TranslateGemma-WebGPU

TranslateGemma is a family of translation models from Google, built on top...

14
Experimental
6162 pranavgoel29/comparinator

Browser-local image region comparator using Transformers.js + pixel...

14
Experimental
6163 aims-foundations/reeval

Reliable and Efficient Model-based Generative Model Evaluation

14
Experimental
6164 rick97julho/do-i-have-the-vram

🔍 Estimate your VRAM needs for Hugging Face models in seconds without...

14
Experimental
6165 Almas-ansari/Transliteration-Hindi-To-English

A neural machine transliteration system that converts words written in...

14
Experimental
6166 chizkidd/bert-masked-attention-visualizer

Visualizing and analyzing BERT self-attention heads during masked language modeling.

14
Experimental
6167 Isha1600/LLM-Finetuning

Fine-tuning Large Language Models (LLMs) using custom datasets for improved...

14
Experimental
6168 driellecristine/BERT-Contrastive-LoRA

Enhance BERT fine-tuning for intent classification using supervised...

14
Experimental
6169 codershiyar/llama-google-colab-tutorial

Step-by-step tutorial on loading and using Llama 3.1 8B Instruct in Google...

14
Experimental
6170 Prelf1992/LLM-Orchestration-Framework

A robust framework for building, deploying, and managing large language...

14
Experimental
6171 Josh396s/BERT-Contrastive-LoRA

Optimizing BERT for intent classification on the Amazon Massive dataset...

14
Experimental
6172 Yahy5715/jailbreak-defense

Detect and prevent large language model jailbreaks using hidden state causal...

14
Experimental
6173 derrickrajkumar10/GameOfThrones_AI_Analayser

AI/NLP dashboard that analyses Game of Thrones episodes and predicts...

14
Experimental
6174 Allen0307/AdapterBias

Code for the Findings of NAACL 2022(Long Paper): AdapterBias:...

14
Experimental
6175 tk-yasuno/feature_tsfm_hybrid_gbdt

HVAC Range Deviation Forecast - v2.0 (Hybrid Model) Granite TS Embeddings +...

14
Experimental
6176 samx77710/brrr

🌬️ Streamline your coding workflow with brrr, a tool that enhances...

14
Experimental
6177 GhostGuyyt/llm-cpp

Provide lightweight, zero-dependency C++17 single-header libraries to...

14
Experimental
6178 aakarsh31/qlora-llm-finetuning

QLoRA fine-tuning of Llama 3.2 3B on MedQA with full LoRA rank ablation...

14
Experimental
6179 ajheshbasnet/model-reincarnated

a collection of re-implementations of renowned artificial intelligence...

14
Experimental
6180 macespinoza/mini-transformer-didactico

Didactic implementation of an Encoder–Decoder Transformer based on...

14
Experimental
6181 mohitdixit02/webpage-chatbot

A Chrome extension chatbot that answers user queries based on the content of...

14
Experimental
6182 anastber/youtube-sentiment-analysis

AI-powered YouTube comment sentiment analyzer built with Streamlit. ...

14
Experimental
6183 cnygaard/FractalHTransformer

Fractal Hierarchical Transformer: multi-resolution causal attention patterns...

14
Experimental
6184 lxntung95/lyricloop-llm

LyricLoop bridges the gap between semantic LLM text and professional musical...

14
Experimental
6185 olafurjohannsson/models

Transformer Models

14
Experimental
6186 ahmad-albasha/Frankenstein-LLM-Model-fine-tuning-code

Fine-tuning Mistral-7B-v0.1 on Mary Shelley's Frankenstein using LoRA/QLoRA...

14
Experimental
6187 trivox-io/llmx

Unified Python interface for working with multiple LLM providers across...

14
Experimental
6188 VicNa559/code-offline

Run a local coding agent with llama.cpp for private, containerized...

14
Experimental
6189 dustalov/llmfao

Large Language Model Feedback Analysis and Optimization (LLMFAO)

14
Experimental
6190 CogComp/SRL-Spanish

Semantic Role Labeling (SRL) for Spanish using transformers

14
Experimental
6191 rawatshaurya/LORA-vs-QLORA

Reasoning-style fine-tuning of an instruction LLM using LoRA vs QLoRA,...

14
Experimental
6192 bassrehab/steering-vectors-agents

Runtime control of LLM agent behaviors through activation steering vectors....

14
Experimental
6193 Shreya831/AI-Medical-Symptom-Checker-SLM

AI-powered medical symptom checker using DistilBERT (Small Language Model)...

14
Experimental
6194 MukundaKatta/grammarprobe

GrammarProbe — Universal Grammar Detector. Test whether LLMs have...

14
Experimental
6195 luozichen/NeonBench

A systematic study of ultra-tiny language models

14
Experimental
6196 nachammai779/Large-Language-Models-from-Scratch

🧠 Building Large Language Models from the ground up — BPE tokenization,...

14
Experimental
6197 jinda-liu/R-LoRA

This repository contains the source code and related resources for R-LoRA.

14
Experimental
6198 arafat2020/cut_py

An R&D project to cut the best part from a video using AI and ffmpeg.

14
Experimental
6199 Harsha-hue/visual-transformer-guide

I built a visual guide explaining how Transformers work. Tokenization...

14
Experimental
6200 dt-3t/TeGu

Official implementation of "Temporal Guidance for Large Language Models". A...

14
Experimental