All Transformer Models

7,795 models ranked by quality score · Page 33 of 78

Showing 3201–3300 of 7,795
# Model Score Tier
3201 Loguru-AI/Loguru-CLI

An interactive command-line interface that brings intelligence to your logs.

28
Experimental
3202 HamidrezaGholamrezaei/LLM-Text-Classification-with-RoBERTa

A project demonstrating the use of Large Language Models (LLMs) for text...

28
Experimental
3203 rasyosef/amharic-news-category-classification

Notebooks to fine-tune `bert-small-amharic`, `bert-mini-amharic`, and...

28
Experimental
3204 HemantBK/LLaMA-Sum-Fine-Tuning

Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit...

28
Experimental
3205 sayannath/ViT-TF-Hub-Application

Build and fine-tune your Image Classifier using a Vision Transformer Model...

28
Experimental
3206 dimiz51/DETR-Factory-PyTorch

This project is an implementation of the Detection Transformer (DETR) and...

28
Experimental
3207 FuxiaoLiu/VisualNews-Repository

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

28
Experimental
3208 aaronbriel/absum

Abstractive Summarization for Data Augmentation

28
Experimental
3209 iovdin/tune-models

LLM models for tune, from openai, anthropic, openrouter, groq, ollama, mistral

28
Experimental
3210 Riko0/messenger_logger_callback

messenger-logger-callback — Send ML training logs to Telegram. Standalone...

28
Experimental
3211 LegendLeoChen/llm-finetune

Fine-tuning models from Hugging Face using libraries such as trl, peft, and transformers.

28
Experimental
3212 alxfgh/Large-Language-Models-in-Chemistry

Working collection of papers, repos and models of transformer based language...

28
Experimental
3213 EdgeTypE/FolderToLLM

A PowerShell tool for Windows that recursively scans a directory, captures...

28
Experimental
3214 kenzic/run-models-in-the-browser-with-transformers.js-demo

Working demo for the article Run Models in the Browser With Transformers.js

28
Experimental
3215 Furyton/awesome-language-model-analysis

This paper list focuses on the theoretical and empirical analysis of...

28
Experimental
3216 KimJaehee0725/YoYAK

[13th ToBig's Conference] YoYAK - Yes or Yes, Attention with gap-sentence for Korean...

28
Experimental
3217 MalihehIzadi/catiss

CatIss is an intelligent tool for automatic categorization of issue reports...

28
Experimental
3218 cpuheater/cause-life-is-a-game

Solving games with reinforcement learning

28
Experimental
3219 ranpy13/Learning-LLM

Learning to build LLM from scratch, following rasbt/LLMs-from-scratch footsteps.

28
Experimental
3220 tanishqgautam/SETI-Breakthrough-Listen

Solution to the SETI Breakthrough Listen Competition hosted on Kaggle

28
Experimental
3221 fuglede/llama.ttf

A font for writing tiny stories

28
Experimental
3222 KillerShoaib/RLM-From-Scratch

Implementation of Recursive Language Model paper from scratch

28
Experimental
3223 IsaacRodgz/multimodal-transformers-movies

Experiments with multimodal deep learning models based on transformers

28
Experimental
3224 sayakpaul/vision-transformers-tf

A non-exhaustive collection of vision transformer models implemented in TensorFlow.

28
Experimental
3225 InternLM/Spark

An official implementation of "SPARK: Synergistic Policy And Reward...

28
Experimental
3226 Shekswess/tiny-think

Reasoning-first post-training for tiny language models (140M) on a single GPU.

28
Experimental
3227 dkurt/optimum-openvino

Intel OpenVINO extension for Hugging Face Transformers

28
Experimental
3228 rhubarbwu/linguistic-collapse

Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models...

28
Experimental
3229 Kevo-03/AttentionNet

AttentionNet: Encrypted Network Traffic Classification Solution with...

28
Experimental
3230 grouzen/ollana

Ollama over LAN - Auto-discover your Ollama server on your local network...

28
Experimental
3231 mintaywon/IF_RLHF

Source code for 'Understanding impacts of human feedback via influence functions'

28
Experimental
3232 kaylode/vqa-transformer

Visual Question Answering using Transformer and Bottom-Up attention....

28
Experimental
3233 knoveleng/steering

Official repo for the paper: "Selective Steering: Norm-Preserving Control...

28
Experimental
3234 Beomi/easy-lm-trainer

🤗 Sample code for training an LM with minimal setup

28
Experimental
3235 nullHawk/simple-transformer

Implementation of Transformer model in PyTorch

28
Experimental
3236 sober-clever/ReRe

The implementations of paper "Reinforced Preference Optimization for...

28
Experimental
3237 Hexastack/hexabot-template-starter

Hexabot Project Starter Template, fork this project to create your own...

28
Experimental
3238 rozek/node-red-flow-llama

Node-RED Flow (and web page example) for the LLaMA AI model

28
Experimental
3239 YanSte/NLP-LLM-Fine-tuning-Llame-2-QLoRA-2024

Natural Language Processing (NLP) and Large Language Models (LLM) with...

28
Experimental
3240 line/sacpo

[NeurIPS 2024] SACPO (Stepwise Alignment for Constrained Policy Optimization)

28
Experimental
3241 rahul13ramesh/compositional_capabilities

Compositional Capabilities of Autoregressive Transformers: A Study on...

28
Experimental
3242 subhasisj/FastAPI-Streamlit-Docker-NLP

Text Classification model deployment using FastAPI, Streamlit and Docker Compose

28
Experimental
3243 namuan/snap-assist

Summon intelligence in a snap

28
Experimental
3244 OSU-NLP-Group/QA4RE

[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models...

28
Experimental
3245 kaistAI/LangBridge

[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision

28
Experimental
3246 fattorib/transformer_shmap

Tensor Parallelism with JAX + Shard Map

28
Experimental
3247 GraphPKU/CoI

Chain of Images for Intuitively Reasoning

28
Experimental
3248 eilamc14/Simplify-This

Comparative Analysis of Prompt-Based and Fine-Tuned LLMs

28
Experimental
3249 yukyunglee/transformers-resources

huggingface transformers tutorial, code, resources

28
Experimental
3250 Qwen-Applications/STAR

STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function...

28
Experimental
3251 raghavbali/text_generation

Notebooks to better understand text generation

28
Experimental
3252 matt-k-wong/mlx-flash

Lightning-fast MLX utilities and optimizations for Apple Silicon

28
Experimental
3253 yucc2018/share

A collection of shared code practice examples.

28
Experimental
3254 robjsliwa/mlx-sd-single-file-models

Single safetensors file support Apple MLX Stable Diffusion

28
Experimental
3255 LEL-A/doc

Overarching documentation and planning to build so-called...

28
Experimental
3256 vbario/sleeping-llm

A language model that forms persistent memories from conversation and...

28
Experimental
3257 SJTU-IPADS/Bamboo

Bamboo-7B Large Language Model

28
Experimental
3258 Cre4T3Tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with...

28
Experimental
3259 cbacary/MoDeGPT

An implementation of the MoDeGPT LLM compression from the ICLR 2025...

28
Experimental
3260 jhuang265/Calibrating-LLMs-with-Label-Smoothing

Code to our ICML 2025 Paper "Calibrated Language Models and How to Find Them...

28
Experimental
3261 vbercy/g2tm-segmenter

Graph-Guided Token Merging (G2TM) is a lightweight one-shot module designed...

28
Experimental
3262 YukinoshitaKaren/Reason-KE

[EMNLP 2025 Findings] Robust Knowledge Editing via Explicit Reasoning Chains...

28
Experimental
3263 nerdimite/bert-finetuning-webinar

Code for the FullStack AI Live Coding Series - Part 1 (CellStrat AI Lab)

28
Experimental
3264 liziniu/GEM

Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large...

28
Experimental
3265 technion-cs-nlp/BiologicalTokenizers

Effect of tokenization on transformers for biological sequence

28
Experimental
3266 sanskar9999/CodeEvolveLLM

A framework for using local LLMs (Qwen2.5-coder 7B) that are fine-tuned...

28
Experimental
3267 robinzixuan/FROST

[ICLR 2026] FROST: Filtering Reasoning Outliers with Attention for Efficient...

28
Experimental
3268 frncscp/patacognition

Legacy repo for the Artificial Intelligence capable of patacón recognition...

28
Experimental
3269 lix19937/llm-deploy

AI Infra LLM infer/ tensorrt-llm/ vllm

28
Experimental
3270 llaraspata/HallucinationDetection

Analyzing the correlation between Hallucinations and Knowledge Conflicts in...

28
Experimental
3271 Frozen-Projects/AI_Cactus

Cactus AI framework plugin for UE5 to run local LLMs at runtime....

28
Experimental
3272 LightDopper/skill-codex

🚀 Enable automated code analysis and editing with Claude Code using Codex...

28
Experimental
3273 EMalagoli92/CvT-TensorFlow

TensorFlow 2.X reimplementation of CvT: Introducing Convolutions to Vision...

28
Experimental
3274 Beomi/transformers-language-modeling

Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3

28
Experimental
3275 YuanheZ/LoRA-One

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large ...

28
Experimental
3276 tsinghua-fib-lab/AAAI2025_MIA-Tuner

[AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training...

28
Experimental
3277 mts-ai/OpenAutoNLU

An open-source pipeline for training natural language understanding models

28
Experimental
3278 ArneBinder/pytorch-ie-hydra-template-1

PyTorch-IE Hydra Template

28
Experimental
3279 BenChaliah/NVFP4-on-4090-vLLM

AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with...

28
Experimental
3280 JIA-Lab-research/Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for...

28
Experimental
3281 proycon/deepfrog

An NLP-suite powered by deep learning

28
Experimental
3282 PKU-Alignment/aligner

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

28
Experimental
3283 julienokumu/Solving-ML-Papers

Attempts at solving machine learning papers(9/_)✅

28
Experimental
3284 atapour/rank-over-class

Source code for the training pipeline of the text ranking model used in the...

28
Experimental
3285 TamSiuhin/OPPU

Official Implementation of "Democratizing Large Language Models via...

28
Experimental
3286 dnbaker/bioseq

Tokenizers and Machine Learning Models for biological sequence data

28
Experimental
3287 osiriszjq/impulse_init

Convolutional Initialization for Data-Efficient Vision Transformers

28
Experimental
3288 wjn1996/HugNLP

HugNLP is a unified and comprehensive NLP library based on HuggingFace...

28
Experimental
3289 Yusuf80216/QnATables-An-Intelligent-Question-Answering-System

Question Answering System to answer question over tables in a document

28
Experimental
3290 bhavsarpratik/transformers

Implementations of transformer models in pytorch

28
Experimental
3291 kyegomez/Multi-Model-Training

An experimental repository on research for training multiple models all at...

28
Experimental
3292 LehengTHU/AlphaRec

[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can...

28
Experimental
3293 ArenRedd/AI-Chatbot-using-LLaMA2

Uncensored AI: An open-source, unrestricted LLaMA 2 model for free and raw...

28
Experimental
3294 Armaggheddon/ClipServe

🚀 ClipServe: A fast API server for embedding text, images, and performing...

28
Experimental
3295 arifulislamat/local-voice-cloning-app

Powered by ChatterboxTTS | Transformer | Llama | Gradio

28
Experimental
3296 TobyYang7/Llava_Qwen2

Visual Instruction Tuning for Qwen2 Base Model

28
Experimental
3297 unipr-org/AI

AI - Artificial Intelligence course at the University of Parma (6 CFU).

28
Experimental
3298 aakinlalu/GenerativeAI

Series of generative artificial intelligence (AI) for creating new content,...

28
Experimental
3299 yoniLc/GeometricTransformerMolecule

Transformer for End to End Molecule Property Prediction

28
Experimental
3300 czg1225/VeriThinker

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

28
Experimental