All Transformer Models

7,795 models ranked by quality score · Page 37 of 78

Showing 3601–3700 of 7,795
# Model Score Tier
3601 lwch/llama2.go

Port of Facebook's LLaMA 2 model in pure Go that uses little memory

26
Experimental
3602 kuiperzone/Marklet-AI

Open Source AI Model Client

26
Experimental
3603 SkillichSE/Lumi-bot

A Telegram bot powered by aiogram integrated with a local LLM (LM Studio)....

26
Experimental
3604 cvcio/rtaa-classifier

Comments & Twitter accounts gRPC classification service.

26
Experimental
3605 olliverc1985/AXIOM

Lightweight Rust ML framework for training and deploying small transformer...

25
Experimental
3606 Lumi-node/model-garage

Open the hood on neural networks. Component-level model surgery, analysis,...

25
Experimental
3607 Abhinand20/MathFormer

MathFormer - Solve math equations using NLP and transformers!

25
Experimental
3608 neural-processing-lab/MEG-XL

Code for "MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training"

25
Experimental
3609 imsigma1/AI-Knowledge-Creativity

🧠 Power AI-driven tools for creative exploration and knowledge retrieval,...

25
Experimental
3610 xingbpshen/medical-calibration-fairness-mllm

[MICCAI 2025] The official implementation of the paper "Exposing and...

25
Experimental
3611 OpenDFM/HeadsUp

[ICML 2025] Codes for the paper "Heads up! Large Language Models Can Perform...

25
Experimental
3612 KrishnanJothi/MT5_Language_identification_NLP

MT5-small is fine-tuned on the downstream task of Natural Language...

25
Experimental
3613 theboringhumane/echoOLlama

🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features...

25
Experimental
3614 bgreenwell/statlingua

Explain Statistical Output with Large Language Models

25
Experimental
3615 mahsaama/ViT3D-BrainTumorSegmentation

Segmentation of Brain Tumors using Vision Transformer

25
Experimental
3616 SreeEswaran/Train-your-LLM

This repository contains code and resources for training, fine-tuning, and...

25
Experimental
3617 marvelefe/vit-brain-tumor

Vision Transformer (ViT) model for brain tumour detection and classification

25
Experimental
3618 Kareem404/hyper-connections

A minimal implementation of Manifold-Constrained Hyper-Connections (mHC)...

25
Experimental
3619 LimDoHyeon/EEG-LLM

Fine-tuned LLM for electroencephalography (EEG) data classification

25
Experimental
3620 friendshipkim/overfill

Code for OverFill: Two-Stage Models for Efficient Language Model Decoding

25
Experimental
3621 mkagenius/llm-token-visualizer

See How Big Exactly A 128k Token Text Is

25
Experimental
3622 Curtis-Wu/Equivariant-Graph-Transformer

A deep neural network with hybrid architecture (EGNN + Transformer) for...

25
Experimental
3623 ys-zong/VLGuard

[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision...

25
Experimental
3624 luo-junyu/Awesome-Data-Efficient-LLM

A list of data-efficient and data-centric LLM (Large Language Model) papers....

25
Experimental
3625 gabriellst/paraphrase.ia

paraphrase.ia is a Chrome extension that lets you make paraphrases of a...

25
Experimental
3626 kyegomez/Open-Olmo

Unofficial open-source PyTorch implementation of the OLMo Hybrid...

25
Experimental
3627 Hashmat02/Fine-Tuning-LLaMA-2-for-Toxicity-Classification

Fine-tuning LLaMA 2 for toxicity classification using a balanced Kaggle...

25
Experimental
3628 nlkli/lachat

minimal CLI client for llama-server

25
Experimental
3629 elinx/safe-view

A terminal-based application for visualizing and analyzing safetensors files.

25
Experimental
3630 itsShnik/adaptively-finetuning-transformers

Adaptively fine tuning transformer based models for multiple domains and...

25
Experimental
3631 machinelearningzuu/experiments-on-large-language-models

This Repository Contains Different Experiments on LLMs with Hugging Face,...

25
Experimental
3632 shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only...

25
Experimental
3633 o-messai/fastVLM

An implementation of FastVLM/LLaVA or any llm/vlm model using FastAPI...

25
Experimental
3634 yejoon-lee/kr3

KR3: Korean Restaurant Review with Ratings / Experiments on...

25
Experimental
3635 subhasisj/llm-product-insights

Extracting Product Insights from Unstructured text data using LLMs with LangChain

25
Experimental
3636 zhiyuanhubj/LongRecipe

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

25
Experimental
3637 s4um1l/aya-cross-lingual-probe

Mechanistic interpretability of cross-lingual concept representations in...

25
Experimental
3638 ambideXtrous9/Finetune-Qwen3-using-Unsloth

Finetune Qwen3 using Unsloth: Reasoning and Non-Reasoning Dataset

25
Experimental
3639 rafaelvp-db/langchain-sql-databricks

Simple examples of using LLMs and Langchain on Databricks

25
Experimental
3640 caesarnine/llm-experiments

Playing around with LLMs

25
Experimental
3641 sastpg/RFTT

RFTT: Reasoning with Reinforced Functional Token Tuning

25
Experimental
3642 Pomilon/LEMA

LEMA (Layer-wise Efficient Memory Abstraction): A hardware-aware framework...

25
Experimental
3643 nercone-dev/zeta-llm-dataset

Public Datasets for Zeta-Tool

25
Experimental
3644 Mr-TalhaIlyas/segformer

PyTorch Implementation of SegFormer: Simple and Efficient Design for...

25
Experimental
3645 taishan1994/qlora-chinese-LLM

Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE

25
Experimental
3646 dilbersha/llm-inference-benchmarking-3080

A production-grade telemetry-aware suite for benchmarking LLM inference...

25
Experimental
3647 joshxfi/bumblebee

🐝 Run on-device models directly from your browser via Transformers.

25
Experimental
3648 JIA-Lab-research/Q-LLM

This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration...

25
Experimental
3649 exitudio/GaitMixer

Official repository for "GaitMixer: Skeleton-based Gait Representation...

25
Experimental
3650 Brazilian-willametteriver232/llama.swift

🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled...

25
Experimental
3651 fshnkarimi/train_scheduling_assistant

This project utilizes a fine-tuned Large Language Model (LLM) to generate...

25
Experimental
3652 aditeyabaral/maple

Implementation of the paper, MAPLE - MAsking words to generate blackout...

25
Experimental
3653 inuwamobarak/nougat

Nougat is Meta AI's revolutionary OCR model designed to transcribe...

25
Experimental
3654 Bhargav1144/Mental_Health_Chatbot

A Streamlit-based AI chatbot offering compassionate mental health support...

25
Experimental
3655 mytechnotalent/MicroGPT

MicroGPT is a clean, educational implementation of the GPT (Generative...

25
Experimental
3656 SharathHebbar/ML-Project-list

List of all ML projects

25
Experimental
3657 Adora-Foundation/llm-energy-lab

Web application for benchmarking and comparing LLM behaviour, energy and...

25
Experimental
3658 cja5553/LLMs_in_perioperative_care

Codes for: Alba, C., Xue, B., Abraham, J. et al. The foundational...

25
Experimental
3659 mahadi-nahid/TabSQLify

[NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through...

25
Experimental
3660 ai-center-kth/cuBERT-source-code-clustering

Fine-tuning cuBERT embeddings for clustering source code by functionality

25
Experimental
3661 datvodinh/serve-llm

Serve high throughput and scalable LLM using Ray and vLLM

25
Experimental
3662 hpfield/Text2Touch

CoRL 2025 - Tactile In-Hand Manipulation with LLM-Designed Reward Functions

25
Experimental
3663 TristanLecourtois/NL2SQL

Text2SQL project comparing different LLM models

25
Experimental
3664 IsaacRodgz/Multimodal-Adapters

Adapter modules with support for multimodal fusion of information (text,...

25
Experimental
3665 DaemonLoki/MyAppleIntelligence

Custom implementation of Apple Intelligence features

25
Experimental
3666 DRSY/EasyKV

Easy control for Key-Value Constrained Generative LLM...

25
Experimental
3667 FredyRivera-dev/Flux2-from-scratch

This repo proposes to implement the Flux2 model from scratch

25
Experimental
3668 dwain-barnes/LLM-GGUF-Auto-Converter

Automated Jupyter notebook solution for batch converting Large Language...

25
Experimental
3669 Ahwar/NER-NLP-with-ONNX-Java

A Java NLP application that identifies names, organizations, and locations...

25
Experimental
3670 gesis24csspy/analyzing-text-data

Course materials on computational text analysis. John McLevey. 2024....

25
Experimental
3671 talmago/spacy_coref

Lightweight cross-lingual coreference resolution with spaCy using ONNX...

25
Experimental
3672 theonesud/embedia

Create LLM-powered webapps with ease

25
Experimental
3673 UCSC-VLAA/vllm-safety-benchmark

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in...

25
Experimental
3674 IbrahimSobh/askdoc

In this tutorial we will see 💡 How to get answers from documents using...

25
Experimental
3675 Yuan-ManX/infera

Infera — A High-Performance Inference Engine for Large Language Models.

25
Experimental
3676 zerob13/modelinfo-cli

A CLI to query AI model capabilities, context limits, and pricing from...

25
Experimental
3677 Omid-Nejati/Locality-iN-Locality

Robust Transformer with Locality Inductive Bias and Feature Normalization...

25
Experimental
3678 Arunkumar2510/LLM-Interview-Questions-and-Answers-Hub

🧠 Discover and prepare with 100+ LLM interview questions and answers to...

25
Experimental
3679 LennartKeller/roberta2longformer

Convert pretrained RoBERTa models to various long-document transformer models

25
Experimental
3680 x-zheng16/CALM

[AAAI 25] CALM: Curiosity-Driven Auditing for LLMs

25
Experimental
3681 BenChaliah/Superposition-Transformer

a novel architecture that leverages Autoencoders to superimpose the hidden...

25
Experimental
3682 len-sla/NLP_mBART_mT5_translation

Polyglot Power: mBART & mT5 Translation Toolkit ...

25
Experimental
3683 rafaelvp-db/db-ancient-code-translation

Simple repo showing code-to-code and code-to-text capabilities using LLMs on...

25
Experimental
3684 designer-coderajay/logit-lens-explorer

Mechanistic interpretability tool visualizing GPT-2's layer-by-layer...

25
Experimental
3685 HenryNdubuaku/super-lazy-autograd

Hand-derived memory-efficient VJPs for tuning LLMs on laptops.

25
Experimental
3686 scienceetonnante/eiffel-tower-llama

Let's try to reproduce the Golden Gate Claude demo, but using open-source...

25
Experimental
3687 chrisliu298/llm-unlearn-eco

[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts

25
Experimental
3688 gs-ai/mlm-memory

A functionally operational, mathematically unhinged system for achieving 10×...

25
Experimental
3689 Jiacheng-Zhu-AIML/AsymmetryLoRA

Preprint: Asymmetry in Low-Rank Adapters of Foundation Models

25
Experimental
3690 riccardodm97/QA-QG

Question Answering and Question Generation NLP tasks on the SQuAD v1.1 dataset

25
Experimental
3691 Mattbusel/llm-wasm

LLM inference primitives for WebAssembly — cache, retry, routing, guards,...

25
Experimental
3692 Kovelja009/handwriting-recognition

Benchmark of different network architectures for handwritten text recognition.

25
Experimental
3693 tetratensor/Stock-Market-News-Sentiment-Analysis

A Python-based news sentiment analysis using Hugging Face Sentiment Analysis...

25
Experimental
3694 HelpingAI/inferno

Run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1, and other...

25
Experimental
3695 AbdBarho/transformers-stack

A full stack solution for deploying a transformers model from HuggingFace

25
Experimental
3696 yashjakhotiya/Adversarial-Attacks-On-Transformers

Exploring vulnerabilities of Transformers-based Malware Detectors to...

25
Experimental
3697 AdamCoscia/iScore

Upload, score, and visually compare multiple LLM-graded summaries simultaneously!

25
Experimental
3698 Aaronhuang-778/SliM-LLM

[ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large...

25
Experimental
3699 gokul-pv/PanopticSegmentation

Panoptic segmentation on custom construction objects using DETR

25
Experimental
3700 Andras7/gpt2-pytorch

Extremely simple and understandable GPT2 implementation with minor tweaks

25
Experimental