All Transformer Models

7,795 models ranked by quality score · Page 39 of 78

Showing 3801–3900 of 7,795
# Model Score Tier
3801 KevinBian107/MOSAIC

Motif-preserving Graph Tokenization for Biological Structure Generation...

24
Experimental
3802 jaslatendresse/llm-demo

This repository demonstrates how to do inference using llama.cpp on a...

24
Experimental
3803 DigitalHarborFoundation/FlexEval

FlexEval is an LLM evaluation tool designed for practical quantitative analysis.

24
Experimental
3804 dkopi/Bitune

Implementation of Bitune: Bidirectional Instruction-Tuning

24
Experimental
3805 SupratikB23/HarmonyRL

Deep Learning framework for generating symbolic music (MIDI) using...

24
Experimental
3806 sambitbhaumik/siamese-nn-sts

Project files contain PyTorch implementations for Siamese BiLSTM models for...

24
Experimental
3807 Uranarc/Disentanglement

Comparative NLP study: BERTopic vs. Llama 3 for conversation...

24
Experimental
3808 Josephrp/SmolFactory

finetune gpt-oss and smollm3 on your data easily and cheaply

24
Experimental
3809 hppRC/simple-simcse-ja

Exploring Japanese SimCSE

24
Experimental
3810 ToluClassics/LowResourceOCR

This work is an adaptation of CNN+Transformer architecture to training text...

24
Experimental
3811 abgache/NanoGPL

Small test generative pre-trained LAM (Linear Attention Mechanism).

24
Experimental
3812 longday1102/VietAI-experiment-LLaMA2

⚡ LLaMA-2 model experiment

24
Experimental
3813 Blinorot/ALARM

Official Implementation of "ALARM: Audio–Language Alignment for Reasoning Models"

24
Experimental
3814 SIC98/GPT2-python-code-generator

GPT2 finetuning with transformers 🤗

24
Experimental
3815 strickvl/isafpr_finetune

Finetuning an LLM for structured data extraction from press releases

24
Experimental
3816 ia-labo/French-News-Clustering

Text classification and clustering using transformers and Denstream.

24
Experimental
3817 saloni-1919/biosum-reliable

AI-powered biomedical text summarization using extractive NLP, biomedical...

24
Experimental
3818 LennartKeller/DeepTextClustering

Deep text clustering with language models

24
Experimental
3819 Hi-archers/MLaKE

COLING 2025: MLaKE: Multilingual Knowledge Editing Benchmark for Large...

24
Experimental
3820 icon-lab/TranSMS

Official Implementation of Transformers for System Matrix Super-resolution (TranSMS)

24
Experimental
3821 madara88645/VibeGraph

Turn any Python codebase into an interactive call graph with AI-powered...

24
Experimental
3822 AmbiTyga/Automated-Medical-Assistance

Paper: https://openreview.net/forum?id=jYV4ZXy0L5

24
Experimental
3823 thinkwee/NOVER

[EMNLP-2025] R1-Zero on ANY TASK

24
Experimental
3824 AlirezaSalehy/Tipsomaly

This is an extended version of the paper “TIPS Over Tricks: Simple Prompts...

24
Experimental
3825 Human-Centric-Machine-Learning/counterfactual-llms

Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.

24
Experimental
3826 aastroza/llm-teaching

Teaching materials on Large Language Models (LLMs)

24
Experimental
3827 zhengyima/knowqa

Baseline for the pretrained-model knowledge measurement competition, F1 0.35, BERTForMaskedLM

24
Experimental
3828 cvedix/omnisdk

On-device AI developer platform

24
Experimental
3829 pszemraj/decoder-pytorch-template

Hackable PyTorch template for decoder-only transformer architecture...

24
Experimental
3830 TarekkMU1911/AI-Agent-Diabetes-Diagnosis

This project builds an AI-powered agent to support diabetes patients using...

24
Experimental
3831 artpli/CodeIE

[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot...

24
Experimental
3832 krishnaplwl/Homework_Solver_LLM

A fine-tuned LLM to solve homework questions ranging from maths to science...

24
Experimental
3833 ArpitKadam/Attention-Is-All-You-Code

From Attention Mechanisms to Large Language Models — built from scratch.

24
Experimental
3834 NikolaOgnjenovic/WebWise

Full stack web app which lets users upload & browse videos in order to...

24
Experimental
3835 Talnz007/VulkanIlm

GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems a...

24
Experimental
3836 HKUNLP/multilingual-transfer

Code for paper “Language Versatilists vs. Specialists: An Empirical...

24
Experimental
3837 tensorchord/inference-benchmark

Benchmark for machine learning model online serving (LLM, embedding,...

24
Experimental
3838 tatwan/mastering_llm_deployments

This is based on my comprehensive course on deploying Large Language Models...

24
Experimental
3839 nnilayy/BioCore

A comprehensive bioinformatics platform/suite for molecular biology research...

24
Experimental
3840 asokraju/LangChainDatasetForge

Generating artificial datasets using langchain and finetuning the LLMs on...

24
Experimental
3841 loryanstrant/ha-transformers-theme

A Transformers theme for Home Assistant

24
Experimental
3842 abhayra12/StudentLife-Phenotyping

End-to-end behavioral prediction system using digital phenotyping. PyTorch...

24
Experimental
3843 Yash-Kavaiya/30-Days-LLM-Mastery-Course

30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep...

24
Experimental
3844 schmijul/TransformerForSignalPredicition

This is a private learning project to play around with Transformers

24
Experimental
3845 avijit-jana/huggingface-nlp-image-tool

An end‑to‑end application leveraging Hugging Face pretrained models for...

24
Experimental
3846 ovshake/rat

Reverse Attention Tracer: A lightweight API to visualize which words...

24
Experimental
3847 mtkaya/transformer-edge-optimization

Optimize Transformer models for edge devices

24
Experimental
3848 BLCK-B/Moerkepub

Local EPUB translation using multilingual Transformer models on GPU.

24
Experimental
3849 sergio11/llm_finetuning_and_evaluation

The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for...

24
Experimental
3850 ayaka14732/TrAVis

TrAVis: Visualise BERT attention in your browser

24
Experimental
3851 mosh98/MMBT

Multimodal BiTransformer [Reimplementation] in PyTorch That Actually Works!

24
Experimental
3852 madeburo/GEO-AI-Shopify

AI Search Optimization for Shopify. Generate llms.txt, AI crawler rules and...

24
Experimental
3853 tianzhaotju/LEAM

We propose a novel DL-based mutation technique (LEAM), which adapts the...

24
Experimental
3854 wondergo2017/LLM4DyG

Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models...

24
Experimental
3855 Eric2i/LLM-MindMap

EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...

24
Experimental
3856 InternRobotics/Grounded_3D-LLM

Code&Data for Grounded 3D-LLM with Referent Tokens

24
Experimental
3857 DresOperatingSystems/Dresguardian

Privacy First Telegram Group Management Bot with Built in AI and DuckDuckGo...

24
Experimental
3858 inuwamobarak/Meta-Llama-3-8B

Experiments with the Meta-Llama-3-8B

24
Experimental
3859 byroneverson/Mia

A simple swift app for MacOS/iOS to test large language models (LLM)

24
Experimental
3860 Fromsko/neural_friend_kit

Train a neural network on WeChat chat logs to replicate a friend's speaking style

24
Experimental
3861 lakshyaag/Deep-Learning-From-Scratch

Implementing popular deep learning papers in PyTorch

24
Experimental
3862 shreydan/scratchformers

Building various transformer model architectures and their modules from scratch.

24
Experimental
3863 timvvvht/HKEX-Announcement-Classifier

A project on data exploration, analysis and using a neural network to...

24
Experimental
3864 hsj576/GTO

Official Implementation of "Bridging Draft Policy Misalignment: Group Tree...

24
Experimental
3865 TheDarkchip/nfp

Lean 4 library + CLI for rigorous bounds in transformer computations...

24
Experimental
3866 mirzayasirabdullahbaig07/Fine-Tuning-LLaMA-3.2-3B-Using-PEFT-LoRA

This project showcases parameter-efficient fine-tuning of the LLaMA 3.2 (3B)...

24
Experimental
3867 mhajder/llama.cpp-updater

A shell script to automatically update or build llama.cpp with optimal GPU...

24
Experimental
3868 MingSun-Tse/Awesome-Efficient-ViT

Recent Advances on Efficient Vision Transformers

24
Experimental
3869 saichandrapandraju/TabQGen

This repository hosts the code for the paper "Answer-Aware Question...

24
Experimental
3870 Ebimsv/LLM-Lab

Pretraining and Finetuning Language Model

24
Experimental
3871 amanongithub7/classical-music-generation

Comparing LSTM and Transformer-based deep learning approaches for classical...

24
Experimental
3872 bassrehab/credit_risk

Forecast long sequence default/downgrade of corporate entities and financial...

24
Experimental
3873 afspies/attention-tutorial

Jupyter Notebook tutorial on Attention Mechanisms, Position Embeddings and...

24
Experimental
3874 danelpeng/Awesome-Continual-Leaning-with-PTMs

This is a curated list of "Continual Learning with Pretrained Models" research.

24
Experimental
3875 Devnetly/image-captioning

Image captioning model & application based on transformers.

24
Experimental
3876 a1exus/koda

Local LLM orchestration — run GGUF models via llama.cpp with one command

24
Experimental
3877 MouxiaoHuang/PPE

[ICLR 2026] Official code of PPE: Positional Preservation Embedding for...

24
Experimental
3878 yuval6957/SIIM-Transformer

Yuval and nosound models and write-up for Kaggle's competition "SIIM-ISIC...

24
Experimental
3879 ExposedCat/tg-local-llm

Run local LLMs powered up by tools in Telegram Messenger

24
Experimental
3880 pablo-reyes8/implementing-gpt

Clean-room GPT-2/GPT-3 implementation: tokenizers, architecture blocks,...

24
Experimental
3881 Stoksweet/modlable

A platform for building, training and running inference on TensorflowJS...

24
Experimental
3882 DoctorLai/SimilarString

Compute the score of similarity between two strings

24
Experimental
3883 JexanJoel/VoiceIQ-Backend

AI engine for VoiceIQ - transcribes Hinglish & Tanglish call recordings via...

24
Experimental
3884 RitoCryo/DeepRWKV-Reasoning

🔍 Enhance reasoning in Large Language Models with DeepRWKV-Reasoning, using...

24
Experimental
3885 Andrew2077/Alpaca

Simple Q/A app, where I created a UI for the Alpaca (fine-tuned LLaMA) model...

24
Experimental
3886 Technolog796/image_captioning

Building a Russian-language model for image captioning

24
Experimental
3887 RUCKBReasoning/CodeRM

Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of...

24
Experimental
3888 farukalamai/background-removal-birefnet

Background Removal Application using BiRefNet

24
Experimental
3889 yubainu/sibainu-engine

Real-time hallucination detection for LLMs via Geometric Drift Analysis in...

24
Experimental
3890 showlab/VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text...

24
Experimental
3891 GoWtEm/llm-model-selector

A high-performance Rust utility that analyzes your system hardware to...

24
Experimental
3892 AI-14/pkatransnet

[IVC 2025] [Official code] - Enhancing radiology report generation: A prior...

24
Experimental
3893 shreyansh26/LLM-Sampling

A collection of various LLM sampling methods implemented in pure Pytorch

24
Experimental
3894 NachoPeinador/FRUGAL_AI_CHIP

FrugalAI Chip: Modular silicon architecture for disposable AI. Achieves...

24
Experimental
3895 Assaoka/Guide-to-Advanced-LLM-Techniques

This repository is a complete, practical tutorial that explores methodologies...

24
Experimental
3896 harshpimpale/LegalMind

A project that uses Large Language Models (LLMs) to assist users with legal...

24
Experimental
3897 byramsubramanian/yt-video-summarizer

Video Summarization Experiments with Open LLMs

23
Experimental
3898 MiuLab/InstUPR

Source code of our paper "InstUPR: Instruction-based Unsupervised Passage...

23
Experimental
3899 IonutIga/LLMs-for-KGC

Repository for experiments regarding the assessment of the suitability of...

23
Experimental
3900 sayhitosandy/Mamba_SSM

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

23
Experimental