All Transformer Models

7,795 models ranked by quality score

Showing 1801–1900 of 7,795
# Model Score Tier
1801 niclasgriesshaber/llm_patent_pipeline

LLMs for Historical Dataset Construction from Archival Image Scans

36
Emerging
1802 fmueller/scribae

CLI to turn Markdown notes into SEO briefs, drafts, metadata, and...

36
Emerging
1803 nanxiang11/CodeLab_LLM

🌟 A tutorial on large language model principles and practice, starting from LLaMA2

36
Emerging
1804 ykjaat6104/LLM-Cost-and-Token-Efficiency-Analysis

A benchmark study analyzing cost and token efficiency across 14 LLMs from 5...

36
Emerging
1805 clint-kristopher-morris/llm-guided-evolution

LLM Guided Evolution - The Automation of Models Advancing Models

36
Emerging
1806 jackaduma/ChatGLM-LoRA-RLHF-PyTorch

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer...

36
Emerging
1807 MaxiDonkey/DelphiGroqCloud

The GroqCloud API wrapper for Delphi provides access to models from Meta,...

36
Emerging
1808 jagilley/fact-checker

Fact-checking LLM outputs with self-ask

36
Emerging
1809 OSU-STARLAB/Simul-LLM

[ACL 2024] An easily extensible framework for simultaneous, text-to-text...

36
Emerging
1810 PaddlePaddle/PALM

a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and...

36
Emerging
1811 assembly-automation-hub/repo-governance

⚙️ Reusable GitHub repository governance kit: CI/CD workflows, CodeQL SAST,...

36
Emerging
1812 zrr1999/emotion-recognition

Research on Multimodal Emotion Recognition Methods

36
Emerging
1813 JRC1995/BERT-Disaster-Classification-Capsule-Routing

Exploration of BERT-BiLSTM models with Layer Aggregation (attention-based...

36
Emerging
1814 ariya/query-llm

Query LLM with Chain-of-Thought

36
Emerging
1815 SimeonHristov99/DL_25-26

Practice sessions for the course "Introduction to deep learning" in the...

36
Emerging
1816 martin-wey/peft-llm-code

Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...

36
Emerging
1817 jaketae/alibi

PyTorch implementation of Train Short, Test Long: Attention with Linear...

36
Emerging
1818 ziplab/HVT

[ICCV 2021] Official implementation of "Scalable Vision Transformers with...

36
Emerging
1819 luciusssss/ZhuangBench

[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly

36
Emerging
1820 antonyvigouret/Pay-Attention-to-MLPs

My implementation of the gMLP model from the paper "Pay Attention to MLPs".

36
Emerging
1821 zhongkaifu/TensorSharp

A C# inference engine for running large language models (LLMs) locally using...

36
Emerging
1822 AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation

Fine-tuned BERT on the SQuAD 2.0 dataset. Applied Knowledge Distillation (KD)...

36
Emerging
1823 aquadzn/deploy-transformers

Easily deploy a state-of-the-art language model from HuggingFace's Transformers

36
Emerging
1824 sigeisler/reinforce-attacks-llms

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,...

36
Emerging
1825 deep-symbolic-mathematics/Multimodal-Math-Pretraining

[ICLR 2024 Spotlight] This is the official code for the paper "SNIP:...

36
Emerging
1826 camenduru/alpaca-lora-colab

Alpaca Lora

36
Emerging
1827 AutonomicPerfectionist/PipeInfer

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

36
Emerging
1828 shoppollama/shoppollama

Open Source Agentic Commerce Platform built on Ollama and Stripe — Run...

36
Emerging
1829 JHansiduYapa/Fine-Tuning-a-Small-Language-Model-for-Cypher-Query-Generation

This project fine-tunes Unsloth's Gemma-3 4B IT (4-bit) model to translate...

36
Emerging
1830 sinanuozdemir/oreilly-bert-nlp

This repository contains code for the O'Reilly Live Online Training for BERT

36
Emerging
1831 NgJaBach/dark-kit

Collect and share guidance + code snippets for running LM-related tasks.

36
Emerging
1832 dvgodoy/LLM-visuals

Over 60 figures and diagrams of LLMs, quantization, low-rank adapters...

36
Emerging
1833 flowersteam/LLM-Culture

Code for the "Cultural evolution in populations of Large Language Models" paper

36
Emerging
1834 alantess/gtrxl-torch

Gated Transformer Model for Computer Vision

36
Emerging
1835 thanhlecongg/Invalidator

Invalidator: Automated Patch Correctness Assessment via Semantic and...

36
Emerging
1836 gotzmann/booster

Booster - open accelerator for LLM models. Better inference and debugging...

36
Emerging
1837 m-horky/sllm

Tools using small Large Language Models

36
Emerging
1838 yongchao98/R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...

36
Emerging
1839 RishabSA/Sketch2Graphviz

Sketch2Graphviz allows you to convert sketches or images of graphs and...

36
Emerging
1840 crux82/BISS-2024

This repository hosts materials from the Bertinoro International Spring...

36
Emerging
1841 abdur75648/V-Zen

V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel...

36
Emerging
1842 trzy/llava-cpp-server

LLaVA server (llama.cpp).

36
Emerging
1843 sashazykov/json-repair-rb

A simple Ruby gem designed to repair broken JSON strings

36
Emerging
1844 TIGER-AI-Lab/VisualWebInstruct

The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction...

36
Emerging
1845 etaoxing/multigame-dt

Implementation of Multi-Game Decision Transformers in PyTorch

36
Emerging
1846 qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM

A professional list on Large (Language) Models and Foundation Models (LLM,...

36
Emerging
1847 phvv-me/frame-representation-hypothesis

Official Repository for Frame Representation Hypothesis paper

36
Emerging
1848 sampathkethineedi/bert-topic-sentiment

Topic Based Sentiment Detection using BERT

36
Emerging
1849 dravenk/ollama-zig

Ollama Zig library

36
Emerging
1850 rickiepark/the-lm-book

Code repository for the Korean book <대규모 언어 모델, 핵심만 빠르게!> ("Large Language Models: Just the Essentials, Fast!"; Insight, 2025)

36
Emerging
1851 weiserlab/TinyLLM

Bringing Language Models to the Most Resource Constrained Devices

36
Emerging
1852 HenryNdubuaku/nanodl

Build GPT, Gemma, LLaMA, Mixtral, Whisper, Swin, ViT and more in JAX.

36
Emerging
1853 warner-benjamin/commented-transformers

Highly commented implementations of Transformers in PyTorch

36
Emerging
1854 urban-mobility-generation/Language-Modeling-for-Urban-Mobility

Language Modeling for Urban Mobility: A Data-Centric Review and Guidelines

36
Emerging
1855 saeeddhqan/tiny-transformer

Tiny transformer models implemented in pytorch.

36
Emerging
1856 grigio/llm-eval-simple

llm-eval-simple is a simple LLM evaluation framework with intermediate...

36
Emerging
1857 yihongXU/TransCenter

This is the official implementation of TransCenter (TPAMI). The code and...

36
Emerging
1858 nehalvaghasiya/interview-bot

AI-powered virtual interview bot to simulate real interview practice.

36
Emerging
1859 markusaksli/ai-music

A vanilla Transformer Decoder music generation model trained on Final Fantasy...

36
Emerging
1860 seedatnabeel/CLLM

Curated LLM (ICML 2024)

36
Emerging
1861 DAMO-NLP-SG/LLM-Zoo

LLM Zoo collects information on various open- and closed-source LLMs

36
Emerging
1862 Alsace08/Chain-of-Embedding

[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...

36
Emerging
1863 SingleZombie/LLSA

Official implementation of Log-linear Sparse Attention (LLSA).

36
Emerging
1864 jaketae/vit-breast-cancer

Transfer learning pretrained vision transformers for breast histopathology

36
Emerging
1865 laurab222/TSAD

Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate...

36
Emerging
1866 Praveengovianalytics/falcon-evaluate

Falcon Evaluate is an open-source Python library that aims to revolutionise the...

36
Emerging
1867 Azure/nlp-samples

Japanese NLP sample code

36
Emerging
1868 0x7o/text2keywords

Trained T5 and T5-large models for creating keywords from text

36
Emerging
1869 UCSC-REAL/DS2

[ICLR 2025] Official implementation of paper "Improving Data Efficiency via...

36
Emerging
1870 TIGER-AI-Lab/LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"...

36
Emerging
1871 MME-Benchmarks/Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark...

36
Emerging
1872 SculptAI/GIMKit

Guided Infilling Modeling Toolkit

36
Emerging
1873 abenechehab/dicl

[ICLR 2025] Official implementation of DICL (Disentangled In-Context...

36
Emerging
1874 IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

36
Emerging
1875 ray-project/ray-llm

RayLLM - LLMs on Ray (Archived). Read README for more info.

36
Emerging
1876 styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

36
Emerging
1877 monologg/HanBert-Transformers

HanBert on 🤗 Huggingface Transformers 🤗

36
Emerging
1878 Aloereed/llama.cpp-server-ohos

Llama.cpp server for OpenHarmony

36
Emerging
1879 suamin/T2NER

T2NER: Transformers based Transfer Learning Framework for Named Entity...

36
Emerging
1880 hristijanpeshov/SHAP-Explainable-Lexicon-Model

This project proposes a novel methodology to automatically learn financial...

36
Emerging
1881 bvanaken/visbert

VisBERT: Demo web app for "How Does BERT Answer Questions?"

36
Emerging
1882 gbaptista/ollama-ai

A Ruby gem for interacting with Ollama's API that allows you to run open...

36
Emerging
1883 cakshat/AlloyBERT

Introducing AlloyBERT: a transformer encoder-based model for predicting...

36
Emerging
1884 epfl-dlab/llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent...

36
Emerging
1885 cosbidev/NAIM

Official implementation for the paper "Not Another Imputation Method: A...

36
Emerging
1886 JinjieNi/MegaDLMs

GPU-optimized framework for training diffusion language models at any scale....

36
Emerging
1887 kyegomez/M2PT

Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:...

36
Emerging
1888 AaronFeng753/Ollama-Model-Dumper

Export and Backup Ollama models into GGUF and ModelFile

36
Emerging
1889 aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow

This repository contains the implementation of the paper Temporal Fusion...

36
Emerging
1890 yaph/charla

A terminal based chat application that works with AI language models.

36
Emerging
1891 abhilashreddys/Fake-News-Article

Detecting fake news articles by analyzing patterns in writing.

36
Emerging
1892 hao-ai-lab/DistCA

Efficient Long-context Language Model Training by Core Attention Disaggregation

36
Emerging
1893 GeorgeMichailidis/multi-task-mixed-freq

Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on...

36
Emerging
1894 real-stanford/reflect

[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation...

36
Emerging
1895 BUAADreamer/Chinese-LLaVA-Med

Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine

36
Emerging
1896 robinhad/kruk

Ukrainian instruction-tuned language models and datasets

36
Emerging
1897 ahans30/goldfish-loss

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

36
Emerging
1898 asigalov61/Perceiver-Music-Transformer

SOTA Google's Perceiver-AR Music Transformer Implementation and Model

36
Emerging
1899 zhilizju/Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and...

36
Emerging
1900 DFKI-NLP/thermostat

Collection of NLP model explanations and accompanying analysis tools

36
Emerging