All Transformer Models

7,795 models ranked by quality score · Page 13 of 78

Showing 1201–1300 of 7,795
# Model Score Tier
1201 ai4co/parco

[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization

40
Emerging
1202 hongyehu/Machine_Learning_Quantum_State_Tomography

An **unofficial** pytorch implementation of using generative models to do...

40
Emerging
1203 cdli-gh/Semi-Supervised-NMT-for-Sumerian-English

Exploring the Limits of Low-Resource Neural Machine Translation

40
Emerging
1204 asigalov61/Allegro-Music-Transformer

Full-attention multi-instrumental music transformer featuring asymmetrical...

40
Emerging
1205 clabrugere/scratch-llm

Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch,...

40
Emerging
1206 SamsungSAILMontreal/ghn3

Code for "Can We Scale Transformers to Predict Parameters of Diverse...

40
Emerging
1207 amazon-science/unified-ept

A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021

40
Emerging
1208 PediaMedAI/AggPose

[IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation...

40
Emerging
1209 lin-tan/clm

For our ICSE23 paper "Impact of Code Language Models on Automated Program...

40
Emerging
1210 muhtalhakhan/Hacktoberfest2024

Hacktoberfest 2024 🧑🏻‍💻 OPEN FIRST Pull Request 🎉

40
Emerging
1211 VPGTrans/VPGTrans

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA,...

40
Emerging
1212 JackZeng0208/llama.cpp-android-tutorial

llama.cpp tutorial on Android phone

40
Emerging
1213 bodeby/torchstack

🫧 probability-level model ensembling for transformers

40
Emerging
1214 developer239/llama.cpp-ts

llama.cpp 🦙 LLM inference in TypeScript

40
Emerging
1215 GAIR-NLP/ProX

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality...

40
Emerging
1216 l294265421/alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)...

40
Emerging
1217 skit-ai/SpeechLLM

This repository contains the training, inference, evaluation code for...

40
Emerging
1218 amitkedia007/Financial-Fraud-Detection-Using-LLMs

The aim of this dissertation is to assess the effectiveness of LLMs such as ...

40
Emerging
1219 luuyin/OWL

Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity...

40
Emerging
1220 yuanzhoulvpi2017/DocumentSearch

A document search tool built on sentence transformers and ChatGLM

40
Emerging
1221 vmicheli/delta-iris

Efficient World Models with Context-Aware Tokenization. ICML 2024

40
Emerging
1222 IAAR-Shanghai/Grimoire

Grimoire is All You Need for Enhancing Large Language Models

40
Emerging
1223 aJupyter/ThinkLLM

ThinkLLM: 🚀 Lightweight, efficient implementations of large language model algorithms

40
Emerging
1224 trrahul/llama2.cs

Inference Llama 2 in one file of pure C#

40
Emerging
1225 ShiZhengyan/DePT

[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed...

40
Emerging
1226 wuwangzhang1216/prometheus

Fully automatic censorship removal for language models. LoRA abliteration +...

40
Emerging
1227 OpenSparseLLMs/LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of...

40
Emerging
1228 ukairia777/pytorch-nlp-tutorial

A deep learning NLP repository that uses PyTorch to cover everything from text preprocessing to RAG, agents, and LLM fine-tuning.

40
Emerging
1229 architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction

Exploring the potential of fine-tuning Large Language Models (LLMs) like...

40
Emerging
1230 molbal/llm-text-completion-finetune

Guide on text completion large language model fine-tuning, including example...

40
Emerging
1231 prajjwal1/fluence

A deep learning library based on Pytorch focussed on low resource language...

40
Emerging
1232 JulesBelveze/bert-squeeze

🛠️ Tools for Transformers compression using PyTorch Lightning ⚡

40
Emerging
1233 kingabzpro/using-llama3-locally

Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface.

40
Emerging
1234 TrustedLLM/LLMDet

LLMDet is a text detection tool that can identify which generated sources...

40
Emerging
1235 rasbt/blog-finetuning-llama-adapters

Supplementary material for "Understanding Parameter-Efficient Finetuning of...

40
Emerging
1236 Bindwell/PLAPT

Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding...

40
Emerging
1237 jonrbates/turing

A PyTorch library for simulating Turing machines with neural networks, based...

40
Emerging
1238 eduard23144/locoformer

🤖 Explore LocoFormer, a Transformer-XL model that enhances robot locomotion...

40
Emerging
1239 Traffic-Alpha/LLM-Assisted-Light

This repository contains the code for the paper "LLM-Assisted Light:...

40
Emerging
1240 hans00/react-native-transformers-example

Example of transformers.js on React Native

40
Emerging
1241 Omid-Nejati/BEFUnet

A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation

40
Emerging
1242 ziplab/LIT

[AAAI 2022] This is the official PyTorch implementation of "Less is More:...

40
Emerging
1243 chenmozhijin/BSRoformer.cpp

GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation...

40
Emerging
1244 ShuntaroOkuma/adapt-gauge-core

Measure LLM adaptation efficiency — how fast models learn from few examples

40
Emerging
1245 MetaGLM/FinGLM

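FinGLM: Dedicated to building an open, public-interest, long-lasting financial large model project that uses open source and openness to advance "AI + finance".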

40
Emerging
1246 ECNU-ICALK/EduChat

An open-source educational chat model from ICALK, East China Normal...

40
Emerging
1247 neulab/knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling...

40
Emerging
1248 yang-ai-lab/SleepLM

SleepLM: Natural-Language Intelligence for Human Sleep

40
Emerging
1249 zyds/transformers-code

A hands-on, step-by-step Hugging Face Transformers course, with videos updated in sync on Bilibili and YouTube

40
Emerging
1250 YadaYuki/transformer-from-scratch

Transformer from scratch 🙊 (English to Japanese Translator by PyTorch)

40
Emerging
1251 saqib1707/gpt2-from-scratch

PyTorch Implementation of GPT-2

40
Emerging
1252 infocusp/llm_seminar_series

Material for the series of seminars on Large Language Models

40
Emerging
1253 metriccoders/one-line-llm-tuner

This repository is the source code for fine tuning any LLM in just one line 🔥

40
Emerging
1254 zejia-lin/BulletServe

Boosting GPU utilization for LLM serving via dynamic spatial-temporal...

40
Emerging
1255 geobrain-ai/geogalactica

Code and datasets for paper "GeoGalactica: A Scientific Large Language Model...

40
Emerging
1256 stanleylsx/llms_tool

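A large language model training and testing tool built on HuggingFace. Supports web UI and terminal prediction for various models, plus low-parameter and full-parameter model training (pre-training, SFT, RM, PPO, D...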

40
Emerging
1257 xjywhu/Awesome-Multimodal-LLM-for-Code

Multimodal Large Language Models for Code Generation under Multimodal Scenarios

40
Emerging
1258 jha-lab/acceltran

[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers

40
Emerging
1259 ChristophReich1996/Swin-Transformer-V2

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up...

40
Emerging
1260 HqWu-HITCS/Awesome-Chinese-LLM

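A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.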

40
Emerging
1261 VITA-Group/Q-GaLore

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank...

40
Emerging
1262 aniketmaurya/llm-inference

Large Language Model (LLM) Inference API and Chatbot

40
Emerging
1263 monologg/KoBigBird

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

40
Emerging
1264 samestrin/llm-pdf-ocr-api

A Python-based REST API for PDF OCR using AI models with PyTorch and...

40
Emerging
1265 AllenXiangX/SnowflakeNet

(TPAMI 2023) Snowflake Point Deconvolution for Point Cloud Completion and...

40
Emerging
1266 google-research/magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

40
Emerging
1267 DannyArends/DLLM

A minimal, clean D language interface for running LLM inference using...

40
Emerging
1268 AlexandrosChrtn/llama-fine-tune-guide

Fine-tune the newly released Llama-3.2 lightweight models.

40
Emerging
1269 iverly/llamafile-docker

Distribute and run llamafile/LLMs with a single docker image.

40
Emerging
1270 rednote-hilab/dots.llm1

The official repository of the dots.llm1 base and instruct models proposed...

40
Emerging
1271 google-deepmind/gemma_penzai

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding...

40
Emerging
1272 vicgalle/zero-shot-reward-models

ZYN: Zero-Shot Reward Models with Yes-No Questions

40
Emerging
1273 git-cloner/llama-lora-fine-tuning

llama fine-tuning with lora

40
Emerging
1274 RedHatResearch/conext24-NetConfEval

Benchmark for evaluating LLMs in network configuration problems.

40
Emerging
1275 ymcui/Chinese-Mixtral

Chinese Mixtral mixture-of-experts large language models (Chinese Mixtral MoE LLMs)

40
Emerging
1276 takashiishida/paper2slides

Transform any arXiv papers into slides using LLMs

40
Emerging
1277 hkust-nlp/deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

40
Emerging
1278 gusye1234/llm-as-function

Embed your LLM into a python function

40
Emerging
1279 sanjibnarzary/awesome-llm

Curated list of open source and openly accessible large language models

40
Emerging
1280 modelscope/mcore-bridge

MCore-Bridge: Providing Megatron-Core model definitions for state-of-the-art...

40
Emerging
1281 Sachithx/EntroPE

This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder...

40
Emerging
1282 softengg-manoj/dreamer4

🌟 Implement Dreamer 4 for training agents within scalable world models,...

40
Emerging
1283 sisinflab/Ducho

Ducho is a Python framework aimed to extract multimodal features used in...

40
Emerging
1284 abaheti95/LoL-RL

Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving...

40
Emerging
1285 harishdeivanayagam/rowfill

Open-source spreadsheets platform for deep research and document processing

40
Emerging
1286 johnmai-dev/NotebookMLX

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

40
Emerging
1287 JKevin17/TM-LLM

The official code for "(ISCC 2025) Network Traffic Matrix Imputation via...

40
Emerging
1288 sodascience/workshop_llm_data_collection

This repository contains the code and slides for our workshop on data...

40
Emerging
1289 KolosalAI/kolosal-cli

Super lightweight Ollama + Qwen Code alternative to run Llama 3.3,...

40
Emerging
1290 YuweiYin/FinPT

FinPT: Financial Risk Prediction with Profile Tuning on Pretrained Foundation Models

40
Emerging
1291 tpoisonooo/llama.onnx

LLaMa/RWKV onnx models, quantization and testcase

40
Emerging
1292 prismformore/Multi-Task-Transformer

Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting...

40
Emerging
1293 bigcode-project/selfcodealign

[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation

40
Emerging
1294 upb-lea/mag-net-hub

MagNet Toolkit - Certified Models of the MagNet Challenge

40
Emerging
1295 sunnynguyen-ai/llm-attention-visualizer

Interactive tool for analyzing attention patterns in transformer models with...

40
Emerging
1296 nlpodyssey/cybertron

Cybertron: the home planet of the Transformers in Go

40
Emerging
1297 xNul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local...

40
Emerging
1298 dingo-actual/infini-transformer

PyTorch implementation of Infini-Transformer from "Leave No Context Behind:...

40
Emerging
1299 swordlidev/Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

40
Emerging
1300 zyushun/Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More...

40
Emerging