All Transformer Models

7,795 models ranked by quality score · Page 10 of 78

Showing 901–1000 of 7,795
# Model Score Tier
901 WangRongsheng/ChatGenTitle

🌟 ChatGenTitle: a paper-title generation model fine-tuned on LLaMA using information from a million arXiv papers

43
Emerging
902 GAIR-NLP/MegaScience

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

43
Emerging
903 kastalimohammed1965/CLIP-fine-tune-registers-gated

Vision Transformers Need Registers. And Gated MLPs. And +20M params. Tiny...

43
Emerging
904 hao-ai-lab/JacobiForcing

Jacobi Forcing: Fast and Accurate Diffusion-style Decoding

43
Emerging
905 openjlc/riscv64-library

Some of the libraries (docs) on the RISCV64 architecture are easy for users...

43
Emerging
906 cleopatra-itn/fair_multimodal_sentiment

Code and Splits for the paper "A Fair and Comprehensive Comparison of...

43
Emerging
907 varunkumar-dev/TransformersDataAugmentation

Code associated with the "Data Augmentation using Pre-trained Transformer...

43
Emerging
908 cdpierse/script_buddy_v2

Script Buddy v2 is a film script text generation tool built using film...

43
Emerging
909 magpie-align/magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs...

43
Emerging
910 jasonvanf/llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

43
Emerging
911 obss/trapper

State-of-the-art NLP through transformer models in a modular design and...

43
Emerging
912 mutablelogic/go-llm

Large Language Model API interface

43
Emerging
913 AviSoori1x/Tuning-the-Finetuning

Tuning the Finetuning: An exploration of achieving success with QLoRA

43
Emerging
914 Archimedes1618/Madlab

Madlab is an advanced AI development studio designed to streamline the...

43
Emerging
915 eric-ai-lab/MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language...

43
Emerging
916 vaswdeferenss/AI-Dialogue-Memory-Based-on-Hidden-State

🤖 Integrate LSTM into Transformer models to enhance dialog memory, offering...

43
Emerging
917 DAGroup-PKU/MHLA

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head...

43
Emerging
918 cambridgeltl/visual-med-alpaca

Visual Med-Alpaca is an open-source, multi-modal foundation model designed...

43
Emerging
919 datastone-spirit/spirit-lora-trainer

Spirit Lora Trainer is a robust toolkit for training Flux1-LoRA models with...

43
Emerging
920 CodeWithKyrian/transformers-php

Transformers PHP is a toolkit for PHP developers to add machine learning...

43
Emerging
921 nerve-sparks/iris_android

IRIS is an android app for interfacing with GGUF / llama.cpp models locally.

43
Emerging
922 kyegomez/attn_res

A clean, single-file PyTorch implementation of Attention Residuals (Kimi...

43
Emerging
923 haoliuhl/ringattention

Large Context Attention

43
Emerging
924 VikParuchuri/textbook_quality

Generate textbook-quality synthetic LLM pretraining data

43
Emerging
925 zalkikar/mlm-bias

Measuring Biases in Masked Language Models for PyTorch Transformers. Support...

43
Emerging
926 mytechnotalent/RE-GPT

Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you...

43
Emerging
927 datawhalechina/base-llm

A full-stack algorithm tutorial from NLP to LLM; read online at https://datawhalechina.github.io/base-llm/

43
Emerging
928 modelscope/dash-infer

DashInfer is a native LLM inference engine aiming to deliver...

43
Emerging
929 ethicalabs-ai/kurtis

Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small...

43
Emerging
930 RManLuo/graph-constrained-reasoning

Official Implementation of ICML 2025 Paper: "Graph-constrained Reasoning:...

43
Emerging
931 CLAIRE-Labo/EvoTune

Efficiently discovering algorithms via LLMs with evolutionary search and...

43
Emerging
932 ruimalheiro/training-custom-llama

Llama-style transformer in PyTorch with multi-node / multi-GPU training....

43
Emerging
933 aliemo/transfomers-silicon-research

Research and materials on hardware implementation of Transformer models

43
Emerging
934 michael-borck/study-buddy

Desktop AI tutoring app with local inference using Ollama for...

43
Emerging
935 Tongjilibo/build_MiniLLM_from_scratch

Build a MiniLLM from scratch (pretrain + SFT + DPO, in progress)

43
Emerging
936 harveybc/predictor

Predictor that uses a configurable plugin-based predictive supervised...

43
Emerging
937 amirhossein-kz/HiFormer

HiFormer: Hierarchical Multi-scale Representations Using Transformers for...

43
Emerging
938 DC-research/TEMPO

The official code for "TEMPO: Prompt-based Generative Pre-trained...

43
Emerging
939 ShivamRajSharma/Transformer-Architectures-From-Scratch

Implementation of transformer-based architectures in PyTorch.

43
Emerging
940 Eiztrips/ai-responder

A tool for creating and training models that imitate a communication style...

43
Emerging
941 skylight-org/sparse-attention-hub

Advancing the frontier of efficient AI

43
Emerging
942 soldni/pyterrier_sentence_transformers

Create PyTerrier compatible dense indices using any sentence_transformers model

43
Emerging
943 alibaba/GraphTranslator

GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

43
Emerging
944 Michael-A-Kuykendall/shimmytok

Pure Rust tokenizer for GGUF models - llama.cpp compatible

43
Emerging
945 dipanjanS/adv_nlp_workshop_odsc_europe22

Extensive tutorials for the Advanced NLP Workshop in Open Data Science...

43
Emerging
946 datamllab/LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

43
Emerging
947 gohjiayi/suicidal-text-detection

Building a suicidal text detection model and mental health chatbot with deep...

43
Emerging
948 zeozeozeo/ellama

Friendly interface to chat with an Ollama instance.

43
Emerging
949 jianghoucheng/AnyEdit

AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025

43
Emerging
950 huangwl18/language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting...

43
Emerging
951 IlyaGusev/rulm

Language modeling and instruction tuning for Russian

43
Emerging
952 lxuechen/private-transformers

A codebase that makes differentially private training of transformers easy.

43
Emerging
953 armbues/SiLLM

SiLLM simplifies the process of training and running Large Language Models...

43
Emerging
954 xlang-ai/Binder

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

43
Emerging
955 csiro-robotics/HOTFormerLoc

[IEEE/CVF CVPR 2025] Hierarchical Octree Transformer for Versatile Lidar...

43
Emerging
956 chanind/linear-relational

Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)...

43
Emerging
957 njchoma/transformer_image_caption

Image Captioning based on Bottom-Up and Top-Down Attention model

42
Emerging
958 Nuked88/ComfyUI-N-Nodes

A suite of custom nodes for ComfyUI that includes GPT text-prompt...

42
Emerging
959 SomeBottle/Konnyaku

A simple and robust LLM workflow for anime subtitle file translation. | Based on...

42
Emerging
960 canyuchen/ClinicalBench

Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in...

42
Emerging
961 Yachay-AI/byt5-geotagging

Confidence and ByT5-based geotagging model that predicts coordinates from text alone.

42
Emerging
962 deepreinforce-ai/CUDA-L2

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through...

42
Emerging
963 mala-lab/SEMPO

[NeurIPS 2025] Official implementation of "SEMPO: Lightweight Foundation...

42
Emerging
964 phronmophobic/llama.clj

Run LLMs locally. A clojure wrapper for llama.cpp.

42
Emerging
965 ssbuild/deep_training

deep learning

42
Emerging
966 zetavg/LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models based on LLaMA,...

42
Emerging
967 AntixK/PyTorch-Model-Compare

Compare neural networks by their feature similarity

42
Emerging
968 hellotransformers/Natural_Language_Processing_with_Transformers

Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial

42
Emerging
969 illiterate/BertClassifier

BERT Chinese text classification model implemented in PyTorch

42
Emerging
970 KolosalAI/kolosal-server

Kolosal AI is an open-source, lightweight alternative to Ollama to run...

42
Emerging
971 NetEase-Media/grps_trtllm

Higher performance OpenAI LLM service than vLLM serve: A pure C++...

42
Emerging
972 princeton-nlp/LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via...

42
Emerging
973 bahree/helloLondon

Historical Language Model for London - A specialized LLM trained on...

42
Emerging
974 ruanchaves/napolab

The Natural Portuguese Language Benchmark (Napolab). Stay up to date with...

42
Emerging
975 the-crypt-keeper/can-ai-code

Self-evaluating interview for AI coders

42
Emerging
976 txsun1997/Black-Box-Tuning

ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022:...

42
Emerging
977 EleutherAI/DALLE-mtf

OpenAI's DALL-E for large-scale training in mesh-tensorflow.

42
Emerging
978 uclaml/SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

42
Emerging
979 nlp-uoregon/mlmm-evaluation

Multilingual Large Language Models Evaluation Benchmark

42
Emerging
980 ArdaGnsrn/ollama-php

This is a PHP library for Ollama. Ollama is an open-source project that...

42
Emerging
981 luchangli03/export_llama_to_onnx

Export LLaMA to ONNX

42
Emerging
982 AviSoori1x/seemore

From scratch implementation of a vision language model in pure PyTorch

42
Emerging
983 hitz-zentroa/whisper-lm-transformers

Add n-gram and LLM language model support to HF Transformers Whisper models.

42
Emerging
984 adarshM84/TextLLaMACode

Transform your writing with TextLLaMA! ✍️🚀 Simplify grammar, translate...

42
Emerging
985 CVxTz/music_genre_classification

Music genre classification: LSTM vs Transformer

42
Emerging
986 scientific-discovery/LLEMA

[ICLR 2026] LLEMA: Evolutionary Search with LLMs for Multi-Objective...

42
Emerging
987 RobertCsordas/ndr

The official repository for our paper "The Neural Data Router: Adaptive...

42
Emerging
988 jingedawang/TutorialLLM

LLM Tutorial for Everyone.

42
Emerging
989 argosopentech/MetalTranslate

Customizable machine translation in C++

42
Emerging
990 ariya/chat-llm

Chat with an LLM

42
Emerging
991 jd-coderepos/llms4subjects

The official SemEval 2025 Task 5 - LLMs4Subjects - Shared Task Dataset repository

42
Emerging
992 Dartvauder/NeuroSandboxWebUI

(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,...

42
Emerging
993 withcaer/curtana

Simplified zero-cost wrapper over llama.cpp powered by the llama-cpp-2 crate.

42
Emerging
994 Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

42
Emerging
995 HamedBabaei/LLMs4OM

LLMs4OM: Matching Ontologies with Large Language Models

42
Emerging
996 AbdelStark/attnres

Rust implementation of Attention Residuals from MoonshotAI/Kimi

42
Emerging
997 nv-tlabs/LLaMA-Mesh

Unifying 3D Mesh Generation with Language Models

42
Emerging
998 USC-FORTIS/AD-LLM

[ACL Findings 2025] A benchmark for anomaly detection using large language...

42
Emerging
999 tosiyuki/LLaVA-JP

LLaVA-JP is a Japanese VLM trained with the LLaVA method

42
Emerging
1000 FreeOCR-AI/layoutreader

A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.

42
Emerging