All Transformer Models

7,795 models ranked by quality score · Page 21 of 78

Showing 2001–2100 of 7,795

#	Model	Score	Tier	Category	Stars	Language
2001	Ereboas/MagiCodec A single-layer, streaming codec model providing SOTA audio quality and...	35	Emerging	diffusion-language-models	113	Python
2002	vlarine/transformers-ru A list of pretrained Transformer models for the Russian language.	35	Emerging	neural-machine-translation	177	Jupyter Notebook
2003	Yog-Sotho/LLM-fine-tuner Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes....	35	Emerging	llm-fine-tuning	13	Python
2004	nsidn98/LLaMAR Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics	35	Emerging	llm-robot-planning	30	Jupyter Notebook
2005	surrey-nlp/NLP-2026 Labs for COM3029/COMM061 at University of Surrey	35	Emerging	nlp-learning-coursework	1	Jupyter Notebook
2006	hitz-zentroa/whisper-lm Add n-gram and large language model (LLM) support to Whisper models.	35	Emerging	llm-frameworks-libraries	41	Jupyter Notebook
2007	UIC-InDeXLab/RSR An Efficient Matrix Multiplication Algorithm for Accelerating Inference in...	35	Emerging	llm-cuda-optimization	17	Python
2008	JayZhang42/SLED SLED: Self Logits Evolution Decoding for Improving Factuality in Large...	35	Emerging	llm-training-experimentation	119	Python
2009	arrmansa/Basic-UI-for-GPT-Neo-with-low-vram A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)	35	Emerging	gpt2-pretraining-fine-tuning	36	Jupyter Notebook
2010	achimoraites/machine-learning-playground Having fun with ML	35	Emerging	ml-foundations-curricula	11	Jupyter Notebook
2011	yzGuu830/efficient-speech-codec [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector...	35	Emerging	power-transformer-design	125	Jupyter Notebook
2012	Baran-phys/Tropical-Attention [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic...	35	Emerging	transformer-architecture-tutorials	27	Python
2013	asigalov61/Orchestrator Local windowed attention multi-instrumental music transformer tailored for...	35	Emerging	ai-music-generation	17	Python
2014	marcobombieri/do-LLM-dream-of-ontologies Repository containing code and dataset of the paper "Do LLM Dream Of Ontologies?"	35	Emerging	llm-domain-datasets	1	Python
2015	HUBioDataLab/SELFormer SELFormer: Molecular Representation Learning via SELFIES Language Models	35	Emerging	molecular-generation-transformers	107	Python
2016	sichunluo/RecRanker [TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for...	35	Emerging	llm-recommendation-systems	16	Python
2017	krnel-ai/krnel-graph Lightweight representation engineering dataflow operations for agent developers.	35	Emerging	graph-transformers	22	Python
2018	turboline-ai/tsln-python Time Series Lean Notation for python, it is designed to maximize the token...	35	Emerging	llm-framework-abstractions	4	Python
2019	OpenNLPLab/TransnormerLLM Official implementation of TransNormerLLM: A Faster and Better LLM	35	Emerging	llm-implementation-tutorials	252	Python
2020	researchim-ai/models-at-home training models at home	35	Emerging	llm-fine-tuning	34	Python
2021	ShelbyJenkins/llm_utils llm_utils: Basic LLM tools, best practices, and minimal abstraction.	35	Emerging	rust-llm-infrastructure	48	Rust
2022	robertvacareanu/llm4regression Examining how large language models (LLMs) perform across various synthetic...	35	Emerging	llm-frameworks-libraries	162	Python
2023	GURPREETKAURJETHRA/Perfect-LLM-Model-Finder Perfect LLM Model Finder is a tool designed to simplify the overwhelming...	35	Emerging	llm-frameworks-libraries	5	Python
2024	jackaduma/Alpaca-LoRA-RLHF-PyTorch A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer...	35	Emerging	rlhf-alignment-training	61	Python
2025	Beomi/exbert-transformers exBERT on Transformers🤗	35	Emerging	bert-model-implementations	10	Python
2026	deepmancer/vlm-toolbox Vision-Language Models Toolbox: Your all-in-one solution for multimodal...	35	Emerging	vision-language-models	12	Jupyter Notebook
2027	amazon-science/recode Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"	35	Emerging	math-reasoning-datasets	58	Python
2028	Yigtwxx/PredictaLM PredictaLM is a lightweight Turkish language model designed for next-word...	35	Emerging	llm-implementation-tutorials	3	Python
2029	declare-lab/Auto-Scaling [Arxiv 2024] Official Implementation of the paper: "Towards Robust...	35	Emerging	instruction-tuning-datasets	9	Jupyter Notebook
2030	teelinsan/parallel-decoding Repository of the paper "Accelerating Transformer Inference for Translation...	35	Emerging	transformer-training-optimization	124	Python
2031	TheBrainLab/SGLFormer Spiking Global-Local Fusion Transformer	35	Emerging	spiking-neural-networks	21	Python
2032	moharamfatema/graduation-project Video vision transformers for hierarchical anomaly detection in video scenes.	35	Emerging	vision-transformer-classification	5	Jupyter Notebook
2033	ngoanpv/llama2_vietnamese A fine-tuned Large Language Model (LLM) for the Vietnamese language based on...	35	Emerging	llm-fine-tuning	17	Python
2034	TIGER-AI-Lab/General-Reasoner General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]	35	Emerging	llm-reasoning-research	222	Python
2035	Akshint0407/Automated-Answer-Checker AI-powered grading system for educators 🔹 Streamlit web app that automates...	35	Emerging	essay-scoring-grading	4	Python
2036	he-h/rhythm [NeurIPS 2025] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for...	35	Emerging	multimodal-vision-language-models	8	Python
2037	THUDM/Multilingual-GLM The multilingual variant of GLM, a general language model trained with...	35	Emerging	transformer-architecture-education	62	Python
2038	JerryYLi/valhalla-nmt Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for...	35	Emerging	multimodal-visual-grounding	28	Python
2039	bminixhofer/tokenkit A toolkit implementing advanced methods to transfer models and model...	35	Emerging	text-tokenization-libraries	64	Python
2040	iamgmujtaba/llama3.2-webUI LLaMa 3.2 Multimodal Web UI is a user-friendly interface for interacting...	35	Emerging	interactive-ai-chat-uis	35	PHP
2041	RenzeLou/Muffin MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following	35	Emerging	instruction-tuning-datasets	16	Python
2042	srvCodes/continual_learning_with_vit Code for our CVPR 2022 workshop paper "Towards Exemplar-Free Continual...	35	Emerging	mathematical-reasoning-transformers	24	Python
2043	InternRobotics/PointLLM [ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large...	35	Emerging	vision-language-instruction-tuning	983	Python
2044	DEV-D-GR8/SignSense This repository contains a transformer-based model for real-time American...	35	Emerging	3d-vision-transformers	12	Jupyter Notebook
2045	xf-zhao/LoT Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought...	35	Emerging	chain-of-thought-reasoning	30	Python
2046	Tanveer81/ReVisionLLM This is the official implementation of ReVisionLLM: Recursive...	35	Emerging	multimodal-vision-language	43	Python
2047	zjunlp/ModelKinship Exploring Model Kinship for Merging Large Language Models	35	Emerging	diffusion-language-models	27	Python
2048	OpenBMB/VisCPM [ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat...	35	Emerging	vision-language-instruction-tuning	1,070	Python
2049	NVlabs/NFT Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging...	35	Emerging	rlhf-alignment-training	71	Python
2050	Bruce-Lee-LY/decoding_attention Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using...	35	Emerging	sparse-attention-optimization	46	C++
2051	ai8hyf/llm_split_recall_test Split and Recall: A simple and efficient benchmark to evaluate in-context...	35	Emerging	llm-scaling-architecture	9	Python
2052	nlp-with-transformers/website Website for the Natural Language Processing with Transformers book	35	Emerging	huggingface-learning-resources	28	HTML
2053	AIFEG/BenchLMM [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large...	35	Emerging	domain-specific-benchmarks	86	Python
2054	hemangjoshi37a/hjAlgos AI based algorithmic trading platform for zerodha users	35	Emerging	ai-powered-business-analytics	6	HTML
2055	thushv89/packt_nlp_tensorflow_2 This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)	35	Emerging	nlp-learning-coursework	45	Jupyter Notebook
2056	gsarti/t5-flax-gcp Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP	35	Emerging	transformer-frameworks-wrappers	58	Python
2057	Wang-ML-Lab/llm-continual-learning-survey [CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey	35	Emerging	llm-research-curation	530	—
2058	leftmove/cria Run LLMs locally with as little friction as possible.	35	Emerging	local-llm-deployment	121	Python
2059	GeeeekExplorer/transformers-patch patches for huggingface transformers to save memory	35	Emerging	llm-implementation-tutorials	35	Python
2060	senadkurtisi/pytorch-image-captioning Transformer & CNN Image Captioning model in PyTorch.	35	Emerging	image-captioning-transformers	44	Python
2061	nlpodyssey/gotokenizers Go implementation of today's most used tokenizers	35	Emerging	tokenizer-libraries	44	Go
2062	BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling Repository hosting code to reproduce our paper (with Stanford and...	35	Emerging	llm-compression-optimization	18	Python
2063	BoHuangLab/Protein-Localization-Transformer Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein...	35	Emerging	protein-transformers-ml	29	Python
2064	deep-diver/segformer-tf-transformers This repository demonstrates how to use TensorFlow based SegFormer model in...	35	Emerging	medical-image-segmentation-transformers	30	Jupyter Notebook
2065	raghavagps/pptstab PPTStab: Designing of thermostable proteins with a desired melting temperature	35	Emerging	protein-design-llms	6	Python
2066	opendatalab/UrBench [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A...	35	Emerging	llm-benchmark-leaderboards	36	Python
2067	vicuna-tools/vicuna-installation-guide The "vicuna-installation-guide" provides step-by-step instructions for...	35	Emerging	mistral-ai-tools	282	—
2068	GURPREETKAURJETHRA/PaliGemma-Inference-and-Fine-Tuning PaliGemma Inference and Fine Tuning	35	Emerging	gemma-model-fine-tuning	5	Jupyter Notebook
2069	calpt/awesome-adapter-resources Collection of Tools and Papers related to Adapters / Parameter-Efficient...	35	Emerging	parameter-efficient-adapters	202	Python
2070	fattorib/fusedswiglu Fused SwiGLU Triton kernels	35	Emerging	transformer-architecture-tutorials	12	Python
2071	umbertocappellazzo/Llama-AVSR Official Pytorch implementation of "Large Language Models are Strong...	35	Emerging	multimodal-vision-language	57	Python
2072	UCSC-REAL/TokenCleaning [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained...	35	Emerging	instruction-tuning-datasets	51	Python
2073	nipunsadvilkar/roberta-base-mr RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x ...	35	Emerging	huggingface-learning-resources	28	Python
2074	maxi-w/llama2-chat-interface Gradio Chat Interface for Llama 2	35	Emerging	interactive-ai-chat-uis	28	Python
2075	worldbank/LLMs-Practical-Guide A practical introduction to Generative AI and LLMs, equipping professionals...	35	Emerging	llm-learning-resources	8	Jupyter Notebook
2076	HacktivSpace/multidisciplinary-deepfake-detection A solution for deepfake detection across multiple modalities, including...	35	Emerging	ai-content-detection	13	Python
2077	tgautam03/Transformers A Gentle Introduction to Transformers Neural Network	35	Emerging	transformer-architecture-tutorials	14	Jupyter Notebook
2078	xmindflow/MSA-2Net [BMVC 2024] Official repository of the paper titled "MSA^2 Net: Multi-scale...	35	Emerging	medical-image-segmentation-transformers	70	Python
2079	saddam213/LLamaStack ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp	35	Emerging	local-llm-deployment	60	C#
2080	ziqipang/RandAR [CVPR 2025 (Oral)] Open implementation of "RandAR"	35	Emerging	vision-language-models	207	Python
2081	ziqipang/LM4VisualEncoding [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are...	35	Emerging	multimodal-vision-language	246	Python
2082	sam575/axial-gan Code for "Simultaneous Face Hallucination and Translation for Thermal to...	35	Emerging	3d-vision-transformers	13	Python
2083	thongnt99/learned-sparse-retrieval Unified Learned Sparse Retrieval Framework	35	Emerging	power-transformer-design	68	Python
2084	zjunlp/NLPCC2024_RegulatingLLM [NLPCC 2024] Shared Task 10: Regulating Large Language Models	35	Emerging	ai-generated-text-detection	14	—
2085	FareedKhan-dev/gpt4o-from-scratch Implementation of a GPT-4o like Multimodal from Scratch using Python	35	Emerging	gpt2-pretraining-fine-tuning	78	Jupyter Notebook
2086	akjindal53244/Arithmo Small and Efficient Mathematical Reasoning LLMs	35	Emerging	math-reasoning-datasets	73	Python
2087	declare-lab/CICERO The purpose of this repository is to introduce new dialogue-level...	35	Emerging	ml-api-deployment	64	Python
2088	AI4LIFE-GROUP/LLM_Explainer Code for paper: Are Large Language Models Post Hoc Explainers?	35	Emerging	llm-interpretability-explainability	34	Jupyter Notebook
2089	Wangbiao2/R1-Track R1-Track: Direct Application of MLLMs to Visual Object Tracking via...	35	Emerging	multimodal-vision-language	66	Python
2090	qizhou000/UniEdit [NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for...	35	Emerging	rlhf-alignment-training	2	Python
2091	zhchen18/ToMBench ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.	35	Emerging	domain-specific-benchmarks	66	Python
2092	BatsResearch/planetarium Dataset and benchmark for assessing LLMs in translating natural language...	35	Emerging	llm-robot-planning	65	Python
2093	gustavecortal/gpt-j-fine-tuning-example Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression	35	Emerging	model-fine-tuning-methods	68	Jupyter Notebook
2094	otto-de/TRON ⚡️ Implementation of TRON: Transformer Recommender using Optimized...	35	Emerging	recommendation-systems-transformers	74	Python
2095	yyDing1/ScaleQuest [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective...	35	Emerging	task-oriented-dialogue-systems	68	Python
2096	linydub/azureml-greenai-txtsum Samples for fine-tuning HuggingFace models with AzureML	35	Emerging	text-summarization-transformers	10	—
2097	SkywalkerLuke/TransHLA TransHLA: A hybrid transformer model for peptide-HLA epitope detection.	35	Emerging	protein-transformers-ml	9	Python
2098	aj-naik/Text-Summarization Abstractive and Extractive Text summarization using Transformers.	35	Emerging	text-summarization-transformers	86	Jupyter Notebook
2099	XavierZXY/Zero2Hero Learning large models from 0 to 1	35	Emerging	llm-learning-resources	19	Jupyter Notebook
2100	viralcode/superGPT Train your own LLM from scratch	35	Emerging	llm-implementation-tutorials	7	Python
