All Transformer Models

7,795 models ranked by quality score · Page 11 of 78

Showing 1001–1100 of 7,795

#	Model	Score	Tier	Category	Stars	Language
1001	asigalov61/SuperPiano Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for...	42	Emerging	music-generation-transformers	88	Jupyter Notebook
1002	johnmai-dev/ChatMLX 🤖✨ChatMLX is a modern, open-source, high-performance chat application for...	42	Emerging	interactive-ai-chat-uis	822	Swift
1003	softmax1/Flash-Attention-Softmax-N CUDA and Triton implementations of Flash Attention with SoftmaxN.	42	Emerging	transformer-architecture-tutorials	73	Python
1004	ashleykleynhans/text-generation-docker Docker image for the Text Generation Web UI: A Gradio web UI for Large...	42	Emerging	prompt-engineering-security	4	Python
1005	RobertCsordas/transformer_generalization The official repository for our paper "The Devil is in the Detail: Simple...	42	Emerging	power-transformer-design	66	Python
1006	harleyszhang/llm_note LLM notes, including model inference, transformer model structure, and llm...	42	Emerging	llm-frameworks-libraries	866	Python
1007	TextGeneratorio/text-generator.io Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io	42	Emerging	text-to-image-generation	39	Python
1008	shushanxingzhe/transformers_ner Add CRF or LSTM+CRF for huggingface transformers bert to perform better on...	42	Emerging	named-entity-recognition	62	Python
1009	Gunale0926/SORSA SORSA: Singular Values and Orthonormal Regularized Singular Vectors...	42	Emerging	lora-qlora-fine-tuning	38	Python
1010	a-tokyo/ai-zero-shot-classifier 🧠 leverage advanced AI embeddings to perform multilingual zero-shot text...	42	Emerging	text-classification-transformers	12	TypeScript
1011	ahmetkumass/yolo-gen Train YOLO + VLM with one command. Auto-generate vision-language training...	42	Emerging	vision-language-models	24	Python
1012	ariannamethod/nanollama Train Llama 3 models from scratch. Any scale, any personality. By Arianna Method.	42	Emerging	multilingual-llm-adaptation	37	Python
1013	xmindflow/Awesome-Transformer-in-Medical-Imaging [MedIA Journal] An ultimately comprehensive paper list of Vision...	42	Emerging	vision-transformer-implementations	218	—
1014	sinanuozdemir/oreilly-ai-pipelines Designing and Deploying LLM Pipelines	42	Emerging	huggingface-learning-resources	37	Jupyter Notebook
1015	bilibili/Index-1.9B A lightweight multilingual LLM	42	Emerging	llm-frameworks-libraries	1,014	Python
1016	monologg/GoEmotions-Korean Korean version of GoEmotions Dataset 😍😢😱	42	Emerging	emotion-detection-transformers	57	Python
1017	SensAI-PT/LLaMa2lang Convenience scripts to finetune (chat-)LLaMa3 and other models for any language	42	Emerging	llm-fine-tuning	313	Python
1018	xyjigsaw/LLM-Pretrain-SFT Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)	42	Emerging	rlhf-alignment-training	87	Python
1019	monologg/DistilKoBERT Distillation of KoBERT from SKTBrain (Lightweight KoBERT)	42	Emerging	korean-language-models	198	Python
1020	HKUDS/OpenGraph [EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models"	42	Emerging	graph-language-models	330	Python
1021	HyperCluster-Tech/manimator Transform research papers and mathematical concepts into stunning visual...	42	Emerging	ai-video-generation	52	Python
1022	josStorer/selfhostedAI A collection of one-click self-hosted AI	42	Emerging	prompt-engineering-security	415	Python
1023	Rishit-dagli/Conformer An implementation of Conformer: Convolution-augmented Transformer for Speech...	42	Emerging	transformer-architecture-tutorials	45	Python
1024	tatsu-lab/alpaca_farm A simulation framework for RLHF and alternatives. Develop your RLHF method...	42	Emerging	rlhf-alignment-training	842	Python
1025	gitctrlx/llama.go Llama from scratch in Go.	42	Emerging	local-llm-deployment	104	Go
1026	menon92/BangalASR Transformer based Bangla Speech Recognition | Encoder Decoder Architecture	42	Emerging	bert-model-implementations	57	Jupyter Notebook
1027	ai-forever/mgpt Multilingual Generative Pretrained Model	42	Emerging	gpt2-pretraining-fine-tuning	207	Jupyter Notebook
1028	amirfeder/CausaLM CausaLM: Causal Model Explanation Through Counterfactual Language Models	42	Emerging	causal-inference-nlp	55	Python
1029	NVlabs/GroupViT Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges...	42	Emerging	vit-image-classification	783	Python
1030	mojivalipour/symbolicgpt Symbolic regression is the task of identifying a mathematical expression...	42	Emerging	julia-ml-frameworks	57	Python
1031	oxpig/CaLM Protein language model trained on coding DNA	42	Emerging	protein-language-models	53	Python
1032	grctest/FastAPI-BitNet Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.	42	Emerging	mcp-demo-examples	38	Python
1033	aimclub/FEDOT.LLM LLM-based prototype for nexgen AutoML	42	Emerging	llm-frameworks-libraries	30	Python
1034	LLukas22/llm-rs-python Unofficial python bindings for the rust llm library. 🐍❤️🦀	42	Emerging	local-llm-deployment	76	Python
1035	mmaaz60/EdgeNeXt [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently...	42	Emerging	vision-transformer-implementations	411	Python
1036	CAMeL-Lab/CAMeLBERT Code and models for "The Interplay of Variant, Size, and Task Type in Arabic...	42	Emerging	bert-model-frameworks	55	Python
1037	garyb9/twitter-llm-bot Fully automatic asynchronous AI operated Twitter bot using Large Language...	42	Emerging	conversational-chatbot-applications	51	Python
1038	therealoliver/Deepdive-llama3-from-scratch Achieve the llama3 inference step-by-step, grasp the core concepts, master...	42	Emerging	llm-implementation-from-scratch	626	Jupyter Notebook
1039	thuml/Flowformer Code release for "Flowformer: Linearizing Transformers with...	42	Emerging	self-supervised-learning	333	Python
1040	LowinLi/fastgpt ⚡ boost inference speed of GPT models in transformers by onnxruntime	42	Emerging	transformer-training-optimization	52	Python
1041	sedthh/BeatLearning Open Source Generative AI Models for Automatic Rhythm Game Beatmap...	42	Emerging	music-generation-transformers	61	Python
1042	ayaka14732/llama-2-jax JAX implementation of the Llama 2 model	42	Emerging	llama-model-implementations	216	Python
1043	biodatlab/thonburian-whisper Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...	42	Emerging	whisper-speech-transcription	186	Jupyter Notebook
1044	dmanuel64/codablellm A framework for creating and curating high-quality code datasets tailored...	42	Emerging	synthetic-data-generation	3	Python
1045	hyperonym/basaran Basaran is an open-source alternative to the OpenAI text completion API. It...	42	Emerging	gpt2-pretraining-fine-tuning	1,290	Python
1046	jankais3r/LLaMA_MPS Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.	42	Emerging	llm-inference-engines	585	Python
1047	woodRock/fishy-business Machine Learning for Rapid Evaporative Ionization Mass Spectrometry for...	42	Emerging	academic-thesis-repositories	3	Python
1048	NVIDIA/Cosmos-Tokenizer A suite of image and video neural tokenizers	42	Emerging	tokenizer-libraries	1,716	Jupyter Notebook
1049	sangmichaelxie/doremi Pytorch implementation of DoReMi, a method for optimizing the data mixture...	42	Emerging	llm-knowledge-distillation	352	HTML
1050	gotzmann/llama.go llama.go is like llama.cpp in pure Golang!	42	Emerging	local-llm-deployment	1,398	Go
1051	declare-lab/flan-alpaca This repository contains code for extending the Stanford Alpaca synthetic...	42	Emerging	multilingual-llm-adaptation	357	Python
1052	LISA-ITMO/LLM-resume-moderator Automates moderation of Russian-language resumes using an LLM. For...	42	Emerging	llm-training-experimentation	5	Jupyter Notebook
1053	ZinYY/Online_RLHF A PyTorch implementation of the paper "Provably Efficient Online RLHF with...	42	Emerging	rlhf-alignment-training	89	Python
1054	sayakpaul/robustness-vit Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).	42	Emerging	vision-transformer-implementations	122	Jupyter Notebook
1055	waltonfuture/Diabetica [SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks	42	Emerging	healthcare-ai-diagnostics	55	Python
1056	Dicklesworthstone/llm_introspective_compression_and_metacognition A novel approach for transformer model introspection that enables saving,...	42	Emerging	neural-data-compression	31	—
1057	ZO-Bench/ZO-LLM [ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization...	42	Emerging	llm-knowledge-distillation	124	Python
1058	zjunlp/KnowledgeCircuits [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers	42	Emerging	llm-knowledge-editing	164	Python
1059	liuqidong07/LLM-ESR [NeurIPS'24 Spotlight] The official implementation code of LLM-ESR.	42	Emerging	llm-recommendation-systems	49	Python
1060	efeslab/fiddler [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration	42	Emerging	mixture-of-experts-llms	262	Python
1061	harleyszhang/lite_llama A light llama-like llm inference framework based on the triton kernel.	42	Emerging	llama-model-implementations	174	Python
1062	shivendrra/SmallLanguageModel a LLM cookbook, for building your own from scratch, all the way from...	42	Emerging	llm-implementation-tutorials	168	Jupyter Notebook
1063	sail-sg/understand-r1-zero Understanding R1-Zero-Like Training: A Critical Perspective	42	Emerging	llm-reasoning-research	1,224	Python
1064	A-baoYang/alpaca-7b-chinese Finetune LLaMA-7B with Chinese instruction datasets	42	Emerging	llm-fine-tuning	137	Python
1065	taufeeque9/codebook-features Sparse and discrete interpretability tool for neural networks	42	Emerging	transformer-interpretability-mechanistic	64	Python
1066	omron-sinicx/crystalframer The official code repository for "Rethinking the role of frames for...	42	Emerging	graph-transformers	15	Python
1067	nickduran/align2-linguistic-alignment ALIGN 2.0: Modern Python package for multi-level linguistic alignment...	42	Emerging	rlhf-alignment-training	4	Python
1068	RobbenRibery/TuoTuo TuoTuo is a Topic Modeling library for Researchers and Engineers	42	Emerging	gpt-model-fine-tuning	6	Jupyter Notebook
1069	golsun/DialogRPT EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"	42	Emerging	ml-api-deployment	345	Python
1070	westlake-repl/IDvs.MoRec End-to-end Training for Multimodal Recommendation Systems	42	Emerging	llm-recommendation-systems	166	Python
1071	nuhmanpk/quick-llama Run Ollama models on Google Colab	42	Emerging	local-llm-deployment	4	Python
1072	leehanchung/lora-instruct Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA	42	Emerging	lora-qlora-fine-tuning	104	Python
1073	macabdul9/AnyGen A Unified and Minimalist Pipeline for Generating Outputs with LLMs...	42	Emerging	prompt-engineering-security	7	Python
1074	thunlp/InfLLM The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for...	42	Emerging	llm-scaling-architecture	395	Python
1075	sinanuozdemir/foundations-of-gen-ai Transformer Architectures for Generative AI	42	Emerging	ml-foundations-curricula	103	Jupyter Notebook
1076	Azure99/BlossomData A fluent, scalable, and easy-to-use LLM data processing framework.	42	Emerging	llm-inference-engines	28	Python
1077	Knuckles-Team/genius-chatbot Chatbot that uses any desired hugging face model or allows for scalable...	42	Emerging	conversational-chatbot-applications	1	Python
1078	RobertCsordas/modules The official repository for our paper "Are Neural Nets Modular? Inspecting...	41	Emerging	mathematical-reasoning-transformers	46	Python
1079	Beomi/InfiniTransformer Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No...	41	Emerging	transformer-architecture-tutorials	375	Python
1080	AdrianBZG/llama-multimodal-vqa Multimodal Instruction Tuning for Llama 3	41	Emerging	vision-language-instruction-tuning	51	Python
1081	jshilong/GPT4RoI (ECCVW 2025)GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest	41	Emerging	multimodal-vision-language	551	Python
1082	adaptivetokensampling/ATS Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral...	41	Emerging	vision-transformer-implementations	104	Shell
1083	jxiw/MambaInLlama [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and...	41	Emerging	diffusion-language-models	238	Python
1084	ictnlp/LLaVA-Mini LLaVA-Mini is a unified large multimodal model (LMM) that can support the...	41	Emerging	vision-language-instruction-tuning	562	Python
1085	Pyenb/Ollama-models A collection of zipped Ollama models for offline use. Simply download,...	41	Emerging	ollama-chat-interfaces	88	Shell
1086	declare-lab/instruct-eval This repository contains code to quantitatively evaluate instruction-tuned...	41	Emerging	instruction-tuning-datasets	552	Python
1087	poloclub/LLM-Attributor LLM Attributor: Attribute LLM's Generated Text to Training Data	41	Emerging	llm-interpretability-explainability	76	Jupyter Notebook
1088	audioku/meta-transfer-learning Implementation of meta-transfer-learning for ASR and LM (ACL 2020)	41	Emerging	end-to-end-asr-frameworks	52	Python
1089	eugenehp/bitnet-cpp-rs Rust bindings for bitnet.cpp based on llama-cpp-4	41	Emerging	local-llm-deployment	15	Rust
1090	zjysteven/mink-plus-plus [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training...	41	Emerging	llm-quantization-methods	54	Python
1091	JohnMachado11/Build-a-Large-Language-Model-from-Scratch Building a GPT-like LLM from scratch with PyTorch.	41	Emerging	llm-implementation-tutorials	337	Python
1092	mlpc-ucsd/BLIVA (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich...	41	Emerging	vision-language-instruction-tuning	260	Python
1093	Eamon2009/Transformer-language-model An educational implementation of a GPT-style language model built from...	41	Emerging	machine-translation-transformers	12	Jupyter Notebook
1094	okuvshynov/slowllama Finetune llama2-70b and codellama on MacBook Air without quantization	41	Emerging	lora-qlora-fine-tuning	450	Python
1095	Beomi/KcELECTRA 🤗 Korean Comments ELECTRA: an ELECTRA model trained on Korean comments	41	Emerging	korean-language-models	261	—
1096	zai-org/GLM-Edge GLM Series Edge Models	41	Emerging	llm-frameworks-libraries	160	Python
1097	louisbrulenaudet/tsdae Transformer-based Denoising AutoEncoder for Sentence Transformers...	41	Emerging	transformer-frameworks-wrappers	9	Python
1098	datadreamer-dev/DataDreamer DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤	41	Emerging	gpt2-pretraining-fine-tuning	1,100	Python
1099	HarderThenHarder/transformers_tasks ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,...	41	Emerging	model-evaluation-diagnostics	2,412	Jupyter Notebook
1100	Kakz/prometheus-llm PrometheusLLM is a unique transformer architecture inspired by dignity and...	41	Emerging	llm-finetuning-frameworks	5	Python
