All Transformer Models

7,795 models ranked by quality score · Page 15 of 78

Showing 1401–1500 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1401	loong64/ollama Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other...	39	Emerging	local-llm-deployment	9	Dockerfile
1402	cloudmercato/ollama-benchmark Handy tool to measure the performance and efficiency of LLMs workloads.	39	Emerging	domain-specific-benchmarks	76	Python
1403	laelhalawani/gguf_llama Wrapper for simplified use of Llama2 GGUF quantized models.	39	Emerging	llm-quantization-methods	7	Python
1404	WangJingyao07/Awesome-GRPO Codebase of GRPO: Implementations and Resources of GRPO and Its Variants	39	Emerging	rlhf-alignment-training	276	Python
1405	ArchAIve-Project/Backend A complex Flask API system empowered by custom ML models, LLMs and...	39	Emerging	ml-api-deployment	2	Python
1406	UKPLab/5pils Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"...	39	Emerging	llm-interpretability-explainability	45	Python
1407	takara-ai/SwarmFormer A pytorch implementation of SwarmFormer for text classification.	39	Emerging	attention-mechanism-implementations	16	Python
1408	deep-div/Custom-Transformer-Pytorch A clean, ground-up implementation of the Transformer architecture in...	39	Emerging	transformer-architecture-tutorials	16	Jupyter Notebook
1409	MrYxJ/calculate-flops.pytorch The calflops is designed to calculate FLOPs、MACs and Parameters in all...	39	Emerging	llm-inference-engines	927	Python
1410	KishanBagaria/OCLB 🦙 One Click Llama Button for DeviantArt.com	39	Emerging	interactive-ai-chat-uis	17	JavaScript
1411	praj2408/Text-Summarizer-Project The text summarizer project is an innovative tool designed to condense...	39	Emerging	text-summarization-transformers	16	Jupyter Notebook
1412	StyrbjornKall/TRIDENT A collection of transformer-based models and developmental scripts presented...	39	Emerging	hate-speech-detection	16	Jupyter Notebook
1413	mshenoda/roberta-spam RoBERTa based Spam Message Detection	39	Emerging	spam-detection-transformers	18	Jupyter Notebook
1414	amoffat/HeimdaLLM Constrain LLM output	39	Emerging	llm-frameworks-libraries	113	Python
1415	franjgs/llm-rl-finance-trader Hybrid project integrating Large Language Models (LLM) for financial news...	39	Emerging	ai-stock-analysis	1	Python
1416	czg1225/dParallel [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs	39	Emerging	diffusion-language-models	62	Python
1417	yifanzhang-pro/AutoMathText [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative...	39	Emerging	math-reasoning-datasets	90	Python
1418	FareedKhan-dev/create-million-parameter-llm-from-scratch Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.	39	Emerging	llm-implementation-from-scratch	201	Jupyter Notebook
1419	complex-reasoning/RPG [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)	39	Emerging	rlhf-alignment-training	65	Python
1420	THUDM/LongAlign [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs	39	Emerging	llm-knowledge-editing	259	Python
1421	virevolai/logos-shift-client Replace expensive LLM calls with finetunes automatically	39	Emerging	llm-finetuning-frameworks	66	Python
1422	xiuqhou/Relation-DETR [ECCV2024 Oral] Official implementation of the paper "Relation DETR:...	39	Emerging	object-detection-transformers	254	Python
1423	horseee/Awesome-Efficient-LLM A curated list for Efficient Large Language Models	39	Emerging	llm-compression-optimization	1,967	Python
1424	AyushExel/trolo An SDK for Transformers + YOLO and other SSD family models	39	Emerging	3d-vision-transformers	64	Jupyter Notebook
1425	ramonclaudio/perplexity-ai-toolkit A lightweight Python API wrapper and CLI for Perplexity’s Sonar language models.	39	Emerging	prompt-engineering-security	65	Python
1426	padeler/PE-former 2D Human Pose estimation using transformers. Implementation in Pytorch	39	Emerging	3d-vision-transformers	34	Python
1427	TatevKaren/BabyGPT-Build_GPT_From_Scratch BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training...	39	Emerging	gpt2-pretraining-fine-tuning	116	Python
1428	nicola-decao/KnowledgeEditor Code for Editing Factual Knowledge in Language Models	39	Emerging	rlhf-alignment-training	142	Python
1429	FareedKhan-dev/qwen3-MoE-from-scratch A Step-by-Step Implementation of Qwen 3 MoE Architecture from Scratch	39	Emerging	mixture-of-experts-llms	76	Jupyter Notebook
1430	HKUDS/GraphEdit "GraphEdit: Large Language Models for Graph Structure Learning"	39	Emerging	graph-language-models	143	Python
1431	zhenyi4/codi Official repository for "CODI: Compressing Chain-of-Thought into Continuous...	39	Emerging	chain-of-thought-reasoning	73	Python
1432	akshitac8/OW-DETR [CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer	39	Emerging	object-detection-transformers	257	Python
1433	Sea-Snell/CALM-Dialogue Official code for the paper "Context-Aware Language Modeling for...	39	Emerging	chatgpt-api-tutorials	34	Python
1434	jackaduma/Vicuna-LoRA-RLHF-PyTorch A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer...	39	Emerging	rlhf-alignment-training	221	Python
1435	NSLab-CUK/Unified-Graph-Transformer Unified Graph Transformer (UGT) is a novel Graph Transformer model...	39	Emerging	graph-transformers	28	Python
1436	Mj23978/sam-assistant 🤖 Sam-assistant is a personal assistant that is designed to understand your...	39	Emerging	conversational-chatbot-applications	53	Python
1437	PCfVW/hf-fetch-model Fast HuggingFace model downloads for Rust — an embeddable library for...	39	Emerging	browser-based-ml-inference	1	Rust
1438	Md-Emon-Hasan/InformaTruth Fine-tuned roberta-base classifier on the LIAR dataset. Aaccepts multiple...	39	Emerging	fake-news-detection	1	Jupyter Notebook
1439	INWLY/LWTformer LWTformer: A Detail-Aware, Learnable Wavelet-Transformer for Ancient Chinese...	39	Emerging	low-light-image-restoration	5	Python
1440	Lamorati92/LLMs-from-scratch 📚 Build and train your own GPT-like Large Language Model from scratch with...	39	Emerging	llm-fine-tuning-frameworks	2	Jupyter Notebook
1441	leaderj1001/CLIP CLIP: Connecting Text and Image (Learning Transferable Visual Models From...	39	Emerging	clip-vision-language	83	Python
1442	declare-lab/red-instruct Codes and datasets of the paper Red-Teaming Large Language Models using...	39	Emerging	llm-training-experimentation	108	Python
1443	slSeanWU/Compose_and_Embellish Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:...	39	Emerging	music-generation-transformers	33	Python
1444	knotgrass/attention several types of attention modules written in PyTorch for learning purposes	39	Emerging	transformer-architecture-tutorials	53	Python
1445	GAIR-NLP/OctoThinker Revisiting Mid-training in the Era of Reinforcement Learning Scaling	39	Emerging	lora-qlora-fine-tuning	185	Jupyter Notebook
1446	Lupin1998/Awesome-MIM [Survey] Masked Modeling for Self-supervised Representation Learning on...	39	Emerging	multimodal-vision-language-models	353	Python
1447	18907305772/FuseAI FuseAI Project	39	Emerging	llm-thesis-research	93	Python
1448	ariG23498/gemma3-object-detection Fine tune Gemma 3 on an object detection task	39	Emerging	lora-qlora-fine-tuning	100	Python
1449	StevenRice99/LLM-IK LLM-IK: Solving Inverse Kinematics using Large Language Models	39	Emerging	llm-fine-tuning-frameworks	7	Python
1450	nihalsangeeth/behaviour-seq-transformer Pytorch implementation of "Behaviour Sequence Transformer for E-commerce...	39	Emerging	transformer-architecture-tutorials	23	Python
1451	rishub-tamirisa/tamper-resistance [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for...	39	Emerging	safety-robustness-evaluation	67	Python
1452	microsoft/COCO-LM [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for...	39	Emerging	mathematical-reasoning-transformers	118	Python
1453	xiuqhou/Salience-DETR [CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing...	39	Emerging	object-detection-transformers	225	Jupyter Notebook
1454	waroad/losver Source Code for LOSVER: Line-Level Modifiability Signal-Guided Vulnerability...	39	Emerging	vulnerability-detection-llm	2	Python
1455	asprenger/ray_vllm_inference A simple service that integrates vLLM with Ray Serve for fast and scalable...	39	Emerging	llm-inference-serving	78	Python
1456	kssteven418/BigLittleDecoder [NeurIPS'23] Speculative Decoding with Big Little Decoder	39	Emerging	speculative-decoding-algorithms	96	Python
1457	hitz-zentroa/GoLLIE Guideline following Large Language Model for Information Extraction	39	Emerging	llm-training-experimentation	431	Python
1458	SPUTNIKAI/LeechTransformer Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech...	39	Emerging	llm-implementation-tutorials	4	Jupyter Notebook
1459	armbues/SiLLM-examples Examples for using the SiLLM framework for training and running Large...	39	Emerging	apple-silicon-llm-inference	16	Python
1460	g8a9/ferret A python package for benchmarking interpretability techniques on Transformers.	39	Emerging	model-evaluation-diagnostics	215	Python
1461	truefoundry/models Community-maintained registry of AI/LLM model configurations - pricing,...	39	Emerging	llm-pricing-comparison	4	—
1462	hpcaitech/SwiftInfer Efficient AI Inference & Serving	39	Emerging	llm-inference-engines	480	Python
1463	Nithin-Holla/meme_challenge Repository containing code from team Kingsterdam for the Hateful Memes Challenge	39	Emerging	hate-speech-detection	23	Python
1464	monarch-initiative/pheval.llm Analysis of LLMs for Clinical Observations	39	Emerging	clinical-llm-tools	7	Python
1465	wpeebles/G.pt Official PyTorch Implementation of "Learning to Learn with Generative Models...	39	Emerging	gpt2-pretraining-fine-tuning	345	Python
1466	nlpaueb/greek-bert A Greek edition of BERT pre-trained language model	38	Emerging	bert-model-implementations	148	Python
1467	NohTow/PPL-MCTS Repository for the code of the "PPL-MCTS: Constrained Textual Generation...	38	Emerging	creative-text-generation	66	Python
1468	VectorInstitute/atomgen Library for handling atomistic graph datasets focusing on transformer-based...	38	Emerging	molecular-generation-transformers	8	Python
1469	chef-transformer/chef-transformer Chef Transformer 🍲 .	38	Emerging	transformer-architecture-tutorials	85	Python
1470	HxCodeWarrior/StellarByte 从零实现基础的Transformer的Decoerder-Only模型，并进行模型升级，构建专属于自己的LLM模型	38	Emerging	llm-implementation-from-scratch	6	Python
1471	zarzouram/image_captioning_with_transformers Pytorch implementation of image captioning using transformer-based model.	38	Emerging	image-captioning-transformers	68	Jupyter Notebook
1472	madibabaiasl/MobileRobotGPT4LLaMA2024 Deployment of Large Language Models to Control Mobile Robots at the Edge	38	Emerging	vision-language-instruction-tuning	11	Python
1473	BaiTheBest/SparseLLM Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)	38	Emerging	llm-compression-optimization	67	Python
1474	IAmPara0x/Yuno Yuno is context based search engine for anime.	38	Emerging	semantic-search-retrieval	380	Python
1475	Koratahiu/Advanced_Optimizers A family of highly efficient, lightweight yet powerful optimizers.	38	Emerging	llm-compression-optimization	21	Python
1476	TrelisResearch/install-guides Various installation guides for Large Language Models	38	Emerging	llm-fine-tuning	77	Jupyter Notebook
1477	baaivision/EVE EVE Series: Encoder-Free Vision-Language Models from BAAI	38	Emerging	multimodal-vision-language	368	Python
1478	BodhiSearch/BodhiApp Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs	38	Emerging	local-llm-deployment	132	Rust
1479	argonne-lcf/LLM-Inference-Bench LLM-Inference-Bench	38	Emerging	llm-inference-engines	60	Jupyter Notebook
1480	BhabhaAI/dataformer Solving data for LLMs - Create quality synthetic datasets!	38	Emerging	synthetic-data-generation	151	Python
1481	lukashermann/hulc Hierarchical Universal Language Conditioned Policies	38	Emerging	trajectory-prediction-ml	77	Python
1482	Traffic-Alpha/iLLM-TSC This repository contains the code for the paper“iLLM-TSC: Integration...	38	Emerging	competitive-agent-games	70	Python
1483	ByteDance-Seed/FlexPrefill Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse...	38	Emerging	mixture-of-experts-llms	164	Python
1484	robert-mcdermott/LLM-Image-Classification Image Classification Testing with LLMs	38	Emerging	text-classification	72	Python
1485	intersun/LightningDOT source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT	38	Emerging	parameter-efficient-adapters	72	Python
1486	deep-div/Fine-Tuning-LLMs-and-VisionModels Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to...	38	Emerging	lora-qlora-fine-tuning	17	Jupyter Notebook
1487	toyaix/TritonLLM LLM Inference via Triton (Flexible & Modular): Focused on Kernel...	38	Emerging	llm-inference-engines	76	Python
1488	Nkluge-correa/Tucano Natively pre-trained open-source Portuguese language models.	38	Emerging	multilingual-llm-adaptation	79	Jupyter Notebook
1489	AlexIoannides/transformers-gen-ai Developing generative language models using transformers.	38	Emerging	gpt-model-fine-tuning	11	Jupyter Notebook
1490	openpsi-project/ReaLHF Super-Efficient RLHF Training of LLMs with Parameter Reallocation	38	Emerging	rlhf-alignment-training	333	Python
1491	sshh12/multi_token Embed arbitrary modalities (images, audio, documents, etc) into large...	38	Emerging	multimodal-vision-language	191	Python
1492	jdaln/dgx-spark-inference-stack Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace...	38	Emerging	llm-inference-engines	26	JavaScript
1493	nercone-dev/zeta-llm-tool Fully Open-source LLM Tool	38	Emerging	llm-scaling-architecture	5	Python
1494	icon-lab/BolT Fused Window Transformers for fMRI Time Series Analysis...	38	Emerging	transformer-interpretability-mechanistic	34	Python
1495	styfeng/TinyDialogues Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective...	38	Emerging	creative-text-generation	12	Python
1496	KishanBagaria/dAbot 🤖 CLI tool to automate stuff on DeviantArt.com	38	Emerging	llm-terminal-automation	21	Python
1497	Whiax/BERT-Transformer-Pytorch Basic implementation of BERT and Transformer in Pytorch in one short python...	38	Emerging	transformer-architecture-education	45	Python
1498	xingyizhou/GTR Global Tracking Transformers, CVPR 2022	38	Emerging	3d-vision-transformers	379	Python
1499	NTU-SQUAD/transformers-coqa Albert for Conversational Question Answering Challenge	38	Emerging	question-answering-systems	22	Python
1500	singhsidhukuldeep/Text-Summarizer Comparing state of the art models for text summary generation	38	Emerging	text-summarization-transformers	19	Jupyter Notebook

« Prev 1 2 3 … 13 14 15 16 17 … 76 77 78 Next »