All Transformer Models

7,795 models ranked by quality score · Page 6 of 78

Showing 501–600 of 7,795

#	Model	Score	Tier	Category	Stars	Language
501	dali92002/DocEnTR DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022	47	Emerging	3d-vision-transformers	186	Jupyter Notebook
502	zackshen/gguf a GGUF file parser	47	Emerging	llm-quantization-methods	17	Rust
503	noahho/CAAFE Semi-automatic feature engineering process using Language Models and your...	47	Emerging	feature-selection-frameworks	182	Python
504	conceptofmind/LaMDA-rlhf-pytorch Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding...	47	Emerging	rlhf-alignment-training	470	Python
505	Tzohar/PassLLM World's most accurate password guessing AI tool. A PyTorch implementation of...	47	Emerging	llm-training-experimentation	85	Python
506	kenhktsui/anyclassifier One Line To Build Zero-Data Classifiers in Minutes	47	Emerging	text-classification	64	Python
507	EleutherAI/gpt-neo An implementation of model parallel GPT-2 and GPT-3-style models using the...	47	Emerging	gpt2-pretraining-fine-tuning	8,286	Python
508	awslabs/mlm-scoring Python library & examples for Masked Language Model Scoring (ACL 2020)	47	Emerging	end-to-end-asr-frameworks	348	Python
509	mim-solutions/bert_for_longer_texts BERT classification model for processing texts longer than 512 tokens. Text...	47	Emerging	text-classification-transformers	146	Python
510	rxn4chemistry/rxn-onmt-models Training of OpenNMT-based RXN models	47	Emerging	molecular-generation-transformers	2	Python
511	x-tabdeveloping/turftopic Robust and fast topic models with sentence-transformers.	47	Emerging	text-clustering-topic-modeling	94	Python
512	Gleghorn-Lab/Protify Low code molecular property prediction	47	Emerging	protein-transformers-ml	11	Python
513	jobergum/browser-ml-inference Edge Inference in Browser with Transformer NLP model	47	Emerging	browser-based-ml-inference	316	Jupyter Notebook
514	predibase/lorax Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs	47	Emerging	lora-qlora-fine-tuning	3,735	Python
515	lorenzorovida/FHE-BERT-Tiny Source code for the paper "Transformer-based Language Models and Homomorphic...	47	Emerging	gpt-model-fine-tuning	32	Jupyter Notebook
516	dorarad/gansformer Generative Adversarial Transformers	47	Emerging	multimodal-fusion-transformers	1,346	Python
517	dusty-nv/NanoLLM Optimized local inference for LLMs with HuggingFace-like APIs for...	47	Emerging	nlp-fundamentals-tutorials	359	Python
518	kyegomez/SimplifiedTransformers SimplifiedTransformer simplifies transformer block without affecting...	47	Emerging	transformer-architecture-education	15	Python
519	jackaduma/Recurrent-LLM The open-source LLM implementation of paper: RecurrentGPT: Interactive...	47	Emerging	multilingual-llm-adaptation	203	Python
520	chuanyangjin/MMToM-QA [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind...	47	Emerging	vision-language-models	154	Python
521	geeks-of-data/knowledge-gpt Extract knowledge from all information sources using gpt and other language...	47	Emerging	llm-implementation-from-scratch	291	Python
522	monologg/KoBERT-Transformers KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)	47	Emerging	korean-language-models	212	Python
523	qcri/LLMeBench Benchmarking Large Language Models	47	Emerging	domain-specific-benchmarks	105	Python
524	vinjn/llm-metahuman An open solution for AI-powered photorealistic digital humans.	47	Emerging	llm-terminal-automation	138	Python
525	The-AI-Summer/self-attention-cv Implementation of various self-attention mechanisms focused on computer...	47	Emerging	transformer-architecture-tutorials	1,215	Python
526	The-Swarm-Corporation/MedGuard MedGuard is a robust, production-grade Python library that ensures HIPAA...	47	Emerging	therapeutic-chatbot-applications	15	Python
527	back2matching/turboquant First open-source TurboQuant KV cache compression for LLM inference. Drop-in...	47	Emerging	llm-quantization-methods	5	Python
528	ycq091044/BIOT BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models.	47	Emerging	academic-thesis-repositories	182	Python
529	ssbuild/chatglm_finetuning chatglm 6b finetuning and alpaca finetuning	47	Emerging	llm-finetuning-frameworks	1,537	Python
530	soulteary/docker-llama2-chat Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...	47	Emerging	local-llm-deployment	538	Python
531	xusenlinzy/api-for-open-llm OpenAI-style API for open large language models, using LLMs just as ChatGPT!...	47	Emerging	multilingual-llm-adaptation	2,468	Python
532	cedrickchee/awesome-transformer-nlp A curated list of NLP resources focused on Transformer networks, attention...	47	Emerging	transformer-architecture-tutorials	1,131	—
533	svdrecbd/mhc-mlx MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by...	47	Emerging	mathematical-reasoning-transformers	3	Python
534	ARM-software/keyword-transformer Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769	47	Emerging	transformer-architecture-education	138	Jupyter Notebook
535	r2d4/rellm Exact structure out of any language model completion.	47	Emerging	llm-training-experimentation	514	Python
536	mlabonne/llm-datasets Curated list of datasets and tools for post-training.	47	Emerging	llm-domain-datasets	4,319	—
537	Zefan-Cai/KVCache-Factory Unified KV Cache Compression Methods for Auto-Regressive Models	47	Emerging	kv-cache-optimization	1,309	Python
538	bobazooba/xllm 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning	47	Emerging	llm-training-experimentation	408	Python
539	deepseek-ai/Janus Janus-Series: Unified Multimodal Understanding and Generation Models	47	Emerging	multimodal-vision-language	17,708	Python
540	kyegomez/Lets-Verify-Step-by-Step "Improving Mathematical Reasoning with Process Supervision" by OPENAI	47	Emerging	gpt2-pretraining-fine-tuning	114	Python
541	jhkchan/translategemma-cli Local CLI for Google's TranslateGemma translation models with multi-platform...	47	Emerging	machine-translation-systems	21	Python
542	davidpirogov/toon-llm Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization...	47	Emerging	llm-learning-resources	9	Python
543	LM-Kit/lm-kit-net-samples .NET samples for LM-Kit.NET	47	Emerging	local-llm-deployment	38	C#
544	showlab/Show-o [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...	47	Emerging	multimodal-vision-language	1,894	Python
545	jeya-maria-jose/TransWeather PyTorch code for the paper TransWeather - CVPR 2022	47	Emerging	time-series-forecasting-transformers	220	Python
546	cztomsik/ava All-in-one desktop app for running LLMs locally.	47	Emerging	llm-terminal-automation	465	TypeScript
547	AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow Implementation for "Improving Language Understanding by Generative...	47	Emerging	gpt-multilingual-training	19	Python
548	sinanuozdemir/oreilly-llm-rl-alignment This training offers an intensive exploration into the frontier of...	47	Emerging	rlhf-alignment-training	59	Jupyter Notebook
549	leaderj1001/BottleneckTransformers Bottleneck Transformers for Visual Recognition	47	Emerging	vision-transformer-implementations	279	Python
550	Uminosachi/open-llm-webui This repository contains a web application designed to execute relatively...	47	Emerging	interactive-ai-chat-uis	47	Python
551	prrao87/tweet-stance-prediction Applying NLP transfer learning techniques to predict Tweet stance toward a topic	47	Emerging	disaster-tweet-classification	107	Jupyter Notebook
552	mirpo/fastapi-gen Build LLM-enabled FastAPI applications without build configuration.	47	Emerging	local-llm-deployment	11	Python
553	horseee/LLM-Pruner [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...	47	Emerging	llm-pruning-compression	1,109	Python
554	haotian-liu/LLaVA [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...	47	Emerging	vision-language-instruction-tuning	24,554	Python
555	ictnlp/LLaMA-Omni LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction...	47	Emerging	multimodal-vision-language	3,128	Python
556	vectorch-ai/ScaleLLM A high-performance inference system for large language models, designed for...	47	Emerging	llm-inference-engines	491	C++
557	The-FinAI/PIXIU This repository introduces PIXIU, an open-source resource featuring the...	47	Emerging	multilingual-llm-adaptation	835	Jupyter Notebook
558	Cardinal-Operations/ORLM ORLM: Training Large Language Models for Optimization Modeling	47	Emerging	llm-scaling-architecture	237	Python
559	willyfh/graph-transformer An unofficial implementation of Graph Transformer (Masked Label Prediction:...	47	Emerging	graph-neural-networks	35	Python
560	NVlabs/Eagle Eagle: Frontier Vision-Language Models with Data-Centric Strategies	47	Emerging	vision-language-instruction-tuning	931	Python
561	kyegomez/MHMoE Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch	47	Emerging	mathematical-reasoning-transformers	29	Python
562	tigerchen52/query_level_uncertainty query-level uncertainty in LLMs	47	Emerging	llm-reasoning-research	9	Python
563	DaoD/INTERS This is the repository for our paper "INTERS: Unlocking the Power of Large...	47	Emerging	instruction-tuning-datasets	207	Python
564	jiwidi/Behavior-Sequence-Transformer-Pytorch This is a pytorch implementation for the BST model from Alibaba...	47	Emerging	transformer-architecture-tutorials	176	Jupyter Notebook
565	HHousen/TransformerSum Models to perform neural summarization (extractive and abstractive) using...	47	Emerging	text-summarization-tools	439	Python
566	locuslab/wanda A simple and effective LLM pruning approach.	47	Emerging	llm-compression-optimization	854	Python
567	VinAIResearch/PhoBERT PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)	47	Emerging	korean-language-models	775	—
568	codewithdark-git/Building-LLMs-from-scratch This repository guides you through the process of building a GPT-style Large...	47	Emerging	llm-implementation-from-scratch	51	Jupyter Notebook
569	sagorbrur/bangla-bert Bangla-Bert is a pretrained BERT model for the Bengali language	47	Emerging	bert-model-implementations	83	Jupyter Notebook
570	Event-AHU/Medical_Image_Analysis Foundation models based medical image analysis	47	Emerging	clinical-llm-tools	213	Python
571	kyegomez/SingLoRA This repository provides a minimal, single-file implementation of SingLoRA...	47	Emerging	llm-framework-abstractions	44	Python
572	DmitryNekrasov/ai-code-completion-idea-plugin Implementation of IntelliJ IDEA code completion plugin using a local LLM.	47	Emerging	code-completion-copilots	18	Kotlin
573	hiyouga/ChatGLM-Efficient-Tuning Fine-tuning ChatGLM-6B with PEFT \| 基于 PEFT 的高效 ChatGLM 微调	47	Emerging	rlhf-alignment-training	3,732	Python
574	kayoyin/transformer-slt Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)	47	Emerging	sign-language-recognition	160	ASL
575	alpa-projects/alpa Training and serving large-scale neural networks with auto parallelization.	47	Emerging	llm-cuda-optimization	3,188	Python
576	ymcui/Chinese-LLaMA-Alpaca-2 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs...	47	Emerging	multilingual-llm-adaptation	7,163	Python
577	raymin0223/mixture_of_recursions Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive...	47	Emerging	mixture-of-experts-llms	548	Python
578	fashn-AI/fashn-human-parser Human parsing model for fashion and virtual try-on applications	47	Emerging	3d-vision-transformers	24	Python
579	AviSoori1x/makeMoE From scratch implementation of a sparse mixture of experts language model...	46	Emerging	mixture-of-experts-llms	793	Jupyter Notebook
580	xNul/chat-llama-discord-bot A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other...	46	Emerging	messaging-platform-chatbots	120	Python
581	chaitjo/learning-tsp Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021)	46	Emerging	mathematical-reasoning-transformers	241	Jupyter Notebook
582	davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON Segmentation deep learning ALgorithm based on MONai toolbox: single and...	46	Emerging	medical-image-segmentation-transformers	124	Python
583	intel/intel-extension-for-transformers ⚡ Build your chatbot within minutes on your favorite device; offer SOTA...	46	Emerging	llm-chat-interfaces	2,177	Python
584	JIA-Lab-research/LongLoRA Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)	46	Emerging	llm-fine-tuning	2,694	Python
585	FoundationVision/Liquid (Accepted by IJCV) Liquid: Language Models are Scalable and Unified...	46	Emerging	multimodal-vision-language-models	640	Python
586	mit-han-lab/lite-transformer [ICLR 2020] Lite Transformer with Long-Short Range Attention	46	Emerging	machine-translation-transformers	610	Python
587	FudanDISC/DISC-LawLLM [中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language...	46	Emerging	legal-document-analysis	874	Python
588	kmeng01/memit Mass-editing thousands of facts into a transformer memory (ICLR 2023)	46	Emerging	llm-knowledge-editing	543	Python
589	voidful/TFkit 🤖📇 Handling multiple NLP tasks in one pipeline	46	Emerging	bert-model-implementations	57	Python
590	quantium-ai/research Research experiments exploring uncommon quant techniques.	46	Emerging	ml-foundations-curricula	34	Jupyter Notebook
591	j-min/VL-T5 PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)	46	Emerging	multimodal-fusion-transformers	374	Python
592	MagedSaeed/generate-sequences A python package made to generate sequences (greedy and beam-search) from...	46	Emerging	creative-text-generation	18	Python
593	KRR-Oxford/HierarchyTransformers Language Models as Hierarchy Encoders	46	Emerging	transformer-architecture-tutorials	40	Python
594	THU-SI/Spatial-MLLM [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM...	46	Emerging	multimodal-vision-language	447	Python
595	KristiyanVachev/Leaf-Question-Generation Easy to use and understand multiple-choice question generation algorithm...	46	Emerging	question-answering-systems	139	Jupyter Notebook
596	Paranioar/Awesome_Matching_Pretraining_Transfering The Paper List of Large Multi-Modality Model (Perception, Generation,...	46	Emerging	multimodal-vision-language-models	445	—
597	verifai/multiLLM 🚀 Invoke multiple large language models concurrently and then rank results....	46	Emerging	llm-frameworks-libraries	83	Python
598	bytedance/byteir A model compilation solution for various hardware	46	Emerging	llm-inference-engines	465	MLIR
599	thu-nics/MoA [CoLM'25] The official implementation of the paper	46	Emerging	mixture-of-experts-llms	156	Python
600	palewire/first-llm-classifier Learn how journalists use large-language models to organize and analyze...	46	Emerging	text-classification	10	Jupyter Notebook
