All Transformer Models

7,795 models ranked by quality score · Page 10 of 78

Showing 901–1000 of 7,795


#	Model	Score	Tier	Category	Stars	Language
901	WangRongsheng/ChatGenTitle 🌟 ChatGenTitle: a paper title generation model fine-tuned on LLaMA using information from one million arXiv papers	43	Emerging	multilingual-llm-adaptation	840	Python
902	GAIR-NLP/MegaScience MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning	43	Emerging	lora-qlora-fine-tuning	113	Python
903	kastalimohammed1965/CLIP-fine-tune-registers-gated Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny...	43	Emerging	clip-image-embeddings	5	Python
904	hao-ai-lab/JacobiForcing Jacobi Forcing: Fast and Accurate Diffusion-style Decoding	43	Emerging	speculative-decoding-algorithms	143	Python
905	openjlc/riscv64-library Some of the libraries (docs) on the RISCV64 architecture are easy for users...	43	Emerging	local-llm-deployment	69	—
906	cleopatra-itn/fair_multimodal_sentiment Code and Splits for the paper "A Fair and Comprehensive Comparison of...	43	Emerging	review-sentiment-classification	10	Python
907	varunkumar-dev/TransformersDataAugmentation Code associated with the "Data Augmentation using Pre-trained Transformer...	43	Emerging	essay-scoring-grading	135	Python
908	cdpierse/script_buddy_v2 Script Buddy v2 is a film script text generation tool built using film...	43	Emerging	gpt2-pretraining-fine-tuning	47	Jupyter Notebook
909	magpie-align/magpie [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs...	43	Emerging	llm-domain-datasets	834	Python
910	jasonvanf/llama-trl LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA	43	Emerging	lora-qlora-fine-tuning	238	Python
911	obss/trapper State-of-the-art NLP through transformer models in a modular design and...	43	Emerging	transformer-frameworks-wrappers	47	Python
912	mutablelogic/go-llm Large Language Model API interface	43	Emerging	llm-orchestration-routing	8	Go
913	AviSoori1x/Tuning-the-Finetuning Tuning the Finetuning: An exploration of achieving success with QLoRA	43	Emerging	lora-qlora-fine-tuning	46	Python
914	Archimedes1618/Madlab Madlab is an advanced AI development studio designed to streamline the...	43	Emerging	local-llm-deployment	11	TypeScript
915	eric-ai-lab/MiniGPT-5 Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language...	43	Emerging	gpt2-pretraining-fine-tuning	863	Python
916	vaswdeferenss/AI-Dialogue-Memory-Based-on-Hidden-State 🤖 Integrate LSTM into Transformer models to enhance dialog memory, offering...	43	Emerging	llm-chatbot-applications	2	Python
917	DAGroup-PKU/MHLA MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head...	43	Emerging	compositional-t2i-generation	133	Python
918	cambridgeltl/visual-med-alpaca Visual Med-Alpaca is an open-source, multi-modal foundation model designed...	43	Emerging	clinical-llm-tools	394	Python
919	datastone-spirit/spirit-lora-trainer Spirit Lora Trainer is a robust toolkit for training Flux1-LoRA models with...	43	Emerging	lora-training-tools	87	Python
920	CodeWithKyrian/transformers-php Transformers PHP is a toolkit for PHP developers to add machine learning...	43	Emerging	php-ai-sdks	743	PHP
921	nerve-sparks/iris_android IRIS is an android app for interfacing with GGUF / llama.cpp models locally.	43	Emerging	local-llm-deployment	267	Kotlin
922	kyegomez/attn_res A clean, single-file PyTorch implementation of Attention Residuals (Kimi...	43	Emerging	transformer-architecture-tutorials	8	Python
923	haoliuhl/ringattention Large Context Attention	43	Emerging	transformer-architecture-tutorials	770	Python
924	VikParuchuri/textbook_quality Generate textbook-quality synthetic LLM pretraining data	43	Emerging	synthetic-data-generation	509	Python
925	zalkikar/mlm-bias Measuring Biases in Masked Language Models for PyTorch Transformers. Support...	43	Emerging	bias-detection-transformers	4	Python
926	mytechnotalent/RE-GPT Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you...	43	Emerging	gpt2-pretraining-fine-tuning	27	Jupyter Notebook
927	datawhalechina/base-llm A full-stack algorithm tutorial from NLP to LLM; read online at https://datawhalechina.github.io/base-llm/	43	Emerging	llm-training-experimentation	421	Jupyter Notebook
928	modelscope/dash-infer DashInfer is a native LLM inference engine aiming to deliver...	43	Emerging	llm-inference-engines	273	C
929	ethicalabs-ai/kurtis Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small...	43	Emerging	lora-qlora-fine-tuning	6	Python
930	RManLuo/graph-constrained-reasoning Official Implementation of ICML 2025 Paper: "Graph-constrained Reasoning:...	43	Emerging	llm-knowledge-graph-generation	238	Python
931	CLAIRE-Labo/EvoTune Efficiently discovering algorithms via LLMs with evolutionary search and...	43	Emerging	llm-agent-training-gyms	130	Python
932	ruimalheiro/training-custom-llama Llama-style transformer in PyTorch with multi-node / multi-GPU training....	43	Emerging	lora-qlora-fine-tuning	21	Python
933	aliemo/transfomers-silicon-research Research and Materials on Hardware implementation of Transformer Model	43	Emerging	machine-translation-transformers	299	Jupyter Notebook
934	michael-borck/study-buddy Desktop AI tutoring app with local inference using Ollama for...	43	Emerging	study-aid-generators	10	TypeScript
935	Tongjilibo/build_MiniLLM_from_scratch Build a MiniLLM from 0 to 1 (pretrain+sft+dpo, in progress)	43	Emerging	llm-implementation-tutorials	537	Python
936	harveybc/predictor Predictor that uses a configurable plugin-based predictive supervised...	43	Emerging	financial-return-prediction	5	Python
937	amirhossein-kz/HiFormer HiFormer: Hierarchical Multi-scale Representations Using Transformers for...	43	Emerging	medical-image-segmentation-transformers	144	Jupyter Notebook
938	DC-research/TEMPO The official code for "TEMPO: Prompt-based Generative Pre-trained...	43	Emerging	time-series-forecasting-transformers	133	Python
939	ShivamRajSharma/Transformer-Architectures-From-Scratch Implementation of transformers based architecture in PyTorch.	43	Emerging	transformer-architecture-education	55	Python
940	Eiztrips/ai-responder A tool for creating and training models that imitate the communication style...	43	Emerging	conversational-chatbot-applications	24	Python
941	skylight-org/sparse-attention-hub Advancing the frontier of efficient AI	43	Emerging	sparse-attention-optimization	54	Python
942	soldni/pyterrier_sentence_transformers Create PyTerrier compatible dense indices using any sentence_transformers model	43	Emerging	model-evaluation-diagnostics	6	Python
943	alibaba/GraphTranslator GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks	43	Emerging	graph-language-models	118	Python
944	Michael-A-Kuykendall/shimmytok Pure Rust tokenizer for GGUF models - llama.cpp compatible	43	Emerging	llm-quantization-methods	14	Rust
945	dipanjanS/adv_nlp_workshop_odsc_europe22 Extensive tutorials for the Advanced NLP Workshop in Open Data Science...	43	Emerging	nlp-learning-coursework	51	Jupyter Notebook
946	datamllab/LongLM [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning	43	Emerging	diffusion-language-models	666	Python
947	gohjiayi/suicidal-text-detection Building a suicidal text detection model and mental health chatbot with deep...	43	Emerging	emotion-detection-transformers	42	Jupyter Notebook
948	zeozeozeo/ellama Friendly interface to chat with an Ollama instance.	43	Emerging	interactive-ai-chat-uis	92	Rust
949	jianghoucheng/AnyEdit AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025	43	Emerging	llm-knowledge-editing	46	Python
950	huangwl18/language-planner Official Code for "Language Models as Zero-Shot Planners: Extracting...	43	Emerging	llm-implementation-from-scratch	278	Jupyter Notebook
951	IlyaGusev/rulm Language modeling and instruction tuning for Russian	43	Emerging	llm-learning-resources	465	Jupyter Notebook
952	lxuechen/private-transformers A codebase that makes differentially private training of transformers easy.	43	Emerging	transformer-architecture-tutorials	185	Python
953	armbues/SiLLM SiLLM simplifies the process of training and running Large Language Models...	43	Emerging	apple-silicon-llm-inference	284	Python
954	xlang-ai/Binder [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"	43	Emerging	math-reasoning-datasets	325	Python
955	csiro-robotics/HOTFormerLoc [IEEE/CVF CVPR 2025] Hierarchical Octree Transformer for Versatile Lidar...	43	Emerging	3d-vision-transformers	26	Python
956	chanind/linear-relational Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)...	43	Emerging	llm-training-experimentation	10	Python
957	njchoma/transformer_image_caption Image Captioning based on Bottom-Up and Top-Down Attention model	42	Emerging	image-caption-generation	104	Jupyter Notebook
958	Nuked88/ComfyUI-N-Nodes A suite of custom nodes for ComfyUI that includes GPT text-prompt...	42	Emerging	interactive-ai-chat-uis	237	Python
959	SomeBottle/Konnyaku A simple and robust LLM workflow for anime subtitle file translation. | Based on...	42	Emerging	ai-subtitle-translation	4	Python
960	canyuchen/ClinicalBench Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in...	42	Emerging	clinical-llm-tools	31	Python
961	Yachay-AI/byt5-geotagging Confidence and ByT5-based geotagging model predicting coordinates from text alone.	42	Emerging	multimodal-fusion-transformers	160	Python
962	deepreinforce-ai/CUDA-L2 CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through...	42	Emerging	llm-cuda-optimization	472	Cuda
963	mala-lab/SEMPO [NeurIPS 2025] Official implementation of "SEMPO: Lightweight Foundation...	42	Emerging	time-series-forecasting-transformers	18	Python
964	phronmophobic/llama.clj Run LLMs locally. A clojure wrapper for llama.cpp.	42	Emerging	local-llm-deployment	173	Clojure
965	ssbuild/deep_training deep learning	42	Emerging	lora-qlora-fine-tuning	151	Python
966	zetavg/LLaMA-LoRA-Tuner UI tool for fine-tuning and testing your own LoRA models based on LLaMA,...	42	Emerging	lora-qlora-fine-tuning	476	Python
967	AntixK/PyTorch-Model-Compare Compare neural networks by their feature similarity	42	Emerging	academic-thesis-repositories	379	Python
968	hellotransformers/Natural_Language_Processing_with_Transformers Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial	42	Emerging	transformer-frameworks-wrappers	568	—
969	illiterate/BertClassifier BERT Chinese text classification model implemented in PyTorch	42	Emerging	text-classification-transformers	203	Python
970	KolosalAI/kolosal-server Kolosal AI is an OpenSource and Lightweight alternative to Ollama to run...	42	Emerging	local-llm-deployment	13	C++
971	NetEase-Media/grps_trtllm Higher performance OpenAI LLM service than vLLM serve: A pure C++...	42	Emerging	llm-framework-abstractions	158	Python
972	princeton-nlp/LLM-Shearing [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via...	42	Emerging	llm-pruning-compression	642	Python
973	bahree/helloLondon Historical Language Model for London - A specialized LLM trained on...	42	Emerging	transformer-implementation-education	29	Python
974	ruanchaves/napolab The Natural Portuguese Language Benchmark (Napolab). Stay up to date with...	42	Emerging	nlp-learning-coursework	72	Python
975	the-crypt-keeper/can-ai-code Self-evaluating interview for AI coders	42	Emerging	ai-powered-business-analytics	602	Python
976	txsun1997/Black-Box-Tuning ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022:...	42	Emerging	black-box-optimization	271	Python
977	EleutherAI/DALLE-mtf Open-AI's DALL-E for large scale training in mesh-tensorflow.	42	Emerging	text-to-image-generation	431	Python
978	uclaml/SPPO The official implementation of Self-Play Preference Optimization (SPPO)	42	Emerging	direct-preference-optimization	583	Python
979	nlp-uoregon/mlmm-evaluation Multilingual Large Language Models Evaluation Benchmark	42	Emerging	evaluation-frameworks-metrics	132	Python
980	ArdaGnsrn/ollama-php This is a PHP library for Ollama. Ollama is an open-source project that...	42	Emerging	interactive-ai-chat-uis	207	PHP
981	luchangli03/export_llama_to_onnx export llama to onnx	42	Emerging	llama-model-implementations	135	Python
982	AviSoori1x/seemore From scratch implementation of a vision language model in pure PyTorch	42	Emerging	llm-implementation-tutorials	255	Jupyter Notebook
983	hitz-zentroa/whisper-lm-transformers Add n-gram and LLM language model support to HF Transformers Whisper models.	42	Emerging	llm-implementation-tutorials	14	Python
984	adarshM84/TextLLaMACode Transform your writing with TextLLaMA! ✍️🚀 Simplify grammar, translate...	42	Emerging	interactive-ai-chat-uis	3	JavaScript
985	CVxTz/music_genre_classification Music genre classification: LSTM vs Transformer	42	Emerging	audio-classification-transformers	63	Python
986	scientific-discovery/LLEMA [ICLR 2026] LLEMA: Evolutionary Search with LLMs for Multi-Objective...	42	Emerging	llm-agent-training-gyms	12	Python
987	RobertCsordas/ndr The official repository for our paper "The Neural Data Router: Adaptive...	42	Emerging	power-transformer-design	34	Python
988	jingedawang/TutorialLLM LLM Tutorial for Everyone.	42	Emerging	llm-learning-resources	80	Jupyter Notebook
989	argosopentech/MetalTranslate Customizable machine translation in C++	42	Emerging	neural-machine-translation	56	C++
990	ariya/chat-llm Chat with an LLM	42	Emerging	interactive-ai-chat-uis	18	JavaScript
991	jd-coderepos/llms4subjects The official SemEval 2025 Task 5 - LLMs4Subjects - Shared Task Dataset repository	42	Emerging	llm-domain-datasets	7	—
992	Dartvauder/NeuroSandboxWebUI (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,...	42	Emerging	ocr-document-extraction	108	Python
993	withcaer/curtana Simplified zero-cost wrapper over llama.cpp powered by the llama-cpp-2 crate.	42	Emerging	local-llm-deployment	2	Rust
994	Alpha-VLLM/Lumina-T2X Lumina-T2X is a unified framework for Text to Any Modality Generation	42	Emerging	text-to-image-generation	2,254	Python
995	HamedBabaei/LLMs4OM LLMs4OM: Matching Ontologies with Large Language Models	42	Emerging	llm-training-experimentation	42	Python
996	AbdelStark/attnres Rust implementation of Attention Residuals from MoonshotAI/Kimi	42	Emerging	attention-mechanism-implementations	47	Rust
997	nv-tlabs/LLaMA-Mesh Unifying 3D Mesh Generation with Language Models	42	Emerging	multimodal-vision-language	1,145	Python
998	USC-FORTIS/AD-LLM [ACL Findings 2025] A benchmark for anomaly detection using large language...	42	Emerging	llm-research-curation	41	Python
999	tosiyuki/LLaVA-JP LLaVA-JP is a Japanese VLM trained by LLaVA method	42	Emerging	multimodal-vision-language	64	Python
1000	FreeOCR-AI/layoutreader A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.	42	Emerging	ocr-document-extraction	314	Python
