All Transformer Models

7,795 models ranked by quality score · Page 5 of 78

Showing 401–500 of 7,795

#	Model	Score	Tier	Category	Stars	Language
401	Deep-Spark/DeepSparkInference DeepSparkInference has selected 216 inference models of both small and large...	49	Emerging	llm-inference-engines	28	Python
402	sapientinc/HRM Hierarchical Reasoning Model Official Release	49	Emerging	llm-reasoning-research	12,358	Python
403	MediaBrain-SJTU/MING MING (明医): a Chinese large language model for medical consultation	49	Emerging	multilingual-llm-adaptation	1,109	Python
404	higgsfield-ai/higgsfield Fault-tolerant, highly scalable GPU orchestration, and a machine learning...	49	Emerging	llm-inference-engines	3,558	Jupyter Notebook
405	muxi-ai/onellm Unified interface for interacting with various LLMs hundreds of models,...	49	Emerging	llm-orchestration-platforms	44	Python
406	Leeroo-AI/mergoo A library for easily merging multiple LLM experts, and efficiently train the...	49	Emerging	llm-training-experimentation	507	Python
407	rese1f/MovieChat [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding	49	Emerging	vision-language-instruction-tuning	688	Python
408	kyegomez/MultiModalMamba A novel implementation of fusing ViT with Mamba into a fast, agile, and high...	49	Emerging	3d-vision-transformers	465	Python
409	EvelynFan/FaceFormer [CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers	49	Emerging	character-motion-animation	907	Python
410	wxhcore/bumblecore An LLM training framework built from the ground up, featuring a custom...	49	Emerging	llm-frameworks-libraries	63	Python
411	shell-nlp/gpt_server gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.	49	Emerging	multi-provider-llm-interfaces	246	Python
412	riyanshibohra/TuneKit Upload your data → Get a fine-tuned SLM. Free.	49	Emerging	llm-fine-tuning	138	Python
413	VectorInstitute/vector-inference Efficient LLM inference on Slurm clusters.	49	Emerging	llm-inference-serving	95	Python
414	tjake/Jlama Jlama is a modern LLM inference engine for Java	49	Emerging	local-llm-deployment	1,259	Java
415	wuwangzhang1216/abliterix Fully automatic censorship removal for language models. LoRA abliteration +...	49	Emerging	—	47	Python
416	floriankark/cs224n-win2223 Code and written solutions of the assignments of the Stanford CS224N:...	49	Emerging	nlp-learning-coursework	272	Python
417	time-series-foundation-models/lag-llama Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting	49	Emerging	multilingual-llm-adaptation	1,556	Python
418	vtuber-plan/langport Langport is a language model inference service	49	Emerging	prompt-engineering-security	94	Python
419	tomaarsen/attention_sinks Extend existing LLMs way beyond the original training length with constant...	49	Emerging	transformer-architecture-tutorials	736	Python
420	ngxson/wllama WebAssembly binding for llama.cpp - Enabling on-browser LLM inference	49	Emerging	local-llm-deployment	1,013	TypeScript
421	dell-research-harvard/linktransformer A convenient way to link, deduplicate, aggregate and cluster data(frames) in...	49	Emerging	transformer-architecture-tutorials	135	Python
422	maxischuh/TwinBooster Package for TwinBooster. Enables fast and powerful zero-shot molecular...	49	Emerging	chemistry-llm-benchmarks	6	Python
423	jy-yuan/KIVI [ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache	49	Emerging	llm-quantization-methods	359	Python
424	balisujohn/localwriter A LibreOffice Writer extension that adds local-inference generative AI features.	49	Emerging	ai-powered-business-analytics	163	Python
425	EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications...	49	Emerging	diffusion-language-models	689	—
426	Shivanandroy/KeyPhraseTransformer KeyPhraseTransformer lets you quickly extract key phrases, topics, themes...	49	Emerging	t5-mt5-fine-tuning	106	Python
427	huggingface/tflite-android-transformers DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with...	49	Emerging	transformer-frameworks-wrappers	420	Java
428	IntelLabs/nlp-architect A model library for exploring state-of-the-art deep learning topologies and...	49	Emerging	model-evaluation-diagnostics	2,935	Python
429	hscspring/hcgf Humanable Chat Generative-model Fine-tuning | LLM fine-tuning	49	Emerging	rlhf-alignment-training	207	Python
430	yoshoku/llama_cpp.rb llama_cpp.rb provides Ruby bindings for llama.cpp	49	Emerging	local-llm-deployment	232	C
431	alephpi/Texo A minimalist SOTA LaTeX OCR model with only 20M parameters, running in...	49	Emerging	ocr-document-extraction	747	Python
432	OpenGVLab/OmniQuant [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization...	49	Emerging	llm-quantization-techniques	890	Python
433	HUSTAI/uie_pytorch PyTorch implementation of the PaddleNLP UIE model	49	Emerging	transformer-architecture-tutorials	683	Python
434	MadryLab/context-cite Attribute (or cite) statements generated by LLMs back to in-context information.	49	Emerging	llm-interpretability-explainability	325	Jupyter Notebook
435	AMontgomerie/question_generator An NLP system for generating reading comprehension questions	49	Emerging	question-answering-systems	298	Python
436	intel/ipex-llm Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,...	49	Emerging	llm-inference-engines	8,724	Python
437	oripress/AlgoTune AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and...	49	Emerging	code-model-training	95	Python
438	multimodal-art-projection/YuE YuE: Open Full-song Music Generation Foundation Model, something similar to...	49	Emerging	music-generation-transformers	6,083	Python
439	larslorch/avici Amortized Inference for Causal Structure Learning, NeurIPS 2022	49	Emerging	mathematical-reasoning-transformers	72	Python
440	helpmefindaname/transformer-smaller-training-vocab Temporarily remove unused tokens during training to save RAM and improve speed.	48	Emerging	transformer-architecture-tutorials	23	Python
441	graphdeeplearning/graphtransformer Graph Transformer Architecture. Source code for "A Generalization of...	48	Emerging	graph-transformers	1,019	Python
442	WangRongsheng/XrayGLM 🩺 The first Chinese multimodal medical large model that can read chest X-rays	48	Emerging	clinical-llm-tools	1,066	Python
443	curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers...	48	Emerging	review-sentiment-classification	209	Python
444	jmont-dev/ollama-hpp Modern, Header-only C++ bindings for the Ollama API.	48	Emerging	local-llm-deployment	213	C++
445	fcakyon/video-transformers Easiest way of fine-tuning HuggingFace video classification models	48	Emerging	vision-transformer-implementations	148	Python
446	OFA-Sys/Chinese-CLIP Chinese version of CLIP which achieves Chinese cross-modal retrieval and...	48	Emerging	clip-image-embeddings	5,820	Jupyter Notebook
447	Beomi/KoAlpaca KoAlpaca: An open-source language model that understands Korean instructions	48	Emerging	multilingual-llm-adaptation	1,578	Jupyter Notebook
448	ChanithaAbey/AI-Agent-for-Stock-Prediction An AI Agent for stock data analysis, news retrieval, and prediction; powered...	48	Emerging	ai-powered-business-analytics	20	Python
449	xrsrke/toolformer Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools	48	Emerging	llm-robot-planning	144	Jupyter Notebook
450	hila-chefer/Transformer-Explainability [CVPR 2021] Official PyTorch implementation for Transformer Interpretability...	48	Emerging	explainability-interpretability-frameworks	1,981	Jupyter Notebook
451	X-D-Lab/LangChain-ChatGLM-Webui Automatic question answering over local knowledge bases, built on LangChain and LLMs such as the ChatGLM-6B series	48	Emerging	multilingual-llm-adaptation	3,307	Python
452	steering-vectors/steering-vectors Steering vectors for transformer language models in Pytorch / Huggingface	48	Emerging	llm-knowledge-editing	140	Python
453	kyegomez/GPT4o Community Open Source Implementation of GPT4o in PyTorch	48	Emerging	gpt2-pretraining-fine-tuning	26	Shell
454	VHellendoorn/Code-LMs Guide to using pre-trained large language models of source code	48	Emerging	llm-finetuning-frameworks	1,842	Python
455	blegat/LINMA2472 Course material for the course LINMA2472 at UCLouvain	48	Emerging	llm-learning-resources	2	Julia
456	ymcui/Chinese-LLaMA-Alpaca Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment	48	Emerging	multilingual-llm-adaptation	18,970	Python
457	fboulnois/llama-cpp-docker Run llama.cpp in a GPU accelerated Docker container	48	Emerging	local-llm-deployment	63	Dockerfile
458	cheahjs/free-llm-api-resources A list of free LLM inference resources accessible via API.	48	Emerging	local-llm-deployment	15,475	Python
459	TUDB-Labs/mLoRA An Efficient "Factory" to Build Multiple LoRA Adapters	48	Emerging	lora-qlora-fine-tuning	373	Python
460	NVIDIA-AI-IOT/nanoowl A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.	48	Emerging	transformer-training-optimization	409	Python
461	Tencent/TencentPretrain Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo	48	Emerging	nlp-learning-resources	1,090	Python
462	kyegomez/LIMoE Implementation of the "the first large-scale multimodal mixture of experts...	48	Emerging	mixup-augmentation-frameworks	36	Python
463	snowby666/poe-api-wrapper 👾 A Python API wrapper for Poe.com. With this, you will have free access to...	48	Emerging	multi-provider-llm-interfaces	1,113	Python
464	yuriwa/crewai-sheets-ui Use google sheets as a gui for crewAI	48	Emerging	interactive-ai-chat-uis	76	Python
465	deepset-ai/FARM 🏡 Fast & easy transfer learning for NLP. Harvesting...	48	Emerging	bert-model-frameworks	1,752	Python
466	microsoft/augmented-interpretable-models Interpretable and efficient predictors using pre-trained language models....	48	Emerging	llm-interpretability-explainability	44	Jupyter Notebook
467	mallorbc/Finetune_LLMs Repo for fine-tuning Causal LLMs	48	Emerging	lora-qlora-fine-tuning	458	Python
468	FoundationVision/Infinity [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for...	48	Emerging	text-to-image-generation	1,553	Python
469	nuance1979/llama-server LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.	48	Emerging	interactive-ai-chat-uis	134	Python
470	gjbex/Deploying-LLMs-locally Material for a training on AI tools	48	Emerging	llm-training-experimentation	18	Jupyter Notebook
471	AndrewZhe/lawyer-llama LLaMA for the Chinese legal domain	48	Emerging	multilingual-llm-adaptation	984	Python
472	local-ai-zone/local-ai-zone.github.io Discover the Best AI Models for Your PC	48	Emerging	local-llm-deployment	20	HTML
473	affjljoo3581/GPT2 PyTorch Implementation of OpenAI GPT-2	48	Emerging	gpt2-language-models	357	Python
474	MiniMax-AI/MiniMax-01 The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model...	48	Emerging	llm-frameworks-libraries	3,363	Python
475	Esmail-ibraheem/Axon AI research lab🔬: implementations of AI papers and theoretical research:...	48	Emerging	ml-foundations-curricula	18	Python
476	chengchingwen/Transformers.jl Julia Implementation of Transformer models	48	Emerging	julia-ml-frameworks	568	Julia
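477	datawhalechina/llms-from-scratch-cn Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work	48	Emerging	llm-implementation-from-scratch	4,010	Jupyter Notebook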
478	rojagtap/transformer-abstractive-summarization Abstractive Text Summarization using Transformer	48	Emerging	text-summarization-transformers	168	Jupyter Notebook
479	bryanlimy/tf2-transformer-chatbot Transformer Chatbot in TensorFlow 2 with TPU support.	48	Emerging	chatgpt-api-tutorials	131	Jupyter Notebook
480	monologg/GoEmotions-pytorch Pytorch Implementation of GoEmotions 😍😢😱	48	Emerging	emotion-detection-transformers	166	Python
481	kyegomez/HLT Implementation of the transformer from the paper: "Real-World Humanoid...	48	Emerging	transformer-architecture-tutorials	62	Python
482	explosion/curated-transformers 🤖 A PyTorch library of curated Transformer models and their composable components	48	Emerging	transformer-frameworks-wrappers	894	Python
483	ruanchaves/hashformers Accurate word segmentation for hashtags and text, powered by Transformers...	48	Emerging	transformer-frameworks-wrappers	77	Python
484	Thinklab-SJTU/Crossformer Official implementation of our ICLR 2023 paper "Crossformer: Transformer...	48	Emerging	time-series-forecasting-transformers	669	Python
485	NVIDIA/FasterTransformer Transformer related optimization, including BERT, GPT	48	Emerging	transformer-architecture-education	6,398	C++
486	worldbank/REaLTabFormer A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer...	48	Emerging	creative-text-generation	244	Jupyter Notebook
487	slwang-ustc/nano-vllm-v1 Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill	48	Emerging	llm-inference-engines	61	Python
488	THUDM/LongWriter [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs	48	Emerging	diffusion-language-models	1,839	Python
489	IbrahimSobh/llms Large Language Models: In this repository Language models are introduced...	48	Emerging	llm-training-experimentation	394	Jupyter Notebook
490	OscarKjell/text Using Transformers from HuggingFace in R	48	Emerging	huggingface-learning-resources	157	R
491	microsoft/DialoGPT Large-scale pretraining for dialogue	48	Emerging	chatgpt-api-tutorials	2,422	Python
492	SakanaAI/doc-to-lora Hypernetworks that update LLMs to remember factual information	48	Emerging	agent-memory-infrastructure	545	Python
493	tensorops/TransformerX Flexible Python library providing building blocks (layers) for reproducible...	48	Emerging	transformer-architecture-tutorials	53	Python
494	SearchSavior/OpenArc Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS,...	48	Emerging	llm-inference-engines	341	Python
495	thammegowda/nllb-serve Meta's "No Language Left Behind" models served as web app and REST API	48	Emerging	neural-machine-translation	256	Python
496	spencerbraun/anomaly_transformer_pytorch PyTorch implementation of Anomaly Transformer: Time Series Anomaly Detection...	48	Emerging	time-series-forecasting-transformers	252	Jupyter Notebook
497	jianghoucheng/AlphaEdit AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,...	48	Emerging	llm-knowledge-editing	423	Python
498	kakaobrain/kogpt KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)	48	Emerging	gpt2-pretraining-fine-tuning	1,014	Python
499	ALucek/ppt2desc Convert PowerPoint files into semantically rich text using vision language models	48	Emerging	ai-presentation-generation	113	Python
500	Facico/Chinese-Vicuna Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——...	48	Emerging	multilingual-llm-adaptation	4,136	C
