All Transformer Models

7,795 models ranked by quality score · Page 13 of 78

Showing 1201–1300 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1201	ai4co/parco [NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization	40	Emerging	mathematical-reasoning-transformers	44	Python
1202	hongyehu/Machine_Learning_Quantum_State_Tomography An unofficial pytorch implementation of using generative models to do...	40	Emerging	machine-translation-transformers	37	Python
1203	cdli-gh/Semi-Supervised-NMT-for-Sumerian-English Exploring the Limits of Low-Resource Neural Machine Translation	40	Emerging	neural-machine-translation	34	Jupyter Notebook
1204	asigalov61/Allegro-Music-Transformer Full-attention multi-instrumental music transformer featuring asymmetrical...	40	Emerging	ai-music-generation	48	Python
1205	clabrugere/scratch-llm Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch,...	40	Emerging	llm-implementation-from-scratch	38	Python
1206	SamsungSAILMontreal/ghn3 Code for "Can We Scale Transformers to Predict Parameters of Diverse...	40	Emerging	graph-transformers	39	Shell
1207	amazon-science/unified-ept A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021	40	Emerging	medical-image-segmentation-transformers	31	Python
1208	PediaMedAI/AggPose [IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation...	40	Emerging	3d-vision-transformers	30	Python
1209	lin-tan/clm For our ICSE23 paper "Impact of Code Language Models on Automated Program...	40	Emerging	vulnerability-detection-llm	63	Python
1210	muhtalhakhan/Hacktoberfest2024 Hacktoberfest 2024 🧑🏻‍💻 OPEN FIRST Pull Request 🎉	40	Emerging	ai-powered-saas-startups	8	HTML
1211	VPGTrans/VPGTrans Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA,...	40	Emerging	multimodal-vision-language	269	Python
1212	JackZeng0208/llama.cpp-android-tutorial llama.cpp tutorial on Android phone	40	Emerging	llm-docker-deployments	155	—
1213	bodeby/torchstack 🫧 probability-level model ensembling for transformers	40	Emerging	transformer-architecture-tutorials	3	Python
1214	developer239/llama.cpp-ts llama.cpp 🦙 LLM inference in TypeScript	40	Emerging	local-llm-deployment	3	C++
1215	GAIR-NLP/ProX [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality...	40	Emerging	llm-quantization-methods	266	Python
1216	l294265421/alpaca-rlhf Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)...	40	Emerging	rlhf-alignment-training	117	Python
1217	skit-ai/SpeechLLM This repository contains the training, inference, evaluation code for...	40	Emerging	llm-scaling-architecture	130	Python
1218	amitkedia007/Financial-Fraud-Detection-Using-LLMs The aim of this dissertation is to assess the effectiveness of LLMs such as ...	40	Emerging	ai-stock-analysis	86	Jupyter Notebook
1219	luuyin/OWL Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity...	40	Emerging	llm-compression-optimization	81	Python
1220	yuanzhoulvpi2017/DocumentSearch 基于sentence transformers和chatglm实现的文档搜索工具	40	Emerging	semantic-search-retrieval	157	Python
1221	vmicheli/delta-iris Efficient World Models with Context-Aware Tokenization. ICML 2024	40	Emerging	mathematical-reasoning-transformers	119	Python
1222	IAAR-Shanghai/Grimoire Grimoire is All You Need for Enhancing Large Language Models	40	Emerging	multilingual-llm-adaptation	117	Python
1223	aJupyter/ThinkLLM ThinkLLM：🚀 轻量、高效的大语言模型算法实现	40	Emerging	llm-frameworks-libraries	114	Jupyter Notebook
1224	trrahul/llama2.cs Inference Llama 2 in one file of pure C#	40	Emerging	local-llm-deployment	102	C#
1225	ShiZhengyan/DePT [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed...	40	Emerging	llm-knowledge-distillation	102	Python
1226	wuwangzhang1216/prometheus Fully automatic censorship removal for language models. LoRA abliteration +...	40	Emerging	lora-qlora-fine-tuning	33	Python
1227	OpenSparseLLMs/LLaMA-MoE-v2 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of...	40	Emerging	llm-implementation-from-scratch	93	Python
1228	ukairia777/pytorch-nlp-tutorial pytorch를 사용하여 텍스트 전처리부터 RAG, 에이전트, LLM 파인튜닝을 정리한 Deep Learning NLP 저장소입니다.	40	Emerging	llm-learning-resources	89	Jupyter Notebook
1229	architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction Exploring the potential of fine-tuning Large Language Models (LLMs) like...	40	Emerging	llm-fine-tuning	89	Python
1230	molbal/llm-text-completion-finetune Guide on text completion large language model fine-tuning, including example...	40	Emerging	llm-fine-tuning	87	Python
1231	prajjwal1/fluence A deep learning library based on Pytorch focussed on low resource language...	40	Emerging	transformer-architecture-tutorials	70	Python
1232	JulesBelveze/bert-squeeze 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡	40	Emerging	bert-model-implementations	85	Python
1233	kingabzpro/using-llama3-locally Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface.	40	Emerging	generative-ai-learning	59	Jupyter Notebook
1234	TrustedLLM/LLMDet LLMDet is a text detection tool that can identify which generated sources...	40	Emerging	ai-generated-text-detection	84	Python
1235	rasbt/blog-finetuning-llama-adapters Supplementary material for "Understanding Parameter-Efficient Finetuning of...	40	Emerging	llm-fine-tuning	48	Jupyter Notebook
1236	Bindwell/PLAPT Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding...	40	Emerging	protein-transformers-ml	114	Mathematica
1237	jonrbates/turing A PyTorch library for simulating Turing machines with neural networks, based...	40	Emerging	transformer-architecture-tutorials	2	Python
1238	eduard23144/locoformer 🤖 Explore LocoFormer, a Transformer-XL model that enhances robot locomotion...	40	Emerging	transformer-architecture-tutorials	4	Python
1239	Traffic-Alpha/LLM-Assisted-Light This repository contains the code for the paper "LLM-Assisted Light:...	40	Emerging	multimodal-vision-language-models	99	Python
1240	hans00/react-native-transformers-example Example of transformers.js on React Native	40	Emerging	browser-based-ml-inference	75	TypeScript
1241	Omid-Nejati/BEFUnet A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation	40	Emerging	medical-image-segmentation-transformers	73	Python
1242	ziplab/LIT [AAAI 2022] This is the official PyTorch implementation of "Less is More:...	40	Emerging	transformer-architecture-tutorials	97	Python
1243	chenmozhijin/BSRoformer.cpp GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation...	40	Emerging	llm-inference-engines	8	C++
1244	ShuntaroOkuma/adapt-gauge-core Measure LLM adaptation efficiency — how fast models learn from few examples	40	Emerging	evaluation-frameworks-metrics	5	Python
1245	MetaGLM/FinGLM FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目，利用开源开放来促进「AI+金融」。	40	Emerging	multilingual-llm-adaptation	2,194	HTML
1246	ECNU-ICALK/EduChat An open-source educational chat model from ICALK, East China Normal...	40	Emerging	multilingual-llm-adaptation	913	Jupyter Notebook
1247	neulab/knn-transformers PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling...	40	Emerging	transformer-architecture-tutorials	286	Python
1248	yang-ai-lab/SleepLM SleepLM: Natural-Language Intelligence for Human Sleep	40	Emerging	llm-scaling-architecture	29	Jupyter Notebook
1249	zyds/transformers-code 手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube	40	Emerging	huggingface-learning-resources	3,853	Jupyter Notebook
1250	YadaYuki/transformer-from-scratch Transformer from scratch 🙊 (English to Japanese Translator by PyTorch)	40	Emerging	transformer-architecture-education	31	Python
1251	saqib1707/gpt2-from-scratch PyTorch Implementation of GPT-2	40	Emerging	gpt2-pretraining-fine-tuning	31	Python
1252	infocusp/llm_seminar_series Material for the series of seminars on Large Language Models	40	Emerging	llm-learning-resources	34	Jupyter Notebook
1253	metriccoders/one-line-llm-tuner This repository is the source code for fine tuning any LLM in just one line 🔥	40	Emerging	llm-fine-tuning	4	Python
1254	zejia-lin/BulletServe Boosting GPU utilization for LLM serving via dynamic spatial-temporal...	40	Emerging	llm-inference-engines	37	Python
1255	geobrain-ai/geogalactica Code and datasets for paper "GeoGalactica: A Scientific Large Language Model...	40	Emerging	llm-domain-datasets	40	Python
1256	stanleylsx/llms_tool 一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、D...	40	Emerging	llm-benchmark-leaderboards	223	Python
1257	xjywhu/Awesome-Multimodal-LLM-for-Code Multimodal Large Language Models for Code Generation under Multimodal Scenarios	40	Emerging	code-model-training	221	—
1258	jha-lab/acceltran [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers	40	Emerging	power-transformer-design	58	Python
1259	ChristophReich1996/Swin-Transformer-V2 PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up...	40	Emerging	vision-transformer-optimization	205	Python
1260	HqWu-HITCS/Awesome-Chinese-LLM 整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。	40	Emerging	multilingual-llm-adaptation	22,371	—
1261	VITA-Group/Q-GaLore Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank...	40	Emerging	llm-quantization-techniques	203	Python
1262	aniketmaurya/llm-inference Large Language Model (LLM) Inference API and Chatbot	40	Emerging	llm-inference-engines	127	Python
1263	monologg/KoBigBird 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)	40	Emerging	korean-language-models	201	Python
1264	samestrin/llm-pdf-ocr-api A Python-based REST API for PDF OCR using AI models with PyTorch and...	40	Emerging	ocr-document-extraction	34	Python
1265	AllenXiangX/SnowflakeNet (TPAMI 2023) Snowflake Point Deconvolution for Point Cloud Completion and...	40	Emerging	3d-vision-transformers	200	Python
1266	google-research/magvit Official JAX implementation of MAGVIT: Masked Generative Video Transformer	40	Emerging	transformer-frameworks-wrappers	995	Python
1267	DannyArends/DLLM A minimal, clean D language interface for running LLM inference using...	40	Emerging	langchain-framework-learning	5	D
1268	AlexandrosChrtn/llama-fine-tune-guide Fine-tune the newly released Llama-3.2 lightweight models.	40	Emerging	llm-fine-tuning	22	Python
1269	iverly/llamafile-docker Distribute and run llamafile/LLMs with a single docker image.	40	Emerging	local-llm-deployment	74	Dockerfile
1270	rednote-hilab/dots.llm1 The official repository of the dots.llm1 base and instruct models proposed...	40	Emerging	llm-learning-resources	490	—
1271	google-deepmind/gemma_penzai A JAX Research Toolkit for Visualizing, Manipulating, and Understanding...	40	Emerging	lora-qlora-fine-tuning	90	Jupyter Notebook
1272	vicgalle/zero-shot-reward-models ZYN: Zero-Shot Reward Models with Yes-No Questions	40	Emerging	llm-recommendation-systems	35	Python
1273	git-cloner/llama-lora-fine-tuning llama fine-tuning with lora	40	Emerging	lora-qlora-fine-tuning	140	Python
1274	RedHatResearch/conext24-NetConfEval Benchmark for evaluating LLMs in network configuration problems.	40	Emerging	domain-specific-benchmarks	34	Python
1275	ymcui/Chinese-Mixtral 中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）	40	Emerging	llm-compression-optimization	610	Python
1276	takashiishida/paper2slides Transform any arXiv papers into slides using LLMs	40	Emerging	generative-ai-platforms	75	Python
1277	hkust-nlp/deita Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]	40	Emerging	instruction-tuning-datasets	591	Python
1278	gusye1234/llm-as-function Embed your LLM into a python function	40	Emerging	llm-function-calling	22	Python
1279	sanjibnarzary/awesome-llm Curated list of open source and openly accessible large language models	40	Emerging	multilingual-llm-adaptation	25	—
1280	modelscope/mcore-bridge MCore-Bridge: Providing Megatron-Core model definitions for state-of-the-art...	40	Emerging	—	31	Python
1281	Sachithx/EntroPE This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder...	40	Emerging	time-series-forecasting-transformers	41	Python
1282	softengg-manoj/dreamer4 🌟 Implement Dreamer 4 for training agents within scalable world models,...	40	Emerging	mathematical-reasoning-transformers	4	Python
1283	sisinflab/Ducho Ducho is a Python framework aimed to extract multimodal features used in...	40	Emerging	multimodal-fusion-transformers	26	Python
1284	abaheti95/LoL-RL Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving...	40	Emerging	variational-autoencoders-nlp	26	Python
1285	harishdeivanayagam/rowfill Open-source spreadsheets platform for deep research and document processing	40	Emerging	interactive-ai-chat-uis	368	TypeScript
1286	johnmai-dev/NotebookMLX 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)	40	Emerging	llm-training-experimentation	339	Jupyter Notebook
1287	JKevin17/TM-LLM The official code for "(ISCC 2025) Network Traffic Matrix Imputation via...	40	Emerging	llm-scaling-architecture	6	Python
1288	sodascience/workshop_llm_data_collection This repository contains the code and slides for our workshop on data...	40	Emerging	llm-learning-resources	1	Jupyter Notebook
1289	KolosalAI/kolosal-cli Super lightweight Ollama + Qwen Code alternative to run Llama 3.3,...	40	Emerging	local-llm-deployment	466	TypeScript
1290	YuweiYin/FinPT FinPT: Financial Risk Prediction with Profile Tuning on Pretrained Foundation Models	40	Emerging	gpt-model-fine-tuning	39	Python
1291	tpoisonooo/llama.onnx LLaMa/RWKV onnx models, quantization and testcase	40	Emerging	llama-model-implementations	366	Python
1292	prismformore/Multi-Task-Transformer Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting...	40	Emerging	vision-transformer-optimization	327	Python
1293	bigcode-project/selfcodealign [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation	40	Emerging	llm-knowledge-editing	323	Python
1294	upb-lea/mag-net-hub MagNet Toolkit - Certified Models of the MagNet Challenge	40	Emerging	power-transformer-design	18	Python
1295	sunnynguyen-ai/llm-attention-visualizer Interactive tool for analyzing attention patterns in transformer models with...	40	Emerging	attention-mechanism-implementations	14	Python
1296	nlpodyssey/cybertron Cybertron: the home planet of the Transformers in Go	40	Emerging	transformer-frameworks-wrappers	325	Go
1297	xNul/code-llama-for-vscode Use Code Llama with Visual Studio Code and the Continue extension. A local...	40	Emerging	code-completion-copilots	569	Python
1298	dingo-actual/infini-transformer PyTorch implementation of Infini-Transformer from "Leave No Context Behind:...	40	Emerging	transformer-architecture-tutorials	298	Python
1299	swordlidev/Efficient-Multimodal-LLMs-Survey Efficient Multimodal Large Language Models: A Survey	40	Emerging	llm-research-curation	389	—
1300	zyushun/Adam-mini Code for Adam-mini: Use Fewer Learning Rates To Gain More...	40	Emerging	llm-compression-optimization	453	Python

« Prev 1 2 3 … 11 12 13 14 15 … 76 77 78 Next »