All Transformer Models

7,795 models ranked by quality score · Page 16 of 78

Showing 1501–1600 of 7,795

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1501	parvbhullar/superpilot LLMs based multi-model framework for building AI apps.	38	Emerging	llm-framework-abstractions	21	Python
1502	deep-symbolic-mathematics/Multimodal-Symbolic-Regression [ICLR 2024 Spotlight] SNIP on Symbolic Regression: Deep Symbolic Regression...	38	Emerging	mathematical-reasoning-transformers	21	Python
1503	jaco-bro/MLX.zig MLX.zig: Phi-4, Llama 3.2, and Whisper in Zig	38	Emerging	local-llm-deployment	33	Zig
1504	Infini-AI-Lab/vortex_torch Vortex: A Flexible and Efficient Sparse Attention Framework	38	Emerging	sparse-attention-optimization	49	Python
1505	InhwanBae/LMTrajectory Official Code for "Can Language Beat Numerical Regression? Language-Based...	38	Emerging	game-playing-agents	159	Python
1506	daniel-furman/sft-demos Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and...	38	Emerging	rlhf-alignment-training	77	Jupyter Notebook
1507	zjukg/KoPA [Paper][ACM MM 2024] Making Large Language Models Perform Better in...	38	Emerging	llm-knowledge-graph-generation	214	Python
1508	Longyichen/Alpaca-family-library Summarize all open source Large Languages Models and low-cost replication...	38	Emerging	multilingual-llm-adaptation	136	—
1509	hao-ai-lab/Consistency_LLM [ICML 2024] CLLMs: Consistency Large Language Models	38	Emerging	llm-interpretability-explainability	413	Python
1510	AIoT-MLSys-Lab/Efficient-LLMs-Survey [TMLR 2024] Efficient Large Language Models: A Survey	38	Emerging	llm-research-curation	1,256	—
1511	miranthajayatilake/nanoQA Question-answering on your own data with Large Language Models (LLMs)	38	Emerging	question-answering-systems	23	Python
1512	ivanfioravanti/wine_variety_classification Examples on how to use various LLM providers with a Wine Classification problem	38	Emerging	llm-learning-resources	129	Python
1513	otadk/nuxt-edge-ai Nuxt module for local-first AI apps with server-side WASM inference via...	38	Emerging	browser-based-ml-inference	33	TypeScript
1514	EagleW/Stage-wise-Fine-tuning Code for Stage-wise Fine-tuning for Graph-to-Text Generation	38	Emerging	gpt2-pretraining-fine-tuning	26	Lex
1515	dbmdz/berts DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models	38	Emerging	bert-model-implementations	159	—
1516	rohit901/VANE-Bench [NAACL'25] Contains code and documentation for our VANE-Bench paper.	38	Emerging	domain-specific-benchmarks	23	Python
1517	dohlee/chromoformer The official code implementation for Chromoformer in PyTorch. (Lee et al.,...	38	Emerging	mixup-augmentation-frameworks	37	Python
1518	samestrin/llm-newsletter-generator llm-newsletter-generator transforms a valid RSS feed into a "Newsletter"...	38	Emerging	youtube-video-summarization	13	Python
1519	WENGSYX/LMTuner LMTuner: Make the LLM Better for Everyone	38	Emerging	llm-finetuning-frameworks	38	Python
1520	kyegomez/qformer Implementation of Qformer from BLIP2 in Zeta Lego blocks.	38	Emerging	vision-language-models	48	Python
1521	amin-tehrani/ollama-colab Serve Ollama LLMs on Google Colab (free plan) using Ngrok	38	Emerging	local-llm-deployment	26	Jupyter Notebook
1522	cocktailpeanut/dalai The simplest way to run LLaMA on your local machine	38	Emerging	local-llm-deployment	12,980	CSS
1523	RightNow-AI/TIDE Dynamic per-token early exit for LLM inference. Skip layers tokens don't need	38	Emerging	llm-cost-tracking	3	Python
1524	Kagamma/llama-pas Free Pascal bindings for llama.cpp	38	Emerging	local-llm-deployment	23	Pascal
1525	jie-jw-wu/human-eval-comm HumanEvalComm: Evaluating Communication Skill of Code LLM and LLM Agent	38	Emerging	code-model-training	11	Python
1526	pmichel31415/are-16-heads-really-better-than-1 Code for the paper "Are Sixteen Heads Really Better than One?"	38	Emerging	transformer-architecture-education	175	Shell
1527	ma2za/telegram-llm-bot Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB	38	Emerging	messaging-platform-chatbots	112	Python
1528	IvanBongiorni/maximal A TensorFlow-compatible Python library that provides models and layers to...	38	Emerging	transformer-architecture-tutorials	9	Python
1529	cmhungsteve/Awesome-Transformer-Attention An ultimately comprehensive paper list of Vision Transformer/Attention,...	38	Emerging	vision-transformer-implementations	5,022	—
1530	chenhan97/TimeLlama The official repo of TimeLlama, an instruction-finetuned Llama2 series that...	38	Emerging	llm-frameworks-libraries	43	Python
1531	hasanirtiza/PedesFormer-Transformer-Networks-For-Pedestrian-Detection Transformer Networks for Pedestrian Detection	38	Emerging	3d-vision-transformers	43	Python
1532	AnkitNayak-eth/llmBench llmBench is a high-depth benchmarking tool designed to measure the raw...	38	Emerging	domain-specific-benchmarks	24	Python
1533	di37/finetuning-quantize-evaluate Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models	38	Emerging	llm-fine-tuning	13	Typst
1534	takara-ai/go-attention A full attention mechanism and transformer in pure go.	38	Emerging	attention-mechanism-implementations	451	Go
1535	botisan-ai/sentence-transformers.js Run sentence-transformers (SBERT) compatible models in Node.js or browser.	38	Emerging	browser-based-ml-inference	24	TypeScript
1536	rust-dd/iTransformer An iTransformer implementation in Rust	38	Emerging	browser-based-ml-inference	19	Rust
1537	pyladiesams/eval-llm-based-apps-jan2025 Create an evaluation framework for your LLM based app. Incorporate it into...	38	Emerging	evaluation-frameworks-metrics	8	Jupyter Notebook
1538	albrateanu/ModalFormer [2025] ModalFormer: Multimodal Transformer for Low-Light Image Enhancement	38	Emerging	multimodal-fusion-transformers	25	Python
1539	AmpereComputingAI/llama.cpp Ampere optimized llama.cpp	38	Emerging	llm-inference-engines	33	Python
1540	mbzuai-oryx/Awesome-LLM-Post-training Awesome Reasoning LLM Tutorial/Survey/Guide	38	Emerging	llm-reasoning-research	2,321	Python
1541	datawhalechina/diy-llm 🎓 系统性大语言模型构建课程｜🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程...	38	Emerging	llm-learning-resources	62	Jupyter Notebook
1542	rosinality/halite Acceleration framework for Human Alignment Learning	38	Emerging	rlhf-alignment-training	13	Python
1543	iflytek/VLE VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)	38	Emerging	multimodal-vision-language	194	Python
1544	bwittmann/transoar A 3D medical Detection Transformer library. Papers accepted @ MIDL22 & MELBA23/02.	38	Emerging	medical-image-segmentation	72	Python
1545	biswassanket/DocSegTr A Bottom-Up Instance Segmentation Strategy for segmenting document instances...	38	Emerging	medical-image-segmentation-transformers	59	Python
1546	lenguajenatural-ai/autotransformers A Python package for automatically training and comparing language models.	38	Emerging	machine-translation-transformers	49	Python
1547	viddexa/moderators One package to moderate them all	38	Emerging	hate-speech-detection	5	Python
1548	osainz59/Ask2Transformers A Framework for Textual Entailment based Zero Shot text classification	38	Emerging	text-classification-transformers	153	Python
1549	EvilFreelancer/impruver A set of scripts and configurations for pretraining of Large Language Models (LLM)	38	Emerging	llm-training-experimentation	36	Python
1550	Sandipan99/IndMask IndMask: Inductive Explanation for Multivariate Time Series Black-box Model	38	Emerging	transformer-interpretability-mechanistic	5	Python
1551	Nkluge-correa/TeenyTinyLlama A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙	38	Emerging	llama-model-implementations	44	Python
1552	yizhangele/llm-guided-mod-optimization Implementation for “Hierarchical Optimization via LLM-Guided Objective...	38	Emerging	prompt-optimization-systems	3	Python
1553	epfml/llm-optimizer-benchmark Benchmarking Optimizers for LLM Pretraining	38	Emerging	domain-specific-benchmarks	56	Python
1554	DirtyHarryLYL/Transformer-in-Vision Recent Transformer-based CV and related works.	38	Emerging	vision-transformer-optimization	1,339	—
1555	Kirill-Kravtsov/drophead-pytorch An implementation of drophead regularization for pytorch transformers	38	Emerging	transformer-architecture-tutorials	19	Python
1556	dcaffo98/transpormer TranSPormer: a transformer for the Travelling Salesman Problem	38	Emerging	transformer-architecture-education	26	Python
1557	TrevTron/indiedroid-nova-llm Running Llama 3.1 8B and other LLMs on RK3588 NPU - benchmarks and setup guides	38	Emerging	llm-inference-engines	3	Python
1558	kolinko/effort An implementation of bucketMul LLM inference	38	Emerging	apple-silicon-llm-inference	227	Swift
1559	NiuTrans/LMT Building a inclusive, scalable, and high-performance multilingual translation model	38	Emerging	llm-scaling-architecture	125	Python
1560	jlin816/dynalang Code for "Learning to Model the World with Language." ICML 2024 Oral.	38	Emerging	llm-finetuning-frameworks	414	Python
1561	ymoslem/Adaptive-MT-LLM-Fine-tuning Fine-tuning Open-Source LLMs for Adaptive Machine Translation	38	Emerging	llm-fine-tuning	92	Jupyter Notebook
1562	yueyu1030/AttrPrompt [NeurIPS 2023] This is the code for the paper `Large Language Model as...	38	Emerging	llm-interpretability-explainability	156	Python
1563	mikemayuare/apetokenizer Tokenizer for chemnical SMILES and SELFIES for use in transformers models.	38	Emerging	molecular-generation-transformers	26	Python
1564	shufangxun/LLaVA-MoD [ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation	38	Emerging	llm-knowledge-distillation	223	Python
1565	OFA-Sys/OFASys OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models	38	Emerging	multimodal-fusion-transformers	151	Python
1566	awneesht/KVShuttle Benchmark & decision framework for KV cache transfer compression in...	38	Emerging	llm-quantization-methods	5	Python
1567	HillZhang1999/ICD Code & Data for our Paper "Alleviating Hallucinations of Large Language...	38	Emerging	llm-hallucination-mitigation	69	Python
1568	ZongXR/8th-National-AI-Training-Competition 第八届全国职工职业技能大赛人工智能训练师赛项	38	Emerging	ml-foundations-curricula	13	Jupyter Notebook
1569	OFA-Sys/ExpertLLaMA An opensource ChatBot built with ExpertPrompting which achieves 96% of...	38	Emerging	messaging-platform-chatbots	302	Python
1570	LostBeard/SpawnDev.BlazorJS.TransformersJS Use Transformers.js from Blazor WebAssembly to run pretrained models with...	38	Emerging	browser-based-ml-inference	8	C#
1571	katanaml/table-query-model Table Query with ML	38	Emerging	ml-benchmarking-frameworks	14	Python
1572	GiovanniGatti/socratic-llm Training pipeline for fine tuning Phi-3-mini-instruct to follow the Socratic method	38	Emerging	llm-fine-tuning	31	Python
1573	wenge-research/YAYI 雅意大模型：为客户打造安全可靠的专属大模型，基于大规模中英文多领域指令数据训练的 LlaMA 2 & BLOOM...	38	Emerging	multilingual-llm-adaptation	3,051	Python
1574	Curated-Awesome-Lists/awesome-llms-fine-tuning Explore a comprehensive collection of resources, tutorials, papers, tools,...	38	Emerging	llm-training-experimentation	505	—
1575	JinhaoLee/WCA [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in...	38	Emerging	multimodal-vision-language	19	Python
1576	minosvasilias/godot-dodo Finetuning large language models for GDScript generation.	38	Emerging	lora-qlora-fine-tuning	567	Python
1577	InnovatorLM/Innovator-VL Fully Open-source Multimodal Language Models for Science Discovery	38	Emerging	multimodal-vision-language	130	Python
1578	OnlyTerp/kvtc First open-source KVTC implementation (NVIDIA, ICLR 2026) -- 8-32x KV cache...	38	Emerging	kv-cache-optimization	5	Python
1579	iVishalr/GPT A minimal and efficient Pytorch implementation of OpenAI's GPT (Generative...	38	Emerging	gpt2-pretraining-fine-tuning	18	Jupyter Notebook
1580	ManasVardhan/bench-my-llm 🏎️ Dead-simple LLM benchmarking CLI - latency, cost, and quality metrics	38	Emerging	llm-evaluation-benchmarking	1	Python
1581	VikingOwl91/vessel A lightweight, local-first web UI for managing Ollama models.	38	Emerging	interactive-ai-chat-uis	3	Svelte
1582	icon-lab/SLATER Official implementation of the paper: Unsupervised MRI Reconstruction via...	38	Emerging	3d-vision-transformers	41	Python
1583	arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for...	38	Emerging	gpt2-pretraining-fine-tuning	113	Jupyter Notebook
1584	stylellm/stylellm_models StyleLLM文风大模型：基于大语言模型的文本风格迁移项目。Text style transfer base on Large Language...	38	Emerging	llm-training-experimentation	352	—
1585	sotiraslab/AgileFormer This the repo for the paper tiltled "AgileFormer: Spatially Agile...	38	Emerging	medical-image-segmentation-transformers	81	Python
1586	JosefAlbers/VL-JEPA VL-JEPA (Vision-Language Joint Embedding Predictive Architecture) in MLX	38	Emerging	multimodal-vision-language	76	Python
1587	kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes What can I do with a LLM model?	38	Emerging	llm-training-experimentation	157	Jupyter Notebook
1588	efeslab/Nanoflow A throughput-oriented high-performance serving framework for LLMs	38	Emerging	llm-inference-engines	949	Jupyter Notebook
1589	SqueezeAILab/LLM2LLM [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement	38	Emerging	llm-quantization-methods	194	Python
1590	eqimp/hogwild_llm Official PyTorch implementation for Hogwild! Inference: Parallel LLM...	38	Emerging	llm-cuda-optimization	140	Python
1591	zhanshijinwat/Steel-LLM Train a 1B LLM with 1T tokens from scratch by personal	38	Emerging	llm-implementation-from-scratch	791	Jupyter Notebook
1592	kyegomez/CNNGPT This CNN-based language model leverages causal and dilated convolutions,...	38	Emerging	gpt2-pretraining-fine-tuning	4	Python
1593	anthonyfoust/ai-stack-homelab Complete AI automation stack optimized for Mac Mini M4, but can work in...	38	Emerging	local-llm-deployment	7	Shell
1594	Gurumurthy30/Stackformer Modular PyTorch transformer library for building, training, and...	38	Emerging	transformer-architecture-tutorials	7	Python
1595	itsnamgyu/block-transformer Block Transformer: Global-to-Local Language Modeling for Fast Inference...	38	Emerging	kv-cache-optimization	163	Python
1596	Sakeeb91/text2sql-agent Self-correcting AI agent for natural language to SQL using HuggingFace...	38	Emerging	multi-agent-orchestration	3	Python
1597	WhereIsAI/BiLLM Tool for converting LLMs from uni-directional to bi-directional by removing...	38	Emerging	llm-training-experimentation	65	Python
1598	tomekkorbak/pretraining-with-human-feedback Code accompanying the paper Pretraining Language Models with Human Preferences	38	Emerging	rlhf-alignment-training	180	Python
1599	sayakpaul/probing-vits Probing the representations of Vision Transformers.	38	Emerging	vit-image-classification	340	Jupyter Notebook
1600	ccdv-ai/convert_checkpoint_to_lsg Efficient Attention for Long Sequence Processing	38	Emerging	transformer-architecture-tutorials	98	Python

« Prev 1 2 3 … 14 15 16 17 18 … 76 77 78 Next »