Local LLM Deployment Transformer Models

Tools and resources for running, hosting, and serving open-source LLMs locally or on private infrastructure without cloud dependencies. Includes deployment platforms, free API gateways, optimization guides, and access control for self-hosted models. Does NOT include model training, fine-tuning frameworks, or cloud-based LLM services.

There are 257 local llm deployment models tracked. 2 score above 70 (verified tier). The highest-rated is ludwig-ai/ludwig at 77/100 with 11,657 stars. 9 of the top 10 are actively maintained.

Get all 257 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=local-llm-deployment&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	ludwig-ai/ludwig Low-code framework for building custom LLMs, neural networks, and other AI models	77	Verified	11,657	Python
2	withcatai/node-llama-cpp Run AI models locally on your machine with node.js bindings for llama.cpp....	73	Verified	1,942	TypeScript
3	mudler/LocalAI :robot: The free, Open Source alternative to OpenAI, Claude and others....	67	Established	43,530	Go
4	zhudotexe/kani kani (カニ) is a highly hackable microframework for tool-calling language...	66	Established	599	Python
5	SciSharp/LLamaSharp A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.	65	Established	3,572	C#
6	kaito-project/aikit 🏗️ Fine-tune, build, and deploy open-source LLMs easily!	64	Established	512	Go
7	mybigday/llama.rn React Native binding of llama.cpp	64	Established	851	C++
8	UbiquitousLearning/mllm Fast Multimodal LLM on Mobile Devices	64	Established	1,429	C++
9	bentoml/OpenLLM Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible...	63	Established	12,161	Python
10	mostlygeek/llama-swap Reliable model swapping for any local OpenAI/Anthropic compatible server -...	62	Established	2,775	Go
11	mukel/llama3.java Practical Llama 3 inference in Java	59	Established	800	Java
12	floneum/floneum Instant, controllable, local pre-trained AI models in Rust	57	Established	2,153	Rust
13	Michael-A-Kuykendall/shimmy ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF +...	57	Established	3,793	Rust
14	sgl-project/ome Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU...	56	Established	393	Go
15	tattn/LocalLLMClient Swift package to run local LLMs on iOS, macOS, Linux	56	Established	168	Swift
16	Mobile-Artificial-Intelligence/llama_sdk lcpp is a dart implementation of llama.cpp used by the mobile artificial...	56	Established	115	C++
17	guinmoon/LLMFarm llama and other large language models on iOS and MacOS offline using GGML library.	55	Established	1,994	C
18	tairov/llama2.mojo Inference Llama 2 in one file of pure 🔥	54	Established	2,119	Mojo
19	guinmoon/llmfarm_core.swift Swift library to work with llama and other large language models.	54	Established	278	C++
20	nekomeowww/ollama-operator 🚢 Yet another operator for running large language models on Kubernetes with...	52	Established	234	Go
21	Strvm/meta-ai-api Llama 3 API 70B & 405B (MetaAI Reverse Engineered)	50	Established	396	Python
22	camenduru/text-generation-webui-colab A colab gradio web UI for running Large Language Models	50	Established	2,093	Jupyter Notebook
23	sammcj/ingest Parse files (e.g. code repos) and websites to clipboard or a file for...	50	Established	367	Go
24	BeRo1985/pasllm PasLLM - LLM inference engine in Object Pascal (synced from my private work...	50	Established	76	Pascal
25	tjake/Jlama Jlama is a modern LLM inference engine for Java	49	Emerging	1,259	Java
26	ngxson/wllama WebAssembly binding for llama.cpp - Enabling on-browser LLM inference	49	Emerging	1,013	TypeScript
27	yoshoku/llama_cpp.rb llama_cpp.rb provides Ruby bindings for llama.cpp	49	Emerging	232	C
28	jmont-dev/ollama-hpp Modern, Header-only C++ bindings for the Ollama API.	48	Emerging	213	C++
29	fboulnois/llama-cpp-docker Run llama.cpp in a GPU accelerated Docker container	48	Emerging	63	Dockerfile
30	cheahjs/free-llm-api-resources A list of free LLM inference resources accessible via API.	48	Emerging	15,475	Python
31	local-ai-zone/local-ai-zone.github.io Discover the Best AI Models for Your PC	48	Emerging	20	HTML
32	soulteary/docker-llama2-chat Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...	47	Emerging	538	Python
33	LM-Kit/lm-kit-net-samples .NET samples for LM-Kit.NET	47	Emerging	38	C#
34	mirpo/fastapi-gen Build LLM-enabled FastAPI applications without build configuration.	47	Emerging	11	Python
35	belladoreai/llama3-tokenizer-js JS tokenizer for LLaMA 3 and LLaMA 3.1	46	Emerging	117	JavaScript
36	mfoud444/ollamafreeapi OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our...	46	Emerging	101	Python
37	nova-land/gbnf-compiler Plug n Play GBNF Compiler for llama.cpp	46	Emerging	28	Python
38	sobelio/llm-chain `llm-chain` is a powerful rust crate for building chains in large language...	45	Emerging	1,593	Rust
39	donderom/llm4s Scala 3 bindings for llama.cpp 🦙	45	Emerging	65	Scala
40	hybridgroup/yzma Go with your own intelligence - Go applications that directly integrate...	44	Emerging	350	Go
41	mdrokz/rust-llama.cpp LLama.cpp rust bindings	44	Emerging	416	Rust
42	absadiki/pyllamacpp Python bindings for llama.cpp	44	Emerging	68	C++
43	iaalm/llama-api-server A OpenAI API compatible REST server for llama.	44	Emerging	209	Python
44	openjlc/riscv64-library Some of the libraries (docs) on the RISCV64 architecture are easy for users...	43	Emerging	69	—
45	Archimedes1618/Madlab Madlab is an advanced AI development studio designed to streamline the...	43	Emerging	11	TypeScript
46	nerve-sparks/iris_android IRIS is an android app for interfacing with GGUF / llama.cpp models locally.	43	Emerging	267	Kotlin
47	phronmophobic/llama.clj Run LLMs locally. A clojure wrapper for llama.cpp.	42	Emerging	173	Clojure
48	KolosalAI/kolosal-server Kolosal AI is an OpenSource and Lightweight alternative to Ollama to run...	42	Emerging	13	C++
49	withcaer/curtana Simplified zero-cost wrapper over llama.cpp powered by the lama-cpp-2 Crate.	42	Emerging	2	Rust
50	gitctrlx/llama.go Llama from scratch in Go.	42	Emerging	104	Go
51	LLukas22/llm-rs-python Unofficial python bindings for the rust llm library. 🐍❤️🦀	42	Emerging	76	Python
52	gotzmann/llama.go llama.go is like llama.cpp in pure Golang!	42	Emerging	1,398	Go
53	nuhmanpk/quick-llama Run Ollama models on Google Colab	42	Emerging	4	Python
54	eugenehp/bitnet-cpp-rs Rust bindings for bitnet.cpp based on llama-cpp-4	41	Emerging	15	Rust
55	KolosalAI/Kolosal Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run...	41	Emerging	440	C++
56	fardjad/node-llmatic Use self-hosted LLMs with an OpenAI compatible API	41	Emerging	64	TypeScript
57	developer239/llama.cpp-ts llama.cpp 🦙 LLM inference in TypeScript	40	Emerging	3	C++
58	trrahul/llama2.cs Inference Llama 2 in one file of pure C#	40	Emerging	102	C#
59	iverly/llamafile-docker Distribute and run llamafile/LLMs with a single docker image.	40	Emerging	74	Dockerfile
60	KolosalAI/kolosal-cli Super lightweight Ollama + Qwen Code alternative to run Llama 3.3,...	40	Emerging	466	TypeScript
61	loong64/llama.cpp LLM inference in C/C++	40	Emerging	3	—
62	cgbur/llama2.zig Inference Llama 2 in one file of pure Zig	40	Emerging	211	Zig
63	mybigday/llama.node Node.js binding of llama.cpp	39	Emerging	19	C++
64	hpretila/llama.net .NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙	39	Emerging	58	C#
65	belladoreai/llama-tokenizer-js JS tokenizer for LLaMA 1 and 2	39	Emerging	363	JavaScript
66	mdegans/drama_llama Yet another `llama.cpp` Rust wrapper	39	Emerging	12	Rust
67	loong64/ollama Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other...	39	Emerging	9	Dockerfile
68	BodhiSearch/BodhiApp Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs	38	Emerging	132	Rust
69	jaco-bro/MLX.zig MLX.zig: Phi-4, Llama 3.2, and Whisper in Zig	38	Emerging	33	Zig
70	amin-tehrani/ollama-colab Serve Ollama LLMs on Google Colab (free plan) using Ngrok	38	Emerging	26	Jupyter Notebook
71	cocktailpeanut/dalai The simplest way to run LLaMA on your local machine	38	Emerging	12,980	CSS
72	Kagamma/llama-pas Free Pascal bindings for llama.cpp	38	Emerging	23	Pascal
73	anthonyfoust/ai-stack-homelab Complete AI automation stack optimized for Mac Mini M4, but can work in...	38	Emerging	7	Shell
74	K024/llm-sharp Language models in C#	37	Emerging	50	C#
75	Agora-Lab-AI/Atom a suite of finetuned LLMs for atomically precise function calling 🧪	37	Emerging	17	Python
76	dirmacs/lancor A Rust client library for llama.cpp's OpenAI-compatible API server	37	Emerging	2	Rust
77	SeungyounShin/Llama2-Code-Interpreter Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet	37	Emerging	685	Python
78	adalkiran/llama-nuts-and-bolts A holistic way of understanding how Llama and its components run in...	37	Emerging	317	Go
79	FlatlinerDOA/PerceptivePyro Run and train Transformer based Large Language Models (LLMS) natively in...	37	Emerging	24	C#
80	Thrasher-Software/sigil A local-first LLM development studio. Build, test, and customize inference...	37	Emerging	17	CSS
81	openshieldai/openshield OpenShield is a new generation security layer for AI models	37	Emerging	84	Go
82	c0sogi/llama-api An OpenAI-like LLaMA inference API	36	Emerging	113	Python
83	trzy/llava-cpp-server LLaVA server (llama.cpp).	36	Emerging	183	C++
84	sashazykov/json-repair-rb A simple Ruby gem designed to repair broken JSON strings	36	Emerging	10	Ruby
85	dravenk/ollama-zig Ollama Zig library	36	Emerging	35	Zig
86	Aloereed/llama.cpp-server-ohos Llama.cpp server for OpenHarmony	36	Emerging	9	C++
87	diogok/llama.cpp.zig A build.zig for llama.cpp	36	Emerging	1	Zig
88	dev-sufyaan/Nexlify Unified API platform for free access to enterprise-grade AI models from...	36	Emerging	13	Python
89	abhisheknair10/llama3.cu Lightweight Llama 3 8B Inference Engine in CUDA C	36	Emerging	54	Cuda
90	nikolaydubina/llama2.go LLaMA-2 in native Go	35	Emerging	194	Go
91	hoof-ai/hoof "Just hoof it!" - A spotlight like interface to Ollama	35	Emerging	63	Rust
92	leftmove/cria Run LLMs locally with as little friction as possible.	35	Emerging	121	Python
93	saddam213/LLamaStack ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp	35	Emerging	60	C#
94	fermyon/ai-examples A collection of serverless apps that show how Fermyon's Serverless AI...	34	Emerging	50	Rust
95	chelsea0x3b/llama-dfdx LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!	34	Emerging	111	Rust
96	5aharsh/collama Run Ollama LLM models in Google Colab for free	34	Emerging	38	Jupyter Notebook
97	OneInterface/realtime-bakllava llama.cpp with BakLLaVA model describes what does it see	34	Emerging	379	Python
98	moritztng/fltr Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.	34	Emerging	387	Rust
99	daskol/llama.py Python bindings to llama.cpp	33	Emerging	27	C
100	zatevakhin/obsidian-local-llm Obsidian Local LLM is a plugin for Obsidian that provides access to a...	33	Emerging	135	TypeScript
101	cgjosephlee/ollama-save-load Save and load ollama models just like operating docker images.	33	Emerging	26	Python
102	AlenVelocity/langchain-llama Run LLAMA LLMs in Node with Langchain	33	Emerging	39	TypeScript
103	benct/kotlin-cheat-sheet :star: Kotlin <3 Cheat Sheet, Collection Extension Functions and General Examples	33	Emerging	7	—
104	maifeeulasad/LocalLLaMA 📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA	33	Emerging	8	TypeScript
105	Uralstech/vid-orca Deploy LLaMA-2 Chat on Google Cloud.	33	Emerging	4	Python
106	andreiramani/jadi4llamacpp Just another drop in for llama.cpp	32	Emerging	1	—
107	codewithdark-git/llama-3-Hackathon LLaMA Genius is an AI-powered research assistant designed to help users...	32	Emerging	1	Python
108	BerkeliumLabs/Berkelium-labs Your personal AI Lab, accessible everywhere! Explore, experiment, and...	32	Emerging	2	TypeScript
109	rabilrbl/llamafile-builder A simple github actions script to build a llamafile and uploads to huggingface	32	Emerging	17	Python
110	zTgx/llmweb-rs Webpage to structured data in Rust & LLM	32	Emerging	16	Rust
111	didier-durand/llms-in-clouds Experiments with LLMs in clouds (powered by SGLang)	32	Emerging	6	Python
112	alvion427/PerroPastor Run Llama based LLMs in Unity entirely in compute shaders with no dependencies	32	Emerging	106	C#
113	avatsaev/av-local-llm-api Allows to easily run local REST API with a custom LLM, running locally or...	31	Emerging	4	Python
114	makllama/makllama MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...	31	Emerging	45	Go
115	lrusso/llama3pure Three inference engines for Llama 3: pure C for desktop systems, pure...	31	Emerging	21	HTML
116	yfedoseev/llmkit Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,...	31	Emerging	12	Rust
117	RahulSChand/llama2.c-for-dummies Step by step explanation/tutorial of llama2.c	31	Emerging	225	C
118	khiwniti/kaggle-llm-api 🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle...	31	Emerging	2	Python
119	kurnevsky/llama-cpp.el A client for llama-cpp server	30	Emerging	28	Emacs Lisp
120	tunib-ai/joker AI model designed to test the effectiveness in handling external ethical attacks.	30	Emerging	11	Python
121	seanpm2001/DALL-E_LLaMA 🤖️🦙️🧠️ DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language...	30	Emerging	2	Python
122	seanpm2001/DALL-E_LLaMA_Docs 🤖️🦙️🧠️📖️ The official documentation source repository for DALL-E LLaMA, a...	30	Emerging	2	Markdown
123	UgurkanTech/ArchNetAI ArchNetAI is a Python library that leverages the Ollama API for generating...	29	Experimental	3	Python
124	Adriankhl/godot-llm-template Godot LLM Template/Demo	29	Experimental	32	GDScript
125	kassane/ollama-d D bindings for the Ollama API	29	Experimental	3	D
126	johnsutor/llama-jarvis Turn any LLM into Jarvis	29	Experimental	6	Python
127	fuglede/llama.ttf A font for writing tiny stories	28	Experimental	319	Rust
128	hurui200320/llama-cpp-kt The Kotlin wrapper of llama.cpp, powered by JNA	27	Experimental	13	Kotlin
129	LastBotInc/llama2j Pure Java Llama2 inference with optional multi-GPU CUDA implementation	27	Experimental	13	Java
130	haormj/llama2.go Inference Llama 2 in one file of pure go	27	Experimental	16	Go
131	fbaldassarri/llama-cpp-container Docker image to deploy a llama-cpp container with conda-ready environments	27	Experimental	17	Dockerfile
132	frinknet/gelli Containerized LLM for any use-case big or small	26	Experimental	1	Shell
133	ksylvest/omniai-llama An implementation of the OmniAI interface for Llama.	26	Experimental	1	Ruby
134	anglerfishlyy/llm-watch-grafana AI observability Grafana plugin tracking real-time LLM metrics — latency,...	26	Experimental	1	JavaScript
135	lwch/llama2.go Port of Facebook's LLaMA 2 model in pure go and use little memory	26	Experimental	36	Go
136	mkagenius/llm-token-visualizer See How Big Exactly A 128k Token Text Is	25	Experimental	4	TypeScript
137	Brazilian-willametteriver232/llama.swift 🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled...	25	Experimental	2	Swift
138	zerob13/modelinfo-cli A CLI to query AI model capabilities, context limits, and pricing from...	25	Experimental	8	TypeScript
139	PCfVW/plip-rs Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of...	25	Experimental	8	Rust
140	sc0v0ne/udemy_course_mastering_ollama_build_private_local_llm_apps_with_python Udemy Course Mastering Ollama Build Private Local LLM Apps with Python	24	Experimental	3	Python
141	cvedix/omnisdk On-device AI deloper platform	24	Experimental	2	C++
142	mhajder/llama.cpp-updater A shell script to automatically update or build llama.cpp with optimal GPU...	24	Experimental	3	Shell
143	Stoksweet/modlable A platform for building, training and running inference on TensorflowJS...	24	Experimental	1	TypeScript
144	Andrew2077/Alpaca Simple Q/A app, where i created a UI for alpaca (fine tuned LLAMA) model...	24	Experimental	4	Jupyter Notebook
145	ariannamethod/yent.yo diffusion AI with a bad character	23	Experimental	2	Go
146	lenticularis39/llama2.inferno Inference Llama 2 in one file of pure Limbo	23	Experimental	2	Limbo
147	waqasm86/Ubuntu-Cuda-Llama.cpp-Executable Pre-built llama.cpp CUDA binary for Ubuntu 22.04. No compilation required -...	22	Experimental	1	Python
148	bkataru/chatllm.zig Zig wrapper for chatllm.cpp - LLM inference with 70+ model architectures	22	Experimental	1	Zig
149	ChristianHohlfeld/ollama-local-docker Ollama Local Docker - A simple Docker-based setup for running Ollama's API...	22	Experimental	2	HTML
150	austinweis/alpaca.cpp-gui GUI for GGML Alpaca models	22	Experimental	2	HTML
151	Root1V/axonium-sdk A production-grade Python SDK for llama-server that streamlines...	22	Experimental	1	Python
152	RichardHam-co-uk/ProjectLodestar AI development environment with 90% cost savings. Routes between 8 LLM...	22	Experimental	1	Python
153	mrtrizer/UnityLlamaCpp Llama.cpp in Unity, straightforward and clean	22	Experimental	19	C#
154	whyisitworking/llama-bro On-device LLM inference SDK for Android, powered by llama.cpp. Run GGUF...	22	Experimental	—	Kotlin
155	estrify/ProjectLodestar 🌟 Optimize AI development with Lodestar by smartly routing between free...	21	Experimental	—	Python
156	tokenrouter/tokenrouter-python Official Python SDK for TokenRouter - an intelligent LLM routing service...	21	Experimental	—	Python
157	qxoticai/qxotic AI engine for the JVM	21	Experimental	—	Java
158	KolosalAI/kolosal-desktop Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run...	21	Experimental	5	Svelte
159	nathanborror/swift-llama A Swift client library for interacting with Meta's Llama API.	21	Experimental	4	Swift
160	chromejaw/free-llm-api A list of free LLM inference resources accessible via API.	21	Experimental	—	Python
161	ahmedmagood/cpu-slm 🖥️ Explore CPU-SLM, a Rust-based SLM/LLM project that runs on CPU, offering...	21	Experimental	—	Rust
162	veerapatel/llm.nexus 🌐 Streamline integration with various LLM providers using LLM.Nexus, a .NET...	21	Experimental	—	—
163	blackboxprogramming/ai-chain AI Chain — Distributed multi-node LLM inference with automatic failover....	21	Experimental	—	Python
164	wk-y/rama-swap ramalama-based model swapping server	21	Experimental	—	Go
165	nerdsupremacist/LlamaLang Repository for the Llama Programming Language. Work In Progress	21	Experimental	11	Python
166	aratan/ApiCloudLLaMA The idea is to make an api that everyone can consume in their GPT4-like...	21	Experimental	13	Go
167	sak96/rust_llama_app Chat bot (llama) written in rust using Yew and Tauri.	21	Experimental	1	Rust
168	seehiong/micronaut-llama3 A high-performance Llama3 implementation using Micronaut and GraalVM Native Image	21	Experimental	31	Java
169	nininau/awesome-llm-services 🔍 Discover 106+ open-source LLM services and tools for AI, ideal for local...	21	Experimental	—	TypeScript
170	invergent-ai/surogate-website Website for surogate.ai	21	Experimental	—	JavaScript
171	tbogdala/woolyrust A high-level Rust wrapper around llama.cpp for text generation AI with LLMs.	20	Experimental	7	Rust
172	llamajs/llama A dynamic logger for the dynamic developer	20	Experimental	5	TypeScript
173	xxxbf0222/LlamaDeck A command-line tool for quickly managing and experimenting with multiple...	20	Experimental	5	Python
174	secret-ai-labs/awesome-local-llm Your complete guide to running powerful AI models locally in 2025. Covers...	20	Experimental	4	—
175	CameLLM/CameLLM Run your favourite LLMs locally on macOS from Swift	20	Experimental	82	Swift
176	unaidedelf8777/faster-outlines A Lazy, high throughput and blazing fast structured text generation backend.	20	Experimental	5	Rust
177	jihadkhawaja/Llama.Grammar GBNF converter for llama.cpp Grammar directly from C# types	20	Experimental	3	C#
178	tbogdala/woolycore The core wrapper around llama.cpp in C to provide an easy surface to build...	20	Experimental	5	C++
179	themaximalist/ModelDeployer API Proxy for AI models, rate limiting, management and more!	19	Experimental	4	CSS
180	ferranpons/Llamatik-Server Remote inference backend implementing the same API as the Llamatik library...	19	Experimental	—	Kotlin
181	Jshulgach/NeuroBridge NeuroBridge: Where AI perception meets real-time robotics control	19	Experimental	1	Python
182	leaxer-ai/leaxer-llama Pre-built llama.cpp binaries for Leaxer	19	Experimental	—	—
183	Komdosh/kLLaMa-jvm Simple example of using llama.cpp with kotlin (JVM)	19	Experimental	—	C++
184	EZForever/llama.cpp-static Static builds of llama.cpp (Currently only amd64 server builds are available)	19	Experimental	—	Dockerfile
185	lufixSch/auto_llama Supercharge your local LLM	19	Experimental	4	Python
186	JinHanLei/LLM-Stream-Service Streaming API and Web page for Large Language Models (Llama3) based on...	19	Experimental	3	Python
187	TimeSurgeLabs/promptproxy Call many AIs from a single API.	19	Experimental	3	Go
188	gyanaranjans/llma-rust A simple webapp to showcase the ability to write a simple chatbot webapp...	19	Experimental	3	Rust
189	iakashpaul/Ghudsavar Ghudsavar (Horse rider) - Is a quick llama.cpp server for CPU only runtimes	19	Experimental	3	Dockerfile
190	ns408/local-ai-setup Run modern AI models on older laptops - optimized for 2nd-gen Intel hardware	19	Experimental	—	Shell
191	antononcube/Raku-WWW-LLaMA Raku package that provides access to the algorithms/models of (the...	19	Experimental	1	Raku
192	MaoJianwei/llama.cpp-arm-armv7l-Raspberry-Pi-Release-Prebuild On the Releases page, you can download pre-built binaries for arm, armv7l...	19	Experimental	2	—
193	lenML/llama2-tokenizer.js llama2 tokenizer for javascript	18	Experimental	2	TypeScript
194	coderonion/awesome-mojo-max-mlir A collection of some awesome public MAX platform, Mojo programming language...	18	Experimental	41	—
195	JavaLLM/llama4j An easy-to-use Java SDK for running LLaMA models on edge devices, powered by...	18	Experimental	23	Java
196	niansa/libjustlm Super easy to use library for doing LLaMA/GPT-J stuff! - Mirror of:...	18	Experimental	2	C++
197	updcon/libmisc-clj DKD miscellaneous for Clojure development	17	Experimental	1	Clojure
198	NeuralWeights/Llama-Server-AuthKeys Authorization tokens to access llama.cpp server (LM Studio, Ollama, Msty,...	17	Experimental	1	Python
199	asaddi/lv-serve Llama 3.2 Vision OpenAI-like API server	17	Experimental	1	Python
200	shakfu/llamalib Thin cython, pybind11, and nanobind wrappers around llama.cpp	17	Experimental	1	Cython
201	3axislabs/llm4j Build Context Aware LLM Apps using Java	17	Experimental	1	—
202	Inferra/Inferra-Python-SDK Official Python SDK for Inferra API access	17	Experimental	1	Python
203	entelecheia/llama-factory-container Container for LLaMA-Factory	16	Experimental	—	Shell
204	lennor-tan/openrouter-free-model 🌐 Explore and manage free models on OpenRouter effortlessly with our web...	15	Experimental	—	TypeScript
205	harpertoken/memoraxx LLaMA-style models with memory persistence.	15	Experimental	—	—
206	ainize-team/free-llama-api Run Meta Llama 3.2 API without your GPU for free. We always support lastest model 🧡	14	Experimental	15	TypeScript
207	Gaolingx/llama.cpp-Launcher run llama.cpp quickly and conveniently.	14	Experimental	1	Batchfile
208	iakashpaul/Portal Android app for the Hole in your Palm project, making LLMs accessible on-device!	14	Experimental	18	Kotlin
209	miga1999/AirClaw Run OpenClaw locally on any GPU or CPU without API costs, supporting large...	14	Experimental	—	Shell
210	Atsusheeesh/vllm-daily 📊 Summarize merged PRs daily with vLLM, ensuring you stay updated on key...	14	Experimental	—	—
211	botosadam/matryoshka 🚀 Build Ruby gems that utilize Rust for enhanced performance through two...	14	Experimental	—	—
212	kashan-alam/ai-backend-fastapi AI-powered backend API built with FastAPI, JWT authentication, rate...	14	Experimental	1	Python
213	Root1V/llm-security JWT-based authentication and authorization gateway for locally deployed LLM...	14	Experimental	1	Python
214	j-f1/LLM-Playground Play with LLaMA & GPT-3!	13	Experimental	11	Swift
215	eccenca/llama-index-cmem llama-index tools eccenca Corporate Memory Integration	13	Experimental	—	Python
216	diogok/llamautils Some python utilities for running llama.cpp on linux	13	Experimental	—	Python
217	zTgx/llama.rust LLM inference in Rust	13	Experimental	—	Rust
218	georon/llama_test_proj Skeleton project to run and test Llama and Chromadb locally on a gaming...	13	Experimental	—	Python
219	SanMog/Uroboros Automated red-teaming framework for LLMs. Tests GPT-4o, Claude, Llama...	13	Experimental	—	Python
220	juansalnac/API-mega-list 🌐 Discover a comprehensive collection of APIs to enhance your projects and...	13	Experimental	—	JavaScript
221	tbogdala/ai_notepad A lightweight Rust application to test interaction with large language...	13	Experimental	2	Rust
222	Abdullahali77/AI_Testing_CLI A specialized command-line tool that generates Python unit tests for your...	13	Experimental	—	Python
223	NavodPeiris/node_llama run llama models using llamafile and communicate with llama models through...	13	Experimental	—	JavaScript
224	nherx/free-llm-api-resources 🤖 Discover free API access and credits for various legitimate large language...	13	Experimental	—	Python
225	JohnClaw/chatllm.v V-lang api wrapper for llm-inference chatllm.cpp	12	Experimental	7	C
226	CameLLM/CameLLM-Llama Plugin for CameLLM supporting the LLaMA family of models	12	Experimental	6	C
227	JohnClaw/chatllm.cs C# api wrapper for llm-inference chatllm.cpp	12	Experimental	5	C#
228	JohnClaw/chatllm.vb VB.NET api wrapper for llm-inference chatllm.cpp	12	Experimental	5	Visual Basic .NET
229	JohnClaw/chatllm.nim Nim api-wrapper for llm-inference chatllm.cpp	11	Experimental	4	C
230	Namangupta123/AlgoMonitor System Monitoring and Management with AI insights	11	Experimental	1	Python
231	Chelsea486MHz/mobula Cloud deployment of LLAMA AI using DALAI	11	Experimental	—	Python
232	einzig-diego/LLaMA-CPP-Server-Endpoint-API Examples of how to call the REST api endpoint of the C++ port of LLaMA by...	11	Experimental	—	Python
233	leozqin/hops A load-balancing reverse proxy server that enables you to address a fleet of...	11	Experimental	—	Python
234	k-arthik-r/ALSATE An automated log monitoring system that continuously analyzes Linux sys-logs...	11	Experimental	—	Jupyter Notebook
235	jazibjohar/ai-text-structor A powerful asynchronous framework for orchestrating Large Language Model...	11	Experimental	—	Python
236	pAI-OS/fetch_llama_cpp llama.cpp downloader that selects the latest and best available binaries for...	11	Experimental	—	Python
237	m9m9ra/llama.swiftui It`s my playground to test mokpell llama swift lib	11	Experimental	—	C
238	numq/text-generation JVM library for text generation, written in Kotlin and based on the C++...	11	Experimental	—	Kotlin
239	fasuizu-br/brainiall-llm-gateway Brainiall LLM Gateway — 113+ AI models via OpenAI-compatible API. Claude,...	11	Experimental	—	—
240	aruntemme/llamacpp-swap-boilerplate A cross-platform template for running and managing llama-swap with...	11	Experimental	—	Shell
241	JohnClaw/llama-3.2-1b.vb llama 3.2 1b fp16 cpu inference in one file of pure VB.NET	11	Experimental	2	Visual Basic .NET
242	pnsk-lab/CrossLM Transferability for LLM?	11	Experimental	—	TypeScript
243	aryamanshrivastava/Ollama_Base Some gen ai projects	11	Experimental	—	Python
244	Xaxeric/run-ollama-colab A lightweight setup to run Ollama (for local LLMs like LLaMA 3, Mistral,...	11	Experimental	—	Jupyter Notebook
245	JohnClaw/chatllm.rs rust api wrapper for llm-inference chatllm.cpp	11	Experimental	3	Rust
246	tacheometry/rbxts-llama TS typings for the Lua Library for Immutable Data (Llama) by Frelerik	11	Experimental	3	Lua
247	felipemacedo1/hacka-llama-go Hackathon project integrating Llama AI models with Go backend for...	11	Experimental	—	Go
248	akshat2602/Omistral Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other...	11	Experimental	—	Rust
249	AK2k30/Llama-2-Deployment-on-AWS This is an api of LLama-2, so anybody can use this api to use LLama-2 for free.	11	Experimental	—	Jupyter Notebook
250	mattzcarey/llama.js run LLMs (llama, mamba, nemo, mistral) at native speeds from Javascript, Typescript.	11	Experimental	4	TypeScript
251	JohnClaw/chatllm.d D-lang api wrapper for llm-inference chatllm.cpp	10	Experimental	2	D
252	JohnClaw/chatllm.kt kotlin api wrapper for llm-inference chatllm.cpp	10	Experimental	2	C
253	mikesdatawork/101-series A comprehensive collection of the 101 most useful commands for various...	10	Experimental	1	Shell
254	nemmusu/run-llama-server This is a Bash script to automatically launch llama-server, detects...	10	Experimental	1	Shell
255	paulpierre/vllm-docker test Llama-3.2-11B-Vision-Instruct 4-bit quant quickly on an a100 40GB	10	Experimental	2	—
256	jim3692/koboldcpp-flake Run Llama 3.1 directly	10	Experimental	2	Nix
257	JohnClaw/chatllm.lua lua api wrapper for llm-inference chatllm.cpp	10	Experimental	2	Lua

Comparisons in this category

ludwig and OpenLLM (77 vs 63) node-llama-cpp and LLamaSharp (73 vs 65) node-llama-cpp and llama-swap (73 vs 62) LLMFarm and llmfarm_core.swift (55 vs 54) LocalLLMClient and LLMFarm (56 vs 55) node-llama-cpp and llama_sdk (73 vs 56) LocalLLMClient and llmfarm_core.swift (56 vs 54)