Local LLM Deployment Transformer Models

Tools and resources for running, hosting, and serving open-source LLMs locally or on private infrastructure without cloud dependencies. Includes deployment platforms, free API gateways, optimization guides, and access control for self-hosted models. Does NOT include model training, fine-tuning frameworks, or cloud-based LLM services.

This category tracks 257 local LLM deployment projects. Two score above 70 (Verified tier). The highest-rated is ludwig-ai/ludwig at 77/100 with 11,657 stars. Nine of the top 10 are actively maintained.
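The tier labels in the listing below appear to follow fixed score bands. The sketch here infers the cut-offs from the listed scores (50 is Established while 49 is Emerging, and 30 is Emerging while 29 is Experimental). These thresholds are assumptions drawn from the data, not published rules, and the function name `tier_for` is hypothetical. No project scores exactly 70, so treating 70 as Verified is a guess.

```python
def tier_for(score: int) -> str:
    """Map a 0-100 quality score to a tier label.

    Thresholds are inferred from the listing, not taken from any
    documented scoring rule. The bound at 70 is assumed inclusive.
    """
    if score >= 70:
        return "Verified"
    if score >= 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Spot checks against rows that appear in the listing.
for sample in (77, 67, 50, 49, 30, 29, 10):
    print(sample, tier_for(sample))
```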

Get all 257 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=local-llm-deployment&limit=20"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
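The same request can be made from Python with only the standard library. This is a minimal sketch: the endpoint and query parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response schema is not documented on this page, so the code returns the decoded JSON as-is and makes no assumptions about its fields. Whether raising `limit` returns all 257 projects in one call is also an assumption to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="local-llm-deployment", limit=20):
    """Build the query URL with the same parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON payload (structure not documented here)."""
    request = Request(build_url(limit=limit), headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one network request and counts against the daily quota, so results are worth caching locally when iterating.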

# Model Score Tier
1 ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

77
Verified
2 withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp....

73
Verified
3 mudler/LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others....

67
Established
4 zhudotexe/kani

kani (カニ) is a highly hackable microframework for tool-calling language...

66
Established
5 SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

65
Established
6 kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

64
Established
7 mybigday/llama.rn

React Native binding of llama.cpp

64
Established
8 UbiquitousLearning/mllm

Fast Multimodal LLM on Mobile Devices

64
Established
9 bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible...

63
Established
10 mostlygeek/llama-swap

Reliable model swapping for any local OpenAI/Anthropic compatible server -...

62
Established
11 mukel/llama3.java

Practical Llama 3 inference in Java

59
Established
12 floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

57
Established
13 Michael-A-Kuykendall/shimmy

⚡ Python-free Rust inference server - OpenAI-API compatible. GGUF +...

57
Established
14 sgl-project/ome

Open Model Engine (OME) - Kubernetes operator for LLM serving, GPU...

56
Established
15 tattn/LocalLLMClient

Swift package to run local LLMs on iOS, macOS, Linux

56
Established
16 Mobile-Artificial-Intelligence/llama_sdk

lcpp is a dart implementation of llama.cpp used by the mobile artificial...

56
Established
17 guinmoon/LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

55
Established
18 tairov/llama2.mojo

Inference Llama 2 in one file of pure 🔥

54
Established
19 guinmoon/llmfarm_core.swift

Swift library to work with llama and other large language models.

54
Established
20 nekomeowww/ollama-operator

🚒 Yet another operator for running large language models on Kubernetes with...

52
Established
21 Strvm/meta-ai-api

Llama 3 API 70B & 405B (MetaAI Reverse Engineered)

50
Established
22 camenduru/text-generation-webui-colab

A colab gradio web UI for running Large Language Models

50
Established
23 sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for...

50
Established
24 BeRo1985/pasllm

PasLLM - LLM inference engine in Object Pascal (synced from my private work...

50
Established
25 tjake/Jlama

Jlama is a modern LLM inference engine for Java

49
Emerging
26 ngxson/wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

49
Emerging
27 yoshoku/llama_cpp.rb

llama_cpp.rb provides Ruby bindings for llama.cpp

49
Emerging
28 jmont-dev/ollama-hpp

Modern, Header-only C++ bindings for the Ollama API.

48
Emerging
29 fboulnois/llama-cpp-docker

Run llama.cpp in a GPU accelerated Docker container

48
Emerging
30 cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

48
Emerging
31 local-ai-zone/local-ai-zone.github.io

Discover the Best AI Models for Your PC

48
Emerging
32 soulteary/docker-llama2-chat

Play LLaMA2 (official / Chinese version / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...

47
Emerging
33 LM-Kit/lm-kit-net-samples

.NET samples for LM-Kit.NET

47
Emerging
34 mirpo/fastapi-gen

Build LLM-enabled FastAPI applications without build configuration.

47
Emerging
35 belladoreai/llama3-tokenizer-js

JS tokenizer for LLaMA 3 and LLaMA 3.1

46
Emerging
36 mfoud444/ollamafreeapi

OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our...

46
Emerging
37 nova-land/gbnf-compiler

Plug n Play GBNF Compiler for llama.cpp

46
Emerging
38 sobelio/llm-chain

`llm-chain` is a powerful rust crate for building chains in large language...

45
Emerging
39 donderom/llm4s

Scala 3 bindings for llama.cpp 🦙

45
Emerging
40 hybridgroup/yzma

Go with your own intelligence - Go applications that directly integrate...

44
Emerging
41 mdrokz/rust-llama.cpp

LLama.cpp rust bindings

44
Emerging
42 absadiki/pyllamacpp

Python bindings for llama.cpp

44
Emerging
43 iaalm/llama-api-server

An OpenAI API-compatible REST server for llama.

44
Emerging
44 openjlc/riscv64-library

Some of the libraries (docs) on the RISCV64 architecture are easy for users...

43
Emerging
45 Archimedes1618/Madlab

Madlab is an advanced AI development studio designed to streamline the...

43
Emerging
46 nerve-sparks/iris_android

IRIS is an android app for interfacing with GGUF / llama.cpp models locally.

43
Emerging
47 phronmophobic/llama.clj

Run LLMs locally. A clojure wrapper for llama.cpp.

42
Emerging
48 KolosalAI/kolosal-server

Kolosal AI is an OpenSource and Lightweight alternative to Ollama to run...

42
Emerging
49 withcaer/curtana

Simplified zero-cost wrapper over llama.cpp powered by the llama-cpp-2 crate.

42
Emerging
50 gitctrlx/llama.go

Llama from scratch in Go.

42
Emerging
51 LLukas22/llm-rs-python

Unofficial python bindings for the rust llm library. 🐍❤️🦀

42
Emerging
52 gotzmann/llama.go

llama.go is like llama.cpp in pure Golang!

42
Emerging
53 nuhmanpk/quick-llama

Run Ollama models on Google Colab

42
Emerging
54 eugenehp/bitnet-cpp-rs

Rust bindings for bitnet.cpp based on llama-cpp-4

41
Emerging
55 KolosalAI/Kolosal

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run...

41
Emerging
56 fardjad/node-llmatic

Use self-hosted LLMs with an OpenAI compatible API

41
Emerging
57 developer239/llama.cpp-ts

llama.cpp 🦙 LLM inference in TypeScript

40
Emerging
58 trrahul/llama2.cs

Inference Llama 2 in one file of pure C#

40
Emerging
59 iverly/llamafile-docker

Distribute and run llamafile/LLMs with a single docker image.

40
Emerging
60 KolosalAI/kolosal-cli

Super lightweight Ollama + Qwen Code alternative to run Llama 3.3,...

40
Emerging
61 loong64/llama.cpp

LLM inference in C/C++

40
Emerging
62 cgbur/llama2.zig

Inference Llama 2 in one file of pure Zig

40
Emerging
63 mybigday/llama.node

Node.js binding of llama.cpp

39
Emerging
64 hpretila/llama.net

.NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙

39
Emerging
65 belladoreai/llama-tokenizer-js

JS tokenizer for LLaMA 1 and 2

39
Emerging
66 mdegans/drama_llama

Yet another `llama.cpp` Rust wrapper

39
Emerging
67 loong64/ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other...

39
Emerging
68 BodhiSearch/BodhiApp

Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs

38
Emerging
69 jaco-bro/MLX.zig

MLX.zig: Phi-4, Llama 3.2, and Whisper in Zig

38
Emerging
70 amin-tehrani/ollama-colab

Serve Ollama LLMs on Google Colab (free plan) using Ngrok

38
Emerging
71 cocktailpeanut/dalai

The simplest way to run LLaMA on your local machine

38
Emerging
72 Kagamma/llama-pas

Free Pascal bindings for llama.cpp

38
Emerging
73 anthonyfoust/ai-stack-homelab

Complete AI automation stack optimized for Mac Mini M4, but can work in...

38
Emerging
74 K024/llm-sharp

Language models in C#

37
Emerging
75 Agora-Lab-AI/Atom

a suite of finetuned LLMs for atomically precise function calling 🧪

37
Emerging
76 dirmacs/lancor

A Rust client library for llama.cpp's OpenAI-compatible API server

37
Emerging
77 SeungyounShin/Llama2-Code-Interpreter

Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet

37
Emerging
78 adalkiran/llama-nuts-and-bolts

A holistic way of understanding how Llama and its components run in...

37
Emerging
79 FlatlinerDOA/PerceptivePyro

Run and train Transformer based Large Language Models (LLMS) natively in...

37
Emerging
80 Thrasher-Software/sigil

A local-first LLM development studio. Build, test, and customize inference...

37
Emerging
81 openshieldai/openshield

OpenShield is a new generation security layer for AI models

37
Emerging
82 c0sogi/llama-api

An OpenAI-like LLaMA inference API

36
Emerging
83 trzy/llava-cpp-server

LLaVA server (llama.cpp).

36
Emerging
84 sashazykov/json-repair-rb

A simple Ruby gem designed to repair broken JSON strings

36
Emerging
85 dravenk/ollama-zig

Ollama Zig library

36
Emerging
86 Aloereed/llama.cpp-server-ohos

Llama.cpp server for OpenHarmony

36
Emerging
87 diogok/llama.cpp.zig

A build.zig for llama.cpp

36
Emerging
88 dev-sufyaan/Nexlify

Unified API platform for free access to enterprise-grade AI models from...

36
Emerging
89 abhisheknair10/llama3.cu

Lightweight Llama 3 8B Inference Engine in CUDA C

36
Emerging
90 nikolaydubina/llama2.go

LLaMA-2 in native Go

35
Emerging
91 hoof-ai/hoof

"Just hoof it!" - A spotlight like interface to Ollama

35
Emerging
92 leftmove/cria

Run LLMs locally with as little friction as possible.

35
Emerging
93 saddam213/LLamaStack

ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp

35
Emerging
94 fermyon/ai-examples

A collection of serverless apps that show how Fermyon's Serverless AI...

34
Emerging
95 chelsea0x3b/llama-dfdx

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

34
Emerging
96 5aharsh/collama

Run Ollama LLM models in Google Colab for free

34
Emerging
97 OneInterface/realtime-bakllava

llama.cpp with the BakLLaVA model describes what it sees

34
Emerging
98 moritztng/fltr

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

34
Emerging
99 daskol/llama.py

Python bindings to llama.cpp

33
Emerging
100 zatevakhin/obsidian-local-llm

Obsidian Local LLM is a plugin for Obsidian that provides access to a...

33
Emerging
101 cgjosephlee/ollama-save-load

Save and load ollama models just like operating docker images.

33
Emerging
102 AlenVelocity/langchain-llama

Run LLAMA LLMs in Node with Langchain

33
Emerging
103 benct/kotlin-cheat-sheet

⭐ Kotlin <3 Cheat Sheet, Collection Extension Functions and General Examples

33
Emerging
104 maifeeulasad/LocalLLaMA

📚 LocalLLaMA Archive - Community-powered static archive for r/LocalLLaMA

33
Emerging
105 Uralstech/vid-orca

Deploy LLaMA-2 Chat on Google Cloud.

33
Emerging
106 andreiramani/jadi4llamacpp

Just another drop in for llama.cpp

32
Emerging
107 codewithdark-git/llama-3-Hackathon

LLaMA Genius is an AI-powered research assistant designed to help users...

32
Emerging
108 BerkeliumLabs/Berkelium-labs

Your personal AI Lab, accessible everywhere! Explore, experiment, and...

32
Emerging
109 rabilrbl/llamafile-builder

A simple GitHub Actions script that builds a llamafile and uploads it to Hugging Face

32
Emerging
110 zTgx/llmweb-rs

Webpage to structured data in Rust & LLM

32
Emerging
111 didier-durand/llms-in-clouds

Experiments with LLMs in clouds (powered by SGLang)

32
Emerging
112 alvion427/PerroPastor

Run Llama based LLMs in Unity entirely in compute shaders with no dependencies

32
Emerging
113 avatsaev/av-local-llm-api

Allows to easily run local REST API with a custom LLM, running locally or...

31
Emerging
114 makllama/makllama

MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...

31
Emerging
115 lrusso/llama3pure

Three inference engines for Llama 3: pure C for desktop systems, pure...

31
Emerging
116 yfedoseev/llmkit

Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,...

31
Emerging
117 RahulSChand/llama2.c-for-dummies

Step by step explanation/tutorial of llama2.c

31
Emerging
118 khiwniti/kaggle-llm-api

🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle...

31
Emerging
119 kurnevsky/llama-cpp.el

A client for llama-cpp server

30
Emerging
120 tunib-ai/joker

AI model designed to test the effectiveness in handling external ethical attacks.

30
Emerging
121 seanpm2001/DALL-E_LLaMA

🤖🦙🧠 DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language...

30
Emerging
122 seanpm2001/DALL-E_LLaMA_Docs

🤖🦙🧠📖 The official documentation source repository for DALL-E LLaMA, a...

30
Emerging
123 UgurkanTech/ArchNetAI

ArchNetAI is a Python library that leverages the Ollama API for generating...

29
Experimental
124 Adriankhl/godot-llm-template

Godot LLM Template/Demo

29
Experimental
125 kassane/ollama-d

D bindings for the Ollama API

29
Experimental
126 johnsutor/llama-jarvis

Turn any LLM into Jarvis

29
Experimental
127 fuglede/llama.ttf

A font for writing tiny stories

28
Experimental
128 hurui200320/llama-cpp-kt

The Kotlin wrapper of llama.cpp, powered by JNA

27
Experimental
129 LastBotInc/llama2j

Pure Java Llama2 inference with optional multi-GPU CUDA implementation

27
Experimental
130 haormj/llama2.go

Inference Llama 2 in one file of pure go

27
Experimental
131 fbaldassarri/llama-cpp-container

Docker image to deploy a llama-cpp container with conda-ready environments

27
Experimental
132 frinknet/gelli

Containerized LLM for any use-case big or small

26
Experimental
133 ksylvest/omniai-llama

An implementation of the OmniAI interface for Llama.

26
Experimental
134 anglerfishlyy/llm-watch-grafana

AI observability Grafana plugin tracking real-time LLM metrics - latency,...

26
Experimental
135 lwch/llama2.go

Port of Facebook's LLaMA 2 model in pure Go that uses little memory

26
Experimental
136 mkagenius/llm-token-visualizer

See How Big Exactly A 128k Token Text Is

25
Experimental
137 Brazilian-willametteriver232/llama.swift

🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled...

25
Experimental
138 zerob13/modelinfo-cli

A CLI to query AI model capabilities, context limits, and pricing from...

25
Experimental
139 PCfVW/plip-rs

Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of...

25
Experimental
140 sc0v0ne/udemy_course_mastering_ollama_build_private_local_llm_apps_with_python

Udemy Course Mastering Ollama Build Private Local LLM Apps with Python

24
Experimental
141 cvedix/omnisdk

On-device AI developer platform

24
Experimental
142 mhajder/llama.cpp-updater

A shell script to automatically update or build llama.cpp with optimal GPU...

24
Experimental
143 Stoksweet/modlable

A platform for building, training and running inference on TensorflowJS...

24
Experimental
144 Andrew2077/Alpaca

Simple Q/A app, where i created a UI for alpaca (fine tuned LLAMA) model...

24
Experimental
145 ariannamethod/yent.yo

diffusion AI with a bad character

23
Experimental
146 lenticularis39/llama2.inferno

Inference Llama 2 in one file of pure Limbo

23
Experimental
147 waqasm86/Ubuntu-Cuda-Llama.cpp-Executable

Pre-built llama.cpp CUDA binary for Ubuntu 22.04. No compilation required -...

22
Experimental
148 bkataru/chatllm.zig

Zig wrapper for chatllm.cpp - LLM inference with 70+ model architectures

22
Experimental
149 ChristianHohlfeld/ollama-local-docker

Ollama Local Docker - A simple Docker-based setup for running Ollama's API...

22
Experimental
150 austinweis/alpaca.cpp-gui

GUI for GGML Alpaca models

22
Experimental
151 Root1V/axonium-sdk

A production-grade Python SDK for llama-server that streamlines...

22
Experimental
152 RichardHam-co-uk/ProjectLodestar

AI development environment with 90% cost savings. Routes between 8 LLM...

22
Experimental
153 mrtrizer/UnityLlamaCpp

Llama.cpp in Unity, straightforward and clean

22
Experimental
154 whyisitworking/llama-bro

On-device LLM inference SDK for Android, powered by llama.cpp. Run GGUF...

22
Experimental
155 estrify/ProjectLodestar

🌟 Optimize AI development with Lodestar by smartly routing between free...

21
Experimental
156 tokenrouter/tokenrouter-python

Official Python SDK for TokenRouter - an intelligent LLM routing service...

21
Experimental
157 qxoticai/qxotic

AI engine for the JVM

21
Experimental
158 KolosalAI/kolosal-desktop

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run...

21
Experimental
159 nathanborror/swift-llama

A Swift client library for interacting with Meta's Llama API.

21
Experimental
160 chromejaw/free-llm-api

A list of free LLM inference resources accessible via API.

21
Experimental
161 ahmedmagood/cpu-slm

🖥️ Explore CPU-SLM, a Rust-based SLM/LLM project that runs on CPU, offering...

21
Experimental
162 veerapatel/llm.nexus

🌐 Streamline integration with various LLM providers using LLM.Nexus, a .NET...

21
Experimental
163 blackboxprogramming/ai-chain

AI Chain - Distributed multi-node LLM inference with automatic failover....

21
Experimental
164 wk-y/rama-swap

ramalama-based model swapping server

21
Experimental
165 nerdsupremacist/LlamaLang

Repository for the Llama Programming Language. Work In Progress

21
Experimental
166 aratan/ApiCloudLLaMA

The idea is to make an api that everyone can consume in their GPT4-like...

21
Experimental
167 sak96/rust_llama_app

Chat bot (llama) written in rust using Yew and Tauri.

21
Experimental
168 seehiong/micronaut-llama3

A high-performance Llama3 implementation using Micronaut and GraalVM Native Image

21
Experimental
169 nininau/awesome-llm-services

Discover 106+ open-source LLM services and tools for AI, ideal for local...

21
Experimental
170 invergent-ai/surogate-website

Website for surogate.ai

21
Experimental
171 tbogdala/woolyrust

A high-level Rust wrapper around llama.cpp for text generation AI with LLMs.

20
Experimental
172 llamajs/llama

A dynamic logger for the dynamic developer

20
Experimental
173 xxxbf0222/LlamaDeck

A command-line tool for quickly managing and experimenting with multiple...

20
Experimental
174 secret-ai-labs/awesome-local-llm

Your complete guide to running powerful AI models locally in 2025. Covers...

20
Experimental
175 CameLLM/CameLLM

Run your favourite LLMs locally on macOS from Swift

20
Experimental
176 unaidedelf8777/faster-outlines

A Lazy, high throughput and blazing fast structured text generation backend.

20
Experimental
177 jihadkhawaja/Llama.Grammar

GBNF converter for llama.cpp Grammar directly from C# types

20
Experimental
178 tbogdala/woolycore

The core wrapper around llama.cpp in C to provide an easy surface to build...

20
Experimental
179 themaximalist/ModelDeployer

API Proxy for AI models, rate limiting, management and more!

19
Experimental
180 ferranpons/Llamatik-Server

Remote inference backend implementing the same API as the Llamatik library...

19
Experimental
181 Jshulgach/NeuroBridge

NeuroBridge: Where AI perception meets real-time robotics control

19
Experimental
182 leaxer-ai/leaxer-llama

Pre-built llama.cpp binaries for Leaxer

19
Experimental
183 Komdosh/kLLaMa-jvm

Simple example of using llama.cpp with kotlin (JVM)

19
Experimental
184 EZForever/llama.cpp-static

Static builds of llama.cpp (Currently only amd64 server builds are available)

19
Experimental
185 lufixSch/auto_llama

Supercharge your local LLM

19
Experimental
186 JinHanLei/LLM-Stream-Service

Streaming API and Web page for Large Language Models (Llama3) based on...

19
Experimental
187 TimeSurgeLabs/promptproxy

Call many AIs from a single API.

19
Experimental
188 gyanaranjans/llma-rust

A simple webapp to showcase the ability to write a simple chatbot webapp...

19
Experimental
189 iakashpaul/Ghudsavar

Ghudsavar (Horse rider) - Is a quick llama.cpp server for CPU only runtimes

19
Experimental
190 ns408/local-ai-setup

Run modern AI models on older laptops - optimized for 2nd-gen Intel hardware

19
Experimental
191 antononcube/Raku-WWW-LLaMA

Raku package that provides access to the algorithms/models of (the...

19
Experimental
192 MaoJianwei/llama.cpp-arm-armv7l-Raspberry-Pi-Release-Prebuild

On the Releases page, you can download pre-built binaries for arm, armv7l...

19
Experimental
193 lenML/llama2-tokenizer.js

llama2 tokenizer for javascript

18
Experimental
194 coderonion/awesome-mojo-max-mlir

A collection of some awesome public MAX platform, Mojo programming language...

18
Experimental
195 JavaLLM/llama4j

An easy-to-use Java SDK for running LLaMA models on edge devices, powered by...

18
Experimental
196 niansa/libjustlm

Super easy to use library for doing LLaMA/GPT-J stuff! - Mirror of:...

18
Experimental
197 updcon/libmisc-clj

DKD miscellaneous for Clojure development

17
Experimental
198 NeuralWeights/Llama-Server-AuthKeys

Authorization tokens to access llama.cpp server (LM Studio, Ollama, Msty,...

17
Experimental
199 asaddi/lv-serve

Llama 3.2 Vision OpenAI-like API server

17
Experimental
200 shakfu/llamalib

Thin cython, pybind11, and nanobind wrappers around llama.cpp

17
Experimental
201 3axislabs/llm4j

Build Context Aware LLM Apps using Java

17
Experimental
202 Inferra/Inferra-Python-SDK

Official Python SDK for Inferra API access

17
Experimental
203 entelecheia/llama-factory-container

Container for LLaMA-Factory

16
Experimental
204 lennor-tan/openrouter-free-model

🌐 Explore and manage free models on OpenRouter effortlessly with our web...

15
Experimental
205 harpertoken/memoraxx

LLaMA-style models with memory persistence.

15
Experimental
206 ainize-team/free-llama-api

Run Meta Llama 3.2 API without your GPU for free. We always support the latest model 🧡

14
Experimental
207 Gaolingx/llama.cpp-Launcher

run llama.cpp quickly and conveniently.

14
Experimental
208 iakashpaul/Portal

Android app for the Hole in your Palm project, making LLMs accessible on-device!

14
Experimental
209 miga1999/AirClaw

Run OpenClaw locally on any GPU or CPU without API costs, supporting large...

14
Experimental
210 Atsusheeesh/vllm-daily

📊 Summarize merged PRs daily with vLLM, ensuring you stay updated on key...

14
Experimental
211 botosadam/matryoshka

🚀 Build Ruby gems that utilize Rust for enhanced performance through two...

14
Experimental
212 kashan-alam/ai-backend-fastapi

AI-powered backend API built with FastAPI, JWT authentication, rate...

14
Experimental
213 Root1V/llm-security

JWT-based authentication and authorization gateway for locally deployed LLM...

14
Experimental
214 j-f1/LLM-Playground

Play with LLaMA & GPT-3!

13
Experimental
215 eccenca/llama-index-cmem

llama-index tools eccenca Corporate Memory Integration

13
Experimental
216 diogok/llamautils

Some python utilities for running llama.cpp on linux

13
Experimental
217 zTgx/llama.rust

LLM inference in Rust

13
Experimental
218 georon/llama_test_proj

Skeleton project to run and test Llama and Chromadb locally on a gaming...

13
Experimental
219 SanMog/Uroboros

Automated red-teaming framework for LLMs. Tests GPT-4o, Claude, Llama...

13
Experimental
220 juansalnac/API-mega-list

🌐 Discover a comprehensive collection of APIs to enhance your projects and...

13
Experimental
221 tbogdala/ai_notepad

A lightweight Rust application to test interaction with large language...

13
Experimental
222 Abdullahali77/AI_Testing_CLI

A specialized command-line tool that generates Python unit tests for your...

13
Experimental
223 NavodPeiris/node_llama

run llama models using llamafile and communicate with llama models through...

13
Experimental
224 nherx/free-llm-api-resources

🤖 Discover free API access and credits for various legitimate large language...

13
Experimental
225 JohnClaw/chatllm.v

V-lang api wrapper for llm-inference chatllm.cpp

12
Experimental
226 CameLLM/CameLLM-Llama

Plugin for CameLLM supporting the LLaMA family of models

12
Experimental
227 JohnClaw/chatllm.cs

C# api wrapper for llm-inference chatllm.cpp

12
Experimental
228 JohnClaw/chatllm.vb

VB.NET api wrapper for llm-inference chatllm.cpp

12
Experimental
229 JohnClaw/chatllm.nim

Nim api-wrapper for llm-inference chatllm.cpp

11
Experimental
230 Namangupta123/AlgoMonitor

System Monitoring and Management with AI insights

11
Experimental
231 Chelsea486MHz/mobula

Cloud deployment of LLAMA AI using DALAI

11
Experimental
232 einzig-diego/LLaMA-CPP-Server-Endpoint-API

Examples of how to call the REST api endpoint of the C++ port of LLaMA by...

11
Experimental
233 leozqin/hops

A load-balancing reverse proxy server that enables you to address a fleet of...

11
Experimental
234 k-arthik-r/ALSATE

An automated log monitoring system that continuously analyzes Linux sys-logs...

11
Experimental
235 jazibjohar/ai-text-structor

A powerful asynchronous framework for orchestrating Large Language Model...

11
Experimental
236 pAI-OS/fetch_llama_cpp

llama.cpp downloader that selects the latest and best available binaries for...

11
Experimental
237 m9m9ra/llama.swiftui

It's my playground to test mokpell llama swift lib

11
Experimental
238 numq/text-generation

JVM library for text generation, written in Kotlin and based on the C++...

11
Experimental
239 fasuizu-br/brainiall-llm-gateway

Brainiall LLM Gateway - 113+ AI models via OpenAI-compatible API. Claude,...

11
Experimental
240 aruntemme/llamacpp-swap-boilerplate

A cross-platform template for running and managing llama-swap with...

11
Experimental
241 JohnClaw/llama-3.2-1b.vb

llama 3.2 1b fp16 cpu inference in one file of pure VB.NET

11
Experimental
242 pnsk-lab/CrossLM

Transferability for LLM?

11
Experimental
243 aryamanshrivastava/Ollama_Base

Some gen ai projects

11
Experimental
244 Xaxeric/run-ollama-colab

A lightweight setup to run Ollama (for local LLMs like LLaMA 3, Mistral,...

11
Experimental
245 JohnClaw/chatllm.rs

rust api wrapper for llm-inference chatllm.cpp

11
Experimental
246 tacheometry/rbxts-llama

TS typings for the Lua Library for Immutable Data (Llama) by Frelerik

11
Experimental
247 felipemacedo1/hacka-llama-go

Hackathon project integrating Llama AI models with Go backend for...

11
Experimental
248 akshat2602/Omistral

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other...

11
Experimental
249 AK2k30/Llama-2-Deployment-on-AWS

This is an api of LLama-2, so anybody can use this api to use LLama-2 for free.

11
Experimental
250 mattzcarey/llama.js

run LLMs (llama, mamba, nemo, mistral) at native speeds from Javascript, Typescript.

11
Experimental
251 JohnClaw/chatllm.d

D-lang api wrapper for llm-inference chatllm.cpp

10
Experimental
252 JohnClaw/chatllm.kt

kotlin api wrapper for llm-inference chatllm.cpp

10
Experimental
253 mikesdatawork/101-series

A comprehensive collection of the 101 most useful commands for various...

10
Experimental
254 nemmusu/run-llama-server

This is a Bash script to automatically launch llama-server, detects...

10
Experimental
255 paulpierre/vllm-docker

test Llama-3.2-11B-Vision-Instruct 4-bit quant quickly on an a100 40GB

10
Experimental
256 jim3692/koboldcpp-flake

Run Llama 3.1 directly

10
Experimental
257 JohnClaw/chatllm.lua

lua api wrapper for llm-inference chatllm.cpp

10
Experimental