nicholasyager/llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
llama-cpp-guidance is aimed at developers building applications powered by locally hosted large language models (LLMs). It bridges llama-cpp-python (local model inference) and Guidance (a popular framework for controlling LLM output), so your application can produce more reliable, structured text from a local model.
No commits in the last 6 months.
Use this if you are a developer building an application on local LLMs and you need precise control over the structure and content of the generated text.
Not ideal if you are an end user looking for a ready-to-use application, or if you are working with cloud-based LLMs rather than local ones.
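To illustrate the kind of "structured generation" this pairing enables, here is a toy, library-free sketch: fixed template text is emitted verbatim and a stand-in "model" only fills the holes. This is purely conceptual and does not use the actual llama-cpp-guidance or Guidance APIs.

```python
import re

def fill_template(template, generate):
    """Fill {{name}} holes in a fixed template via a generator callback.

    Everything outside the holes is emitted verbatim; only the hole
    contents come from the model. Constraining where the model may
    write is the core idea behind guided/structured generation.
    """
    return re.sub(r"\{\{(\w+)\}\}", lambda m: generate(m.group(1)), template)

# A stand-in "model": in real use, a local LLM would produce these values.
def fake_model(slot):
    canned = {"name": "llama-cpp-guidance", "stars": "36"}
    return canned[slot]

result = fill_template("Repo {{name}} has {{stars}} stars.", fake_model)
print(result)  # Repo llama-cpp-guidance has 36 stars.
```

In the real libraries, the same contract is enforced at the token level: the template's literal text is never sampled, and the model is only free inside the designated slots.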
Stars: 36
Forks: 4
Language: Python
License: —
Category:
Last pushed: Sep 11, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/nicholasyager/llama-cpp-guidance"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
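The same endpoint can be queried from Python using only the standard library. A minimal sketch, assuming the endpoint returns JSON; the network call runs only when the script is executed directly:

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"

def quality_url(owner, repo):
    """Build the quality-endpoint URL for a given GitHub owner/repo."""
    return f"{BASE}/{owner}/{repo}"

def fetch_quality(owner, repo):
    """Fetch quality data for a repo; assumes a JSON response body."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=10) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(fetch_quality("nicholasyager", "llama-cpp-guidance"))
```

With the keyless tier limited to 100 requests/day, cache responses locally rather than re-fetching on every run.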
Higher-rated alternatives
- beehive-lab/GPULlama3.java: GPU-accelerated Llama3.java inference in pure Java using TornadoVM.
- gitkaz/mlx_gguf_server: A FastAPI-based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously...
- srgtuszy/llama-cpp-swift: Swift bindings for the llama-cpp library.
- JackZeng0208/llama.cpp-android-tutorial: llama.cpp tutorial for Android phones.
- awinml/llama-cpp-python-bindings: Run fast LLM inference using llama.cpp in Python.