withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

73
/ 100
Verified

This project helps JavaScript and TypeScript developers integrate advanced AI capabilities directly into their applications by running large language models (LLMs) on their own machines. Developers input a language model and prompts, and the tool outputs structured text, function calls, or embeddings, enabling features like smart chatbots, data summarization, or advanced search within their applications. It's designed for developers building AI-powered features without relying on external cloud services.

1,942 stars. Used by 6 other packages. Actively maintained with 2 commits in the last 30 days. Available on npm.

Use this if you are a developer looking to embed AI models directly into your JavaScript/TypeScript applications, especially when you need precise control over model output formatting like JSON.

Not ideal if you are an end-user who just wants to chat with an AI or if you prefer using cloud-based AI services without local setup.

AI-powered applications local LLM deployment JavaScript development TypeScript development on-device AI
Maintenance 13 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

1,942

Forks

176

Language

TypeScript

License

MIT

Last pushed

Mar 12, 2026

Commits (30d)

2

Dependencies

28

Reverse dependents

6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/withcatai/node-llama-cpp"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.