retkowsky/foundry-local

Foundry Local is an on-device AI inference solution that you use to run AI models locally through a CLI, SDK, or REST API

/ 100

Emerging

Foundry Local helps developers run generative AI models directly on their own computers (Windows, macOS, or servers) instead of relying on cloud services. It takes AI models as input and outputs the results of their inference, allowing for tasks like real-time chat or content generation. Developers, data scientists, and engineers who build AI-powered applications will use this.

Use this if you need to run AI models on-device for applications requiring strict data privacy, offline capabilities, low latency, or cost efficiency, without sending data to the cloud.

Not ideal if you prefer a fully managed cloud-based AI service or don't have the local hardware resources (e.g., sufficient RAM or GPU/NPU) to run models efficiently.

AI application development edge AI private AI offline AI local inference

No License No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 3 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Featured in

You're Shipping AI You Can't Measure

Higher-rated alternatives

openvinotoolkit/model_server

A scalable inference server for models optimized with OpenVINO™

madroidmaq/mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically...

NVIDIA-NeMo/Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based...

generative-computing/mellea

Mellea is a library for writing generative programs.

rhesis-ai/rhesis

Open-source platform & SDK for testing LLM and agentic apps. Define expected behavior, generate...

Explore Generative AI Tools

All categories Trending Generative AI directory Insights