liltom-eth/llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.

/ 100

Emerging

This project helps developers and engineers run various Llama 2 and Code Llama models on their local machines, offering a web-based chat interface. It allows you to input text prompts and receive generated text or code completions directly from these powerful large language models. The end-users are AI/ML developers, data scientists, or engineers who want to integrate Llama 2 into their applications or test models locally.

1,945 stars. No commits in the last 6 months.

Use this if you are a developer or AI engineer looking to run, test, or integrate Llama 2 models locally on your computer with a user-friendly interface or through an OpenAI-compatible API.

Not ideal if you are a non-technical user looking for a ready-to-use AI chatbot without any local setup or programming.

AI development natural language processing code generation machine learning engineering local LLM deployment

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

1,945

Forks

202

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

containers/ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from...

av/harbor

One command brings a complete pre-wired LLM stack with hundreds of services to explore.

RunanywhereAI/runanywhere-sdks

Production ready toolkit to run AI locally

runpod-workers/worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.

foldl/chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

Explore LLM Tools

All categories Trending LLM Tool directory Insights