liltom-eth/llama2-webui
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
This project helps developers and engineers run various Llama 2 and Code Llama models on their local machines, offering a web-based chat interface. It allows you to input text prompts and receive generated text or code completions directly from these powerful large language models. The end-users are AI/ML developers, data scientists, or engineers who want to integrate Llama 2 into their applications or test models locally.
1,945 stars. No commits in the last 6 months.
Use this if you are a developer or AI engineer looking to run, test, or integrate Llama 2 models locally on your computer with a user-friendly interface or through an OpenAI-compatible API.
Not ideal if you are a non-technical user looking for a ready-to-use AI chatbot without any local setup or programming.
Stars
1,945
Forks
202
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 22, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/liltom-eth/llama2-webui"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
containers/ramalama
RamaLama is an open-source developer tool that simplifies the local serving of AI models from...
av/harbor
One command brings a complete pre-wired LLM stack with hundreds of services to explore.
RunanywhereAI/runanywhere-sdks
Production ready toolkit to run AI locally
runpod-workers/worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
foldl/chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)