viniciusarruda/llama-cpp-chat-completion-wrapper
Wrapper around llama-cpp-python for chat completion with LLaMA v2 models.
This tool helps developers integrate LLaMA v2 large language models into their applications for conversational AI tasks. It simplifies the process of formatting user prompts and model responses for LLaMA v2, ensuring the chat completion functions correctly. Developers who want to build chatbots or other interactive AI experiences using LLaMA v2 models locally would find this useful.
No commits in the last 6 months.
Use this if you are a developer building a conversational AI application and need to easily implement LLaMA v2 chat completion using Python.
Not ideal if you are a non-developer seeking a ready-to-use chatbot or if you are working with language models other than LLaMA v2.
Stars
24
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Jul 27, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/viniciusarruda/llama-cpp-chat-completion-wrapper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
posit-dev/chatlas
Your friendly guide to building LLM chat apps in Python with less effort and more clarity.
xming521/WeClone
🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat...
ooyinet/WeClone
🚀从聊天记录创造数字分身的一站式解决方案💡 使用聊天记录微调大语言模型,让大模型有“那味儿”,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/LLM/聊天机器人/LoRA
vemonet/libre-chat
🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline...
qqqqqf-q/MirrorFlow
从对话数据到训练:数字分身 + 模型蒸馏 From Dialogue Data to Training Closed-Loop: Digital Twin + Model Distillation