p-e-w/chatbot_clinic

Science-driven chatbot development

/ 100

Emerging

This tool helps you evaluate and refine custom chatbots empirically. You provide several chatbot configurations, descriptions, and settings, then chat with all of them simultaneously in a blind test. The output is a statistical breakdown of how many votes each chatbot received, so you can pick the best-performing version from recorded preferences rather than impressions. It is aimed at anyone who designs or fine-tunes virtual characters powered by large language models.
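The blind-test idea described above can be illustrated with a minimal sketch. This is not taken from the chatbot_clinic code base: the function names `assign_blind_labels` and `tally_votes`, the configuration names, and the vote list are all hypothetical, and the sketch assumes at most 26 configurations so that single-letter labels suffice.

```python
import random
from collections import Counter


def assign_blind_labels(config_names, rng):
    """Map anonymous labels (A, B, C, ...) to configurations in random order.

    Hypothetical helper: the tester only ever sees the labels, not the names.
    """
    shuffled = list(config_names)
    rng.shuffle(shuffled)
    return {chr(ord("A") + i): name for i, name in enumerate(shuffled)}


def tally_votes(votes, label_to_config):
    """Count votes cast on blind labels and return (count, share) per configuration."""
    counts = Counter(label_to_config[label] for label in votes)
    total = sum(counts.values())
    return {
        name: (counts[name], counts[name] / total if total else 0.0)
        for name in label_to_config.values()
    }


# Illustrative data only: three made-up configurations and five made-up votes.
rng = random.Random(0)
labels = assign_blind_labels(["formal", "casual", "terse"], rng)
votes = ["A", "B", "A", "C", "A"]
results = tally_votes(votes, labels)

# Reveal the mapping only after voting, sorted by vote count.
for name, (count, share) in sorted(results.items(), key=lambda kv: -kv[1][0]):
    print(f"{name}: {count} votes ({share:.0%})")
```

Keeping the label-to-configuration mapping hidden until after the tally is what makes the comparison blind: the voter cannot favour a configuration they already expect to be better.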

No commits in the last 6 months.

Use this if you are developing multiple chatbot personas or configurations and want a structured, unbiased way to compare their performance based on user preference.

Not ideal if you need an automated, quantitative evaluation using metrics like perplexity or sentiment analysis, rather than human judgment.

chatbot-design LLM-fine-tuning virtual-assistant-development user-preference-testing conversational-AI

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 13 / 25

Language Python

License AGPL-3.0

Higher-rated alternatives

posit-dev/chatlas

Your friendly guide to building LLM chat apps in Python with less effort and more clarity.

xming521/WeClone

🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat...

ooyinet/WeClone

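🚀 One-stop solution for creating a digital twin from chat history 💡 Fine-tune a large language model on your chat logs so that it picks up your distinctive voice, then bind it to a chatbot to bring your digital twin to life. Digital clone / digital twin / digital immortality / LLM / chatbot / LoRA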

vemonet/libre-chat

🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline...

qqqqqf-q/MirrorFlow

From dialogue data to a closed training loop: digital twin + model distillation
