p-e-w/chatbot_clinic

Science-driven chatbot development

37
/ 100
Emerging

This helps you scientifically evaluate and refine your custom chatbots. You provide various chatbot configurations, descriptions, and settings, then interact with all of them simultaneously in a blind test. The output is a clear statistical breakdown of which chatbot received the most votes, allowing you to objectively determine the best performing version. This is for anyone who designs or fine-tunes virtual characters powered by large language models.

No commits in the last 6 months.

Use this if you are developing multiple chatbot personas or configurations and want a structured, unbiased way to compare their performance based on user preference.

Not ideal if you need an automated, quantitative evaluation using metrics like perplexity or sentiment analysis, rather than human judgment.

chatbot-design LLM-fine-tuning virtual-assistant-development user-preference-testing conversational-AI
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

65

Forks

8

Language

Python

License

AGPL-3.0

Last pushed

May 05, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/p-e-w/chatbot_clinic"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.