BCG-X-Official/artkit

Automated prompt-based testing and evaluation of Gen AI applications

/ 100

Established

This tool helps AI product managers, trust & safety specialists, and data scientists automatically test their Generative AI applications for issues like Q&A accuracy, brand value conformity, demographic bias, safety, and security vulnerabilities. It takes your Gen AI system and testing criteria as input, then generates relevant test cases and provides an evaluation report on how well your system performs against those criteria. This is for professionals responsible for the responsible development and deployment of Gen AI.

162 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need an automated, customizable way to rigorously test your Gen AI application for critical issues before or after deployment, scaling your testing efforts beyond manual review.

Not ideal if you are looking for a 'push-button' solution that requires no technical customization or if your primary concern is not the quality and safety of Gen AI outputs.

AI-testing Generative-AI-evaluation AI-trust-and-safety prompt-engineering responsible-AI

Stale 6m

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 21 / 25

How are scores calculated?

Stars

162

Forks

Language

Jupyter Notebook

License

Apache-2.0

Related tools

GoogleCloudPlatform/genai-for-marketing

Showcasing Google Cloud's generative AI for marketing scenarios via application frontend,...

modal-labs/modal-client

SDK libraries for Modal

xavidop/genkitx-github

Community Plugin for Genkit to use Github Models

eeemoon/perchance

Unofficial Python API for Perchance.

codeproject/CodeProject.AI-Server

CodeProject.AI Server is a self contained service that software developers can include in, and...

Explore Generative AI Tools

All categories Trending Generative AI directory Insights