BCG-X-Official/artkit
Automated prompt-based testing and evaluation of Gen AI applications
This tool helps AI product managers, trust & safety specialists, and data scientists automatically test their Generative AI applications for issues like Q&A accuracy, brand value conformity, demographic bias, safety, and security vulnerabilities. It takes your Gen AI system and testing criteria as input, then generates relevant test cases and provides an evaluation report on how well your system performs against those criteria. This is for professionals responsible for the responsible development and deployment of Gen AI.
162 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need an automated, customizable way to rigorously test your Gen AI application for critical issues before or after deployment, scaling your testing efforts beyond manual review.
Not ideal if you are looking for a 'push-button' solution that requires no technical customization or if your primary concern is not the quality and safety of Gen AI outputs.
Stars
162
Forks
38
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Mar 06, 2025
Commits (30d)
0
Dependencies
5
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/BCG-X-Official/artkit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
GoogleCloudPlatform/genai-for-marketing
Showcasing Google Cloud's generative AI for marketing scenarios via application frontend,...
modal-labs/modal-client
SDK libraries for Modal
xavidop/genkitx-github
Community Plugin for Genkit to use Github Models
eeemoon/perchance
Unofficial Python API for Perchance.
codeproject/CodeProject.AI-Server
CodeProject.AI Server is a self contained service that software developers can include in, and...