marklysze/AutoGenPromptTesting

Prompt testing local LLMs for Microsoft's AutoGen

Score: 20 / 100 (Experimental)

This project helps developers evaluate how well different local Large Language Models (LLMs) perform specific tasks within an AutoGen multi-agent system. It takes various local LLMs and different prompt engineering techniques as input, then measures how accurately the LLM selects the correct 'speaker' or agent in a simulated debate scenario. The ideal user is a developer building multi-agent AI applications with AutoGen, specifically when using local, self-hosted LLMs that might not be as capable as commercial alternatives.
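The kind of measurement the description refers to can be sketched as a simple accuracy loop. This is a hypothetical illustration, not the repo's actual code: `select_speaker` stands in for whatever prompt-plus-local-LLM combination is under test, and the debate cases are invented.

```python
# Sketch of scoring speaker-selection accuracy in a multi-agent
# debate. select_speaker(prompt, agents) is a hypothetical callable
# wrapping the local LLM under test; the test cases are made up.

def evaluate_speaker_selection(select_speaker, test_cases):
    """Return the fraction of cases where the model picks the
    expected next speaker."""
    correct = 0
    for case in test_cases:
        chosen = select_speaker(case["prompt"], case["agents"])
        if chosen == case["expected"]:
            correct += 1
    return correct / len(test_cases)

# Hypothetical debate scenario, scored against a naive baseline
# that always picks the first listed agent.
cases = [
    {"prompt": "Moderator asked for a rebuttal.",
     "agents": ["Debater_A", "Debater_B"], "expected": "Debater_B"},
    {"prompt": "Opening statements begin.",
     "agents": ["Debater_A", "Debater_B"], "expected": "Debater_A"},
]
baseline = lambda prompt, agents: agents[0]
print(evaluate_speaker_selection(baseline, cases))  # 0.5
```

Swapping `baseline` for a call into an AutoGen group chat's speaker-selection step lets the same loop compare prompts and models.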

No commits in the last 6 months.

Use this if you are an AI application developer working with AutoGen and need to understand how to optimize prompts for local LLMs to achieve reliable multi-agent coordination.

Not ideal if you are looking for a fully structured, production-ready framework for generalized prompt testing, or if you are exclusively using highly capable proprietary, hosted LLMs such as ChatGPT.

Tags: AI application development, AutoGen, local LLMs, prompt engineering, multi-agent systems
Badges: No License · Stale (6m) · No Package · No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 7 / 25


Stars: 10
Forks: 1
Language: Python
License: none
Last pushed: Feb 27, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/marklysze/AutoGenPromptTesting"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
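For consumers who would rather parse the response in code than shell out to curl, a minimal Python sketch follows. The field names (`score`, `maintenance`, `adoption`) are assumptions based on this page's layout, not a documented schema; only the URL comes from the listing above.

```python
import json
import urllib.request

# Endpoint from the listing above.
URL = ("https://pt-edge.onrender.com/api/v1/quality/"
       "prompt-engineering/marklysze/AutoGenPromptTesting")

def fetch_quality(url=URL):
    """Fetch the quality report and return the parsed JSON dict."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

# Offline illustration of the parsing step. The field names here
# are assumed, mirroring the scores shown on this page.
sample = json.loads('{"score": 20, "maintenance": 0, "adoption": 5}')
print(sample["score"])  # 20
```

In production you would call `fetch_quality()` and handle `urllib.error.HTTPError` for rate-limit responses once the daily quota is exhausted.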