eth-sri/ToolFuzz
ToolFuzz is a fuzzing framework for testing LLM agent tools.
ToolFuzz helps developers rigorously test the correctness and robustness of their LLM agent tools. It takes your existing agent tools, such as those built with LangChain or AutoGen, and automatically generates a wide range of test prompts. The output is a detailed report highlighting tool crashes and incorrect responses, so you can identify and fix issues before deployment.
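The general idea of fuzzing a tool for crashes can be sketched in a few lines of Python. This is a conceptual illustration only and does not use ToolFuzz's actual API: the tool `lookup_age`, the helper `fuzz_tool`, and the fixed input list are all hypothetical, and the hand-written inputs stand in for the automatically generated prompts described above.

```python
def lookup_age(name: str) -> int:
    """Hypothetical agent tool: returns an age for a known name."""
    ages = {"alice": 30, "bob": 25}
    return ages[name.lower()]  # raises KeyError for unknown names


def fuzz_tool(tool, inputs):
    """Call the tool on each input and record any exception as a failure."""
    failures = []
    for value in inputs:
        try:
            tool(value)
        except Exception as exc:  # a crash is what the fuzzer is looking for
            failures.append({"input": value, "error": type(exc).__name__})
    return failures


# Assumption: a fixed list of edge cases replaces generated prompts.
test_inputs = ["alice", "Bob", "", "ALICE ", "carol", "123"]
report = fuzz_tool(lookup_age, test_inputs)

for failure in report:
    print(f"input={failure['input']!r} raised {failure['error']}")
```

Each entry in `report` pairs a failing input with the exception type it triggered, which is the simplest form of the crash report mentioned above. Checking for incorrect (rather than crashing) responses would need an additional correctness check that this sketch leaves out.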
No commits in the last 6 months.
Use this if you are a developer building LLM agent applications and need to ensure your agent's tools are reliable and accurate under various real-world scenarios.
Not ideal if you are an end-user of an LLM application and not involved in the development or testing of its underlying tools.
Stars: 37
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Jul 20, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/eth-sri/ToolFuzz"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
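The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `build_quality_url` and `fetch_quality` are illustrative, and it assumes the endpoint returns a JSON body, which the listing does not state.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a given repository."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for a repository.

    Assumption: the response body is JSON; the schema is not documented here.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("eth-sri", "ToolFuzz"))
```

Because unauthenticated access is limited to 100 requests/day, it may be worth caching the result locally rather than calling `fetch_quality` repeatedly.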
Higher-rated alternatives
petterjuan/agentic-reliability-framework
ARF is an agentic reliability intelligence platform that separates decision intelligence (OSS)...
sarkar-ai-taken/riva
Local-first observability and control plane for AI agents.
Nubaeon/empirica
Make AI agents and AI workflows measurably reliable. Epistemic measurement, Noetic RAG,...
relai-ai/relai-sdk
A platform for building reliable AI agents
itbench-hub/ITBench-CISO-CAA-Agent
Code repository for CISO agent as part of ITBench