eth-sri/ToolFuzz

ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.

/ 100

Emerging

ToolFuzz helps developers rigorously test the correctness and robustness of their LLM agent tools. It takes your existing agent tools, like those built with Langchain or AutoGen, and automatically generates a wide range of test prompts. The output is a detailed report highlighting any tool crashes or incorrect responses, enabling you to identify and fix issues before deployment.

No commits in the last 6 months.

Use this if you are a developer building LLM agent applications and need to ensure your agent's tools are reliable and accurate under various real-world scenarios.

Not ideal if you are an end-user of an LLM application and not involved in the development or testing of its underlying tools.

LLM-development agent-testing application-quality software-testing AI-safety

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Featured in

Agent Governance in 2026: Who's Building the Guardrails?

Higher-rated alternatives

petterjuan/agentic-reliability-framework

ARF is an agentic reliability intelligence platform that separates decision intelligence (OSS)...

sarkar-ai-taken/riva

Local-first observability and control plane for AI agents.

Nubaeon/empirica

Make AI agents and AI workflows measurably reliable. Epistemic measurement, Noetic RAG,...

relai-ai/relai-sdk

A platform for building reliable AI agents

itbench-hub/ITBench-CISO-CAA-Agent

Code repository for CISO agent as part of ITBench

Explore AI Agents

All categories Trending AI Agent directory Insights