giuven95/chatgpt-failures

Failure archive for ChatGPT and similar models

/ 100

Emerging

This archive compiles real-world examples of how large language models (LLMs) like ChatGPT and New Bing can produce unexpected, incorrect, or problematic outputs. It gathers specific instances where these AI models fail in tasks ranging from basic arithmetic and factual recall to exhibiting unusual conversational behaviors. Anyone involved in evaluating, testing, or researching the practical limitations of current LLMs can use this resource to understand common failure modes.

597 stars. No commits in the last 6 months.

Use this if you need concrete examples of LLM failures for research, comparative analysis of different models, or to generate synthetic test data to improve AI robustness.

Not ideal if you are looking for a guide on how to fix these LLM issues yourself, or if you need a collection of success stories.

AI evaluation LLM testing AI safety Machine learning research Natural language processing

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 12 / 25

How are scores calculated?

Stars

597

Forks

Language

Python

License

—

Higher-rated alternatives

k4l1sh/alexa-gpt

A tutorial on how to use ChatGPT in Alexa

rawandahmad698/PyChatGPT

⚡️ Python client for the ChatGPT API with, conversation tracking, proxy support and more.

Simatwa/WebChatGPT

Python SDK/API for ChatGPT Web-Version

toshiakit/MatGPT

MATLAB app to access ChatGPT API from OpenAI

minimaxir/simpleaichat

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Explore LLM Tools

All categories Trending LLM Tool directory Insights