zihao-ai/unthinking_vulnerability
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
This project identifies and explores a critical "Unthinking Vulnerability" in Large Reasoning Models (LRMs), where specific inputs can bypass the model's intended reasoning process. It provides tools both to exploit this flaw adversarially, inducing incorrect outputs, and to use it beneficially, monitoring and improving the safety and efficiency of these models. AI developers and researchers working with LRMs can use it to build more robust and secure systems.
No commits in the last 6 months.
Use this if you are an AI developer or researcher concerned about the reliability and security of Large Reasoning Models and need tools to test for and mitigate reasoning bypass vulnerabilities.
Not ideal if you are looking for an end-user application or a pre-built solution for a specific business problem, as this is a research toolkit for model developers.
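To make the "reasoning bypass" concrete: LRMs typically emit an explicit reasoning segment before their final answer, and a bypassed run is one where that segment is missing or empty. A minimal sketch of a detector for this condition, assuming `<think>...</think>` delimiters (the delimiter tokens and the helper below are illustrative assumptions, not this repository's API):

```python
import re

# Assumed reasoning delimiters; actual tokens vary by model family.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def reasoning_was_skipped(output: str) -> bool:
    """Return True if the model output contains no reasoning block,
    or the block is effectively empty (whitespace only)."""
    match = THINK_RE.search(output)
    if match is None:
        return True
    return match.group(1).strip() == ""
```

A monitor built on a check like this could flag responses where the model answered without thinking, which is the benign use case the project describes.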
Stars: 33
Forks: —
Language: Python
License: —
Category: —
Last pushed: May 21, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/zihao-ai/unthinking_vulnerability"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
Ed1s0nZ/CyberStrikeAI
CyberStrikeAI is an AI-native security testing platform built in Go. It integrates 100+ security...
GH05TCREW/pentestagent
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty,...
vxcontrol/pentagi
✨ Fully autonomous AI Agents system capable of performing complex penetration testing tasks
asaotomo/FofaMap
FofaMap v2.0 is an AI-driven red-team asset-mapping agent built on Python3, described as the first of its kind. It carries forward the original FOFA data collection, liveness detection, statistical aggregation, icon hash...
SanMuzZzZz/LuaN1aoAgent
LuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing...