LLM Hallucination Mitigation Tools
Tools and techniques for detecting, measuring, and correcting hallucinations in large language models across text and multimodal outputs. Does NOT include general LLM evaluation, factuality benchmarks, or non-hallucination-specific safety measures.
We track 34 LLM hallucination mitigation tools. One scores above 50 (the established tier). The highest-rated is vectara/hallucination-leaderboard at 55/100 with 3,122 stars. One of the top 10 is actively maintained.
Get all 34 projects as JSON:

```shell
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-hallucination-mitigation&limit=34"
```
The API is open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
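A minimal sketch of working with the endpoint's JSON in Python. The response shape here (a `projects` list with `name`, `score`, and `tier` fields) is an assumption for illustration, not a documented contract; check the actual payload before relying on it.

```python
import json

# Hypothetical sample payload; field names and the second entry's
# score are assumptions, not real API data.
SAMPLE = json.loads("""
{
  "projects": [
    {"name": "vectara/hallucination-leaderboard", "score": 55, "tier": "Established"},
    {"name": "example/emerging-tool", "score": 40, "tier": "Emerging"}
  ]
}
""")

def tools_in_tier(payload, tier):
    """Return project names in the given tier, highest score first."""
    hits = [p for p in payload["projects"] if p["tier"] == tier]
    return [p["name"] for p in sorted(hits, key=lambda p: p["score"], reverse=True)]

print(tools_in_tier(SAMPLE, "Established"))
# prints ['vectara/hallucination-leaderboard']
```

In practice you would replace `SAMPLE` with the parsed body of the `curl` request above.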
| # | Tool | Description | Tier |
|---|------|-------------|------|
| 1 | vectara/hallucination-leaderboard | Leaderboard Comparing LLM Performance at Producing Hallucinations when... | Established |
| 2 | PKU-YuanGroup/Hallucination-Attack | Attack to induce LLMs within hallucinations | Emerging |
| 3 | amir-hameed-mir/Sirraya_LSD_Code | Layer-wise Semantic Dynamics (LSD) is a model-agnostic framework for... | Emerging |
| 4 | NishilBalar/Awesome-LVLM-Hallucination | up-to-date curated list of state-of-the-art Large vision language models... | Emerging |
| 5 | intuit/sac3 | Official repo for SAC3: Reliable Hallucination Detection in Black-Box... | Emerging |
| 6 | HillZhang1999/llm-hallucination-survey | Reading list of hallucination in LLMs. Check out our new survey paper:... | Emerging |
| 7 | Amirhosein-gh98/Gnosis | Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits | Emerging |
| 8 | OpenMOSS/HalluQA | Dataset and evaluation script for "Evaluating Hallucinations in Chinese... | Emerging |
| 9 | MemTensor/HaluMem | HaluMem is the first operation level hallucination evaluation benchmark... | Emerging |
| 10 | hongcheki/sweet-watermark | Official repository of the paper: Who Wrote this Code? Watermarking for Code... | Emerging |
| 11 | plll4zzx/Awesome-LLM-Watermark | A collection list for Large Language Model (LLM) Watermark | Emerging |
| 12 | VITA-MLLM/Woodpecker | ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models | Emerging |
| 13 | hzy312/Awesome-LLM-Watermark | UP-TO-DATE LLM Watermark paper. 🔥🔥🔥 | Emerging |
| 14 | zjunlp/FactCHD | [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection | Experimental |
| 15 | oumi-ai/halloumi-demo | Try out HallOumi, a state-of-the-art claim verification model in a simple UI! | Experimental |
| 16 | hongbinye/Cognitive-Mirage-Hallucinations-in-LLMs | Repository for the paper "Cognitive Mirage: A Review of Hallucinations in... | Experimental |
| 17 | Mattbusel/LLM-Hallucination-Detection-Script | A comprehensive toolkit for detecting potential hallucinations in LLM... | Experimental |
| 18 | Intelligent-Computing-Research-Group/HaVen | [DATE 2025] HaVen: hallucination-mitigated LLM for Verilog code generation... | Experimental |
| 19 | 10nc0/Nyan-Protocol | Hallucination guard for AI — one invariant, any model, no training required. | Experimental |
| 20 | lilakk/PostMark | Official repository for "PostMark: A Robust Blackbox Watermark for Large... | Experimental |
| 21 | IAAR-Shanghai/ICSFSurvey | Explore concepts like Self-Correct, Self-Refine, Self-Improve,... | Experimental |
| 22 | hallucinatemd/hallucinate.md | The open standard for telling AI not to hallucinate. | Experimental |
| 23 | kjgpta/WhoDunIt-Evaluation_benchmark_for_culprit_detection_in_mystery_stories | WHODUNIT is a benchmark repository for evaluating large language models'... | Experimental |
| 24 | ruisizhang123/REMARK-LLM | [USENIX Security'24] REMARK-LLM: A robust and efficient watermarking... | Experimental |
| 25 | lasithadilshan/Hallucination-Detector-App | A Hallucination Detection Tool powered by UQML, designed to identify whether... | Experimental |
| 26 | pranav-kural/llm-hallucination-detection-service | Build your own open-source REST API endpoint to detect hallucination in LLM... | Experimental |
| 27 | serhanylmz/pas2 | PAS2: A Python-based hallucination detection system that evaluates AI... | Experimental |
| 28 | DegenAI-Labs/HalluWorld | Repository for the paper "A Unified Definition of Hallucination: It’s The... | Experimental |
| 29 | akborsusom/watermark-ai-analysis | Reproduction and attack analysis of LLM text watermarking (Kirchenbauer et... | Experimental |
| 30 | 141forever/DiaHalu | This is the repository for the paper 'DiaHalu: A Dialogue-level... | Experimental |
| 31 | tranhoangtu-it/halluciguard-api | HalluciGuard API — AI Hallucination Firewall as a Service. Detect and filter... | Experimental |
| 32 | strayfear/HalluWorld | 🌍 Explore the HalluWorld project, a benchmark for understanding and defining... | Experimental |
| 33 | IAAR-Shanghai/UHGEval-dataset | The full pipeline of creating UHGEval hallucination dataset | Experimental |
| 34 | amarquaye/atlas | 🔢Hallucination detector for Large Language Models. | Experimental |