IAAR-Shanghai/ICSFSurvey
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.
This project is a comprehensive survey of how Large Language Models (LLMs) can evaluate and improve their own outputs. It helps AI researchers, machine learning engineers, and data scientists understand methods for reducing incorrect outputs (hallucinations) and strengthening reasoning in LLMs. The repository provides a curated list of research papers, along with experimental results on consistency measurement and response analysis.
172 stars. No commits in the last 6 months.
Use this if you are an AI researcher or practitioner looking to understand the current landscape of self-correction, self-refinement, and hallucination alleviation techniques in large language models.
Not ideal if you are looking for an out-of-the-box software tool to directly apply to your LLM projects without diving into research papers.
Stars
172
Forks
4
Language
Jupyter Notebook
License
—
Category
—
Last pushed
Dec 07, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/IAAR-Shanghai/ICSFSurvey"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
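For programmatic access beyond the one-off curl call above, a minimal Python sketch using only the standard library is shown below; the JSON field names (stars, forks, last_pushed) are assumptions, since the response schema is not documented on this page.

import json
import urllib.request

URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/IAAR-Shanghai/ICSFSurvey"

# Fetch the endpoint and decode the JSON body (the response object is
# file-like, so it can be passed straight to json.load).
with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.load(resp)

# Field names here are assumed; print the raw payload first to confirm
# the actual schema before relying on specific keys.
print(data.get("stars"), data.get("forks"), data.get("last_pushed"))

The same call works unauthenticated within the 100-requests/day limit; with an API key the quota rises to 1,000/day.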
Higher-rated alternatives
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
PKU-YuanGroup/Hallucination-Attack
Attack method for inducing hallucinations in LLMs
amir-hameed-mir/Sirraya_LSD_Code
Layer-wise Semantic Dynamics (LSD) is a model-agnostic framework for hallucination detection in...
NishilBalar/Awesome-LVLM-Hallucination
Up-to-date curated list of state-of-the-art large vision-language model hallucination...
intuit/sac3
Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via...