AlphaPav/mem-kk-logic
On Memorization of Large Language Models in Logical Reasoning
This project helps researchers understand how Large Language Models (LLMs) solve logical reasoning puzzles, specifically 'Knights and Knaves' problems. By measuring an LLM's performance on these puzzles and on systematically perturbed versions of them, it reveals whether the model is genuinely reasoning or merely recalling memorized training data. It is aimed at AI researchers and cognitive scientists working with LLMs.
No commits in the last 6 months.
Use this if you are an AI researcher investigating whether an LLM's logical reasoning ability is due to genuine understanding or simply memorizing training examples.
Not ideal if you are looking for a tool to directly improve an LLM's performance on a real-world reasoning task, as this project focuses on analysis rather than application.
Stars: 76
Forks: 8
Language: Python
License: MIT
Category:
Last pushed: Mar 29, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/AlphaPav/mem-kk-logic"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
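The same endpoint can be queried from Python using only the standard library. This is a minimal sketch: the base URL and repo path come from the curl command above, but the shape of the JSON response is not documented here, so inspect the raw payload to learn the actual fields.

```python
import json
import urllib.request

# Base endpoint taken from the curl example above.
API = "https://pt-edge.onrender.com/api/v1/quality"

def fetch_quality(repo_path: str) -> dict:
    """Fetch quality data for a repo path like 'transformers/AlphaPav/mem-kk-logic'.

    The response schema is an assumption; print the raw JSON to see
    which fields the API actually returns.
    """
    with urllib.request.urlopen(f"{API}/{repo_path}", timeout=10) as resp:
        return json.load(resp)

# Example (requires network access; subject to the 100 requests/day limit):
# data = fetch_quality("transformers/AlphaPav/mem-kk-logic")
# print(json.dumps(data, indent=2))
```

Without an API key the anonymous limit of 100 requests/day applies, so cache responses locally if you are querying many repositories.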
Higher-rated alternatives
ExtensityAI/symbolicai
A neurosymbolic perspective on LLMs
TIGER-AI-Lab/MMLU-Pro
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding...
deep-symbolic-mathematics/LLM-SR
[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation...
microsoft/interwhen
A framework for verifiable reasoning with language models.
zhudotexe/fanoutqa
Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language...