Trustworthy-ML-Lab/ThinkEdit

[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.

/ 100

Experimental

This project helps improve the performance of large language models (LLMs) on complex reasoning tasks by addressing overly short reasoning. It takes an existing LLM's responses to problems and identifies specific internal components responsible for insufficient 'thinking' steps. The output is a modified LLM that generates more complete and accurate reasoning, benefiting anyone using LLMs for tasks requiring detailed, multi-step problem-solving.

Use this if your LLM is producing correct answers on mathematical or logical problems but often skips intermediate steps, leading to less reliable or less transparent results.

Not ideal if you need to debug or modify an LLM's behavior for issues unrelated to reasoning length, such as factual inaccuracies, stylistic preferences, or ethical concerns.

LLM fine-tuning reasoning improvement mathematical problem solving interpretability AI model enhancement

No License No Package No Dependents

Maintenance 6 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

steering-vectors/steering-vectors

Steering vectors for transformer language models in Pytorch / Huggingface

jianghoucheng/AlphaEdit

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)

kmeng01/memit

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

boyiwei/alignment-attribution-code

[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

jianghoucheng/AnyEdit

AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025

Explore Transformer Models

All categories Trending Transformer directory Insights