jwergieluk/revllm

RevLLM -- Reverse Engineering Tools for Large Language Models

/ 100

Emerging

RevLLM provides tools to understand how large language models (LLMs) like GPT-2 generate text and make decisions. It takes a language model and text prompts as input, then outputs detailed visualizations and analyses of the model's internal workings. This helps data scientists and machine learning engineers explain and debug their generative AI models.

No commits in the last 6 months.

Use this if you need to deeply understand why your generative language model produces a specific output or behaves in a certain way.

Not ideal if you only need to fine-tune or deploy language models and do not need to analyze their internal mechanisms.

LLM explainability, NLP model analysis, Generative AI diagnostics, Transformer interpretability, Deep learning engineering

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 9 / 25

Language: Python

License: MIT

Higher-rated alternatives

MadryLab/context-cite

Attribute (or cite) statements generated by LLMs back to in-context information.

microsoft/augmented-interpretable-models

Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.

Trustworthy-ML-Lab/CB-LLMs

[ICLR 25] A novel framework for building intrinsically interpretable LLMs with...

poloclub/LLM-Attributor

LLM Attributor: Attribute LLM's Generated Text to Training Data

THUDM/LongCite

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
