jwergieluk/revllm
RevLLM -- Reverse Engineering Tools for Large Language Models
RevLLM provides tools to understand how large language models (LLMs) like GPT-2 generate text and make decisions. It takes a language model and text prompts as input, then outputs detailed visualizations and analyses of the model's internal workings. This helps data scientists and machine learning engineers explain and debug their generative AI models.
No commits in the last 6 months.
Use this if you need to deeply understand why your generative language model produces a specific output or behaves in a certain way.
Not ideal if you are looking for a tool to simply fine-tune or deploy language models without needing to analyze their internal mechanisms.
Stars
18
Forks
2
Language
Python
License
MIT
Category
Last pushed
Feb 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/jwergieluk/revllm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MadryLab/context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
microsoft/augmented-interpretable-models
Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.
Trustworthy-ML-Lab/CB-LLMs
[ICLR 25] A novel framework for building intrinsically interpretable LLMs with...
poloclub/LLM-Attributor
LLM Attributor: Attribute LLM's Generated Text to Training Data
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA