WolodjaZ/MSAE

Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)

Score: 47 / 100 (Emerging)

This project helps AI researchers and practitioners better understand how large vision-language models like CLIP represent images and text. It takes pre-computed activations from these models and learns hierarchical, interpretable features that reveal the semantic concepts the model uses. This lets researchers analyze model biases and perform concept-based similarity searches, ultimately leading to more controllable and explainable AI systems.
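
For intuition, here is a minimal sketch of the underlying idea: a TopK sparse autoencoder trained on pre-computed CLIP embeddings, with nested dictionary prefixes so smaller prefixes act as coarser concept levels. The class name, dimensions, prefix sizes, and training loop are illustrative assumptions, not the repository's actual API.

# Sketch only: hierarchical (nested-prefix) TopK sparse autoencoder over
# pre-computed CLIP activations. All names and sizes are hypothetical.
import torch
import torch.nn as nn

class MatryoshkaSAE(nn.Module):
    def __init__(self, d_model=768, d_dict=8192, k=32,
                 prefixes=(1024, 4096, 8192)):
        super().__init__()
        self.enc = nn.Linear(d_model, d_dict)
        self.dec = nn.Linear(d_dict, d_model)
        self.k = k
        self.prefixes = prefixes  # nested dictionary sizes, coarse -> fine

    def encode(self, x, d_active):
        z = torch.relu(self.enc(x)).clone()
        z[:, d_active:] = 0.0  # restrict to the first d_active latents
        # keep only the top-k activations per example (TopK sparsity)
        vals, idx = torch.topk(z, self.k, dim=-1)
        return torch.zeros_like(z).scatter_(-1, idx, vals)

    def forward(self, x):
        # reconstruct x from every prefix and sum the losses, so each
        # nested prefix is a usable concept dictionary on its own
        losses, x_hat = [], None
        for d in self.prefixes:
            x_hat = self.dec(self.encode(x, d))
            losses.append(torch.mean((x - x_hat) ** 2))
        return x_hat, sum(losses)

# usage on a batch of pre-computed CLIP image embeddings (shapes illustrative)
sae = MatryoshkaSAE()
acts = torch.randn(16, 768)  # e.g. CLIP ViT-L/14 image embeddings
x_hat, loss = sae(acts)
loss.backward()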

Use this if you need to extract and analyze understandable concepts from complex vision-language models to improve their interpretability and control.

Not ideal if you are primarily interested in general-purpose model training or fine-tuning without a specific focus on interpretability.

AI-interpretability · vision-language-models · model-analysis · AI-explainability · concept-extraction
No package · No dependents
Maintenance: 10 / 25
Adoption: 6 / 25
Maturity: 15 / 25
Community: 16 / 25

Stars: 22
Forks: 6
Language: Jupyter Notebook
License: MIT
Last pushed: Jan 17, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/WolodjaZ/MSAE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
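
For reference, a minimal Python equivalent of the curl call above, assuming the endpoint returns JSON (the response field names are unverified):

# Fetch the same quality data in Python using only the standard library.
import json
import urllib.request

url = "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/WolodjaZ/MSAE"
with urllib.request.urlopen(url) as resp:  # no API key needed up to 100 requests/day
    data = json.load(resp)
print(json.dumps(data, indent=2))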