catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers (VIS 2023)
This tool helps machine learning researchers understand how transformer models process information. It takes an existing language or vision transformer and its outputs, then generates interactive visualizations of how different parts of the input (such as words in a sentence or regions in an image) relate to each other inside the model. It is aimed at researchers who build or analyze transformer models and need deeper insight into their internal mechanisms.
162 stars. No commits in the last 6 months.
Use this if you are a machine learning researcher working with transformer models and need to visualize and interpret their internal 'attention' mechanisms to improve model understanding and debug performance.
Not ideal if you are looking for a tool to train models, visualize standard performance metrics, or analyze models other than transformers.
Stars
162
Forks
22
Language
HTML
License
MIT
Category
Last pushed
May 05, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/catherinesyeh/attention-viz"
Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000 requests/day.
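Equivalently, the endpoint can be called from Python instead of curl. A minimal sketch using only the standard library, assuming the endpoint returns JSON; the `Authorization: Bearer` header name for the optional API key is an assumption, not documented on this page:

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"

def build_request(owner: str, repo: str, api_key: str = None) -> urllib.request.Request:
    """Build a GET request for a repo's quality data."""
    url = f"{BASE}/{owner}/{repo}"
    headers = {"Accept": "application/json"}
    if api_key:
        # Assumed header scheme; check the API docs for the real one.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(url, headers=headers)

def fetch_quality(owner: str, repo: str, api_key: str = None) -> dict:
    """Fetch and decode the JSON response (counts against the daily quota)."""
    with urllib.request.urlopen(build_request(owner, repo, api_key)) as resp:
        return json.load(resp)
```

For example, `fetch_quality("catherinesyeh", "attention-viz")` hits the same URL as the curl command above.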
Higher-rated alternatives
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
liangyuwang/Tiny-DeepSpeed
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
microsoft/Text2Grad
🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...
huangjia2019/llm-gpt
From classic NLP to modern LLMs: building language models step by step. Epubit book: "GPT Illustrated: How Large Models Are Built" ...
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its...