millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective
Code: Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data (ICML 2025)
This project helps researchers and developers understand how Large Language Models (LLMs) process graph-structured information such as social networks or molecular structures. It feeds graph-structured data to LLMs under different attention mechanisms to show how the models interpret and learn from these connections. The primary users are AI/ML researchers and practitioners working on advanced LLM applications, especially those involving complex relational data.
No commits in the last 6 months.
Use this if you are a machine learning researcher or engineer interested in the underlying mechanisms of how LLMs handle graph-structured data and want to experiment with different attention strategies.
Not ideal if you are looking for a plug-and-play solution for a specific graph-based prediction task or if you do not have a strong background in machine learning and LLM architectures.
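To make the idea concrete: the project studies how attention interacts with graph structure. The sketch below is not from this repository's code; it is a generic, minimal NumPy illustration of one common scheme, scaled dot-product self-attention masked by a graph's adjacency matrix, with all names and the masking choice being illustrative assumptions.

```python
import numpy as np

def graph_masked_attention(x, adj):
    """Scaled dot-product self-attention where each node may only
    attend to its graph neighbours (and itself).

    x   : (n, d) array of node features
    adj : (n, n) 0/1 adjacency matrix (1 = edge present)
    """
    d = x.shape[1]
    scores = x @ x.T / np.sqrt(d)                # raw attention logits
    mask = adj + np.eye(adj.shape[0])            # allow self-attention
    scores = np.where(mask > 0, scores, -1e9)    # block non-neighbours
    # numerically stable row-wise softmax
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ x, weights
```

On a three-node path graph (edges 0-1 and 1-2), node 0's attention weight on node 2 is driven to zero by the mask, while each row of weights still sums to one.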
Stars: 12
Forks: 2
Language: Jupyter Notebook
License: —
Category: —
Last pushed: May 09, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
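The curl call above can also be wrapped in a few lines of Python. The endpoint path is taken from the example; the response schema is not documented here, so this sketch only builds the URL and returns whatever JSON the API sends back.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"

def quality_url(owner, repo):
    """Build the API URL for a given GitHub owner/repo pair."""
    return f"{BASE}/{owner}/{repo}"

def fetch_quality(owner, repo):
    """Fetch the quality record as parsed JSON.

    Field names in the result depend on the API's actual response
    schema, which is not documented on this page.
    """
    with urllib.request.urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)
```

For this repository, `quality_url("millioniron", "LLM_exploration_Graph-Attention-Mechanisms-Perspective")` reproduces the URL from the curl example.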
Higher-rated alternatives
- jncraton/languagemodels: Explore large language models in 512MB of RAM
- microsoft/unilm: Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
- haizelabs/verdict: Inference-time scaling for LLMs-as-a-judge
- albertan017/LLM4Decompile: Reverse Engineering: Decompiling Binary Code with Large Language Models
- bytedance/Sa2VA: Official Repo For Pixel-LLM Codebase