mcbal/deep-implicit-attention

Implementation of deep implicit attention in PyTorch

33
/ 100
Emerging

This project offers an experimental implementation of 'deep implicit attention' using PyTorch, connecting the transformer architecture to mean-field theory and statistical physics. It allows researchers to explore how attention mechanisms in neural networks can be understood as solving a set of self-consistent equations, much like systems in physics. Researchers in machine learning and theoretical neuroscience can use this to gain deeper insights into how modern AI models process information.

No commits in the last 6 months.

Use this if you are a machine learning researcher or theoretician interested in the underlying mathematical and physical principles of transformer attention mechanisms.

Not ideal if you are looking for a plug-and-play solution for building or training standard transformer models for practical applications.

neural-network-theory statistical-physics attention-mechanisms deep-learning-research mean-field-theory
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

65

Forks

5

Language

Python

License

MIT

Last pushed

Aug 02, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/mcbal/deep-implicit-attention"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.