Nemesis-12/multihead-latent-attention

Implementation of Multi-head Latent Attention (MLA) from DeepSeek-V2

/ 100

Experimental

No Package No Dependents

Maintenance 6 / 25

Adoption 0 / 25

Maturity 11 / 25

Community 0 / 25

Stars

—

Forks

—

Language

Python

License

MIT

Category

Last pushed

Nov 22, 2025

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Nemesis-12/multihead-latent-attention"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

bhavnicksm/vanilla-transformer-jax

JAX/Flax implimentation of 'Attention Is All You Need' by Vaswani et al....

kyegomez/SparseAttention

Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with...

AbdelStark/attnres

Rust implementation of Attention Residuals from MoonshotAI/Kimi