brandokoch/attention-is-all-you-need-paper
Original Transformer paper: implementation of Vaswani, Ashish, et al. "Attention Is All You Need." Advances in Neural Information Processing Systems. 2017.
This project provides a clear, runnable implementation of the original Transformer architecture. It lets machine learning researchers and students train the model on text data and then use it for translation, for example from English to German. This makes it well suited for learning about or experimenting with foundational natural language processing models.
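The core mechanism the paper introduces is scaled dot-product attention, softmax(QKᵀ/√d_k)·V. A minimal NumPy sketch of that formula follows; it is an illustration of the technique, not code taken from this repository:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V as in the paper."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (seq_q, seq_k) similarities
    scores -= scores.max(axis=-1, keepdims=True)  # stabilize the exponentials
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the keys
    return weights @ V, weights

# Toy example: 3 query positions attending over 4 key/value positions, d_k = 8.
rng = np.random.default_rng(0)
Q = rng.standard_normal((3, 8))
K = rng.standard_normal((4, 8))
V = rng.standard_normal((4, 8))
out, w = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 8); each row of w sums to 1
```

The full Transformer stacks this into multi-head attention with learned projections, plus positional encodings and feed-forward layers, which is what the notebook in this repo walks through.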
243 stars. No commits in the last 6 months.
Use this if you are a machine learning researcher or student who wants to understand and experiment with the core Transformer architecture from the 'Attention Is All You Need' paper.
Not ideal if you need a production-ready, highly optimized machine translation system, or if you're not familiar with machine learning development concepts.
Stars
243
Forks
54
Language
Jupyter Notebook
License
MIT
Category
ml-frameworks
Last pushed
Apr 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/brandokoch/attention-is-all-you-need-paper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
philipperemy/keras-attention
Keras Attention Layer (Luong and Bahdanau scores).
tatp22/linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
ematvey/hierarchical-attention-networks
Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is...
datalogue/keras-attention
Visualizing RNNs using the attention mechanism
thushv89/attention_keras
Keras Layer implementation of Attention for Sequential models