soskek/attention_is_all_you_need
A Chainer implementation of the Transformer from "Attention Is All You Need" (Vaswani et al., 2017).
This project lets machine learning engineers and researchers train the Transformer model on sequence-to-sequence tasks. Given pairs of text in different languages, it produces a trained model that can translate or transform new text sequences. It is aimed at anyone experimenting with, or applying, the Transformer architecture to problems such as machine translation or text summarization.
323 stars. No commits in the last 6 months.
Use this if you need a Python implementation of the Transformer built on Chainer for sequence-to-sequence problems, particularly if you want to experiment with different training strategies or model configurations.
Not ideal if you need a production-ready, highly optimized translation system or if you are not comfortable working with machine learning model implementations and datasets directly.
Stars
323
Forks
66
Language
Jupyter Notebook
License
BSD-3-Clause
Category
Last pushed
Oct 03, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/soskek/attention_is_all_you_need"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
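For scripted access, the same endpoint can be queried from Python. This is a minimal sketch based only on the curl example above; the shape of the JSON response is not documented here, so the sketch just decodes and prints the raw body rather than assuming any field names.

```python
# Sketch: query the pt-edge quality API for a repository.
# The endpoint path (category/owner/repo) is taken from the curl example;
# the response schema is an assumption, so we only decode raw JSON.
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the per-repository quality endpoint URL."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """GET the endpoint and decode the JSON body (keyless tier: 100 req/day)."""
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    url = quality_url("ml-frameworks", "soskek", "attention_is_all_you_need")
    print(url)
    # Uncomment to hit the live API (counts against the daily quota):
    # print(fetch_quality("ml-frameworks", "soskek", "attention_is_all_you_need"))
```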
Higher-rated alternatives
philipperemy/keras-attention
Keras Attention Layer (Luong and Bahdanau scores).
tatp22/linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
ematvey/hierarchical-attention-networks
Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is...
datalogue/keras-attention
Visualizing RNNs using the attention mechanism
thushv89/attention_keras
Keras Layer implementation of Attention for Sequential models