Rishit-dagli/Fast-Transformer
An implementation of Additive Attention
A developer tool providing a TensorFlow implementation of Fastformer, a Transformer variant whose additive attention summarizes a sequence into global vectors instead of comparing every pair of tokens, so attention cost grows linearly rather than quadratically with sequence length. It takes token sequences as input and outputs contextualized representations, making long-text modeling faster and more tractable. Aimed at machine learning engineers and researchers working on natural language processing tasks.
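To make the efficiency claim concrete, here is a simplified single-head sketch of the additive-attention idea from the Fastformer paper, in TensorFlow. This is an illustration only, not the repository's code: the actual implementation adds learned projections, multiple heads, and an output transform.

import tensorflow as tf

def additive_pool(x, w):
    # x: (batch, seq, dim); w: (dim,) learned scoring vector.
    # One score per position, so the work is O(seq) rather than the
    # O(seq^2) of pairwise query-key attention.
    d = tf.cast(tf.shape(x)[-1], x.dtype)
    alpha = tf.nn.softmax(tf.einsum('bnd,d->bn', x, w) / tf.sqrt(d), axis=-1)
    return tf.einsum('bn,bnd->bd', alpha, x)  # global summary vector (batch, dim)

# Toy mixing step with random tensors standing in for projected inputs.
q = tf.random.normal((2, 8, 16))  # queries
k = tf.random.normal((2, 8, 16))  # keys
v = tf.random.normal((2, 8, 16))  # values
w_q, w_k = tf.random.normal((16,)), tf.random.normal((16,))

global_q = additive_pool(q, w_q)      # whole sequence pooled into one query vector
p = k * global_q[:, None, :]          # keys modulated by the global query
global_k = additive_pool(p, w_k)      # pooled again into one key vector
out = v * global_k[:, None, :] + q    # values modulated by the global key, residual to queries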
148 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are a machine learning engineer or researcher building TensorFlow models that need to process very long text sequences efficiently.
Not ideal if you want an out-of-the-box text-analysis solution, or if you are not comfortable working with TensorFlow and deep learning model implementations.
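A minimal usage sketch, adapted from the example in the repository's README; the constructor arguments shown (num_tokens, dim, depth, max_seq_len, absolute_pos_emb, mask) follow that example and may have changed since, so check the repository for the current API:

import tensorflow as tf
from fast_transformer import FastTransformer  # pip install fast-transformer

mask = tf.ones([1, 4096], dtype=tf.bool)  # marks valid (non-padding) positions

model = FastTransformer(
    num_tokens=20000,       # vocabulary size
    dim=512,                # model width
    depth=2,                # number of Fastformer blocks
    max_seq_len=4096,
    absolute_pos_emb=True,  # use absolute positional embeddings
    mask=mask,
)

x = tf.experimental.numpy.random.randint(0, 20000, (1, 4096))  # dummy token IDs
logits = model(x)  # (1, 4096, 20000): per-token vocabulary logits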
Stars: 148
Forks: 22
Language: Jupyter Notebook
License: Apache-2.0
Category: Transformers
Last pushed: Feb 15, 2022
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Rishit-dagli/Fast-Transformer"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
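The same data can be fetched from code; a minimal sketch using Python's requests library (the response schema is not documented on this page, so the example only prints the returned JSON):

import requests

url = "https://pt-edge.onrender.com/api/v1/quality/transformers/Rishit-dagli/Fast-Transformer"
resp = requests.get(url, timeout=10)
resp.raise_for_status()  # surfaces HTTP errors, e.g. after exceeding the daily limit
print(resp.json())       # inspect the payload before relying on specific fields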
Related models
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features...
kanishkamisra/minicons
Utility for behavioral and representational analyses of Language Models
lucidrains/simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
lucidrains/dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
Nicolepcx/Transformers-in-Action
This is the corresponding code for the book Transformers in Action