Vadimbuildercxx/looped_transformer

Experimental implementation of "Looped Transformers are Better at Learning Learning Algorithms" showing superior performance with 12x fewer parameters. Includes complete environment setup, pre-trained weights, and extensive experiments comparing Looped TFs vs traditional Transformers for In-Context Learning.

/ 100

Experimental

This project helps machine learning researchers understand and apply 'Looped Transformers,' a new neural network architecture. It takes data for in-context learning tasks, specifically linear regression examples, and outputs insights into the performance and efficiency of Looped Transformers compared to traditional Transformers. The ideal user is a machine learning researcher or practitioner exploring advanced transformer architectures for improved learning algorithms.

No commits in the last 6 months.

Use this if you are a machine learning researcher interested in the theoretical and practical benefits of Looped Transformers for in-context learning tasks.

Not ideal if you are looking for a ready-to-use production model or a general-purpose transformer for tasks outside of algorithm learning research.

machine-learning-research neural-network-architecture in-context-learning deep-learning-efficiency algorithm-learning

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

lucidrains/x-transformers

A concise but complete full-attention transformer with a set of promising experimental features...

kanishkamisra/minicons

Utility for behavioral and representational analyses of Language Models

lucidrains/simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT

lucidrains/dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

Nicolepcx/Transformers-in-Action

This is the corresponding code for the book Transformers in Action

Explore Transformer Models

All categories Trending Transformer directory Insights