kyegomez/SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta
This project helps machine learning engineers and researchers expand the context window of large language models (LLMs) without needing to retrain them. It takes standard query, key, and value tensors along with positional indices, and outputs an attention tensor that effectively handles longer sequences. This allows LLMs to process and generate much longer texts or code.
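The core trick from the paper is a remapping of relative positions: distances inside a small "neighbor window" are kept exact, while longer distances are floor-divided into groups so that positions beyond the trained context still fall inside the range the model has seen. A minimal sketch of that mapping (function name and default values are illustrative, not the library's API):

```python
def self_extend_rel_pos(distance, group_size=4, neighbor_window=8):
    """Sketch of SelfExtend-style grouped relative positions.

    Distances <= neighbor_window are kept exact; longer distances
    are floor-divided by group_size, shifted so the mapping stays
    monotone where the two regimes meet.
    """
    if distance <= neighbor_window:
        return distance
    # Shift keeps grouped positions contiguous with the exact ones.
    return distance // group_size + neighbor_window - neighbor_window // group_size
```

For example, with the defaults above, a distance of 100 maps to `100 // 4 + 8 - 2 = 31`, well inside a short trained context, while distances of 8 or less pass through unchanged.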
No commits in the last 6 months. Available on PyPI.
Use this if you are working with large language models and need to process very long inputs or generate extended outputs, but want to avoid the computational cost and time of fine-tuning the entire model.
Not ideal if you need to fundamentally change the underlying architecture of your LLM, or if your primary goal is to significantly reduce inference latency for short sequences.
Stars
13
Forks
—
Language
Python
License
MIT
Category
Last pushed
Nov 11, 2024
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kyegomez/SelfExtend"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Goekdeniz-Guelmez/mlx-lm-lora
Train Large Language Models on MLX.
uber-research/PPLM
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
VHellendoorn/Code-LMs
Guide to using pre-trained large language models of source code
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
jarobyte91/pytorch_beam_search
A lightweight implementation of Beam Search for sequence models in PyTorch.