kyegomez/SelfExtend

Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta

Quality score: 30 / 100 (Emerging)

This project helps machine learning engineers and researchers expand the context window of large language models (LLMs) without needing to retrain them. It takes standard query, key, and value tensors along with positional indices, and outputs an attention tensor that effectively handles longer sequences. This allows LLMs to process and generate much longer texts or code.
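
As a rough illustration of that interface, here is a minimal sketch. The function name self_extend, its signature, and the grouped_positions helper are hypothetical, inferred only from the description above; the floor-division mapping follows the SelfExtend paper's idea of compressing distant relative positions by a group size so they stay within the model's trained range.

import torch

def grouped_positions(pos, group_size=4, neighbor_window=512):
    # SelfExtend's core trick (from the paper): keep exact positions inside
    # a local neighbor window; map farther positions onto a coarser grid via
    # floor division, shifted so the mapping is continuous at the boundary.
    far = pos // group_size + neighbor_window - neighbor_window // group_size
    return torch.where(pos < neighbor_window, pos, far)

seq_len, dim = 8192, 64
q = torch.randn(1, seq_len, dim)  # query tensor
k = torch.randn(1, seq_len, dim)  # key tensor
v = torch.randn(1, seq_len, dim)  # value tensor
pos = torch.arange(seq_len)       # positional indices

# attn = self_extend(q, k, v, grouped_positions(pos))  # hypothetical call

With group_size=4 and a 512-token neighbor window, position 8191 maps to 8191 // 4 + 512 - 128 = 2431, comfortably inside a typical 4k pretrained context, which is why no retraining is needed.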

No commits in the last 6 months. Available on PyPI.

Use this if you are working with large language models and need to process very long inputs or generate extended outputs, but want to avoid the computational cost and time of fine-tuning the entire model.

Not ideal if you need to fundamentally change the underlying architecture of your LLM, or if your primary goal is to significantly reduce inference latency for short sequences.

large-language-models natural-language-processing machine-learning-engineering deep-learning text-generation
Stale (6m)
Maintenance: 0 / 25
Adoption: 5 / 25
Maturity: 25 / 25
Community: 0 / 25

Stars: 13
Forks:
Language: Python
License: MIT
Last pushed: Nov 11, 2024
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kyegomez/SelfExtend"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
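
The same data can be fetched from Python; a minimal sketch using the requests library, with the endpoint URL taken from the curl command above and the response assumed (not confirmed here) to be JSON:

import requests

url = "https://pt-edge.onrender.com/api/v1/quality/transformers/kyegomez/SelfExtend"
resp = requests.get(url, timeout=10)  # no key needed within the free tier
resp.raise_for_status()               # surface HTTP errors (e.g. rate limiting)
print(resp.json())                    # assumed JSON payload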