VITA-Group/Ms-PoE

"Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiwei Liu, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, Zhangyang Wang.

/ 100

Emerging

This project helps improve how Large Language Models (LLMs) find and use important details when they're given a very long piece of text. It takes an existing LLM and enhances its ability to pinpoint relevant information, especially if that information is buried in the middle of a long document or conversation. Researchers and AI engineers working with LLMs for complex tasks like summarization or long-form Q&A would find this useful.

No commits in the last 6 months.

Use this if your Large Language Models struggle to accurately extract or act upon key information located in the middle of extremely long text inputs.

Not ideal if you are not working with Large Language Models or if your primary concern is not long-context understanding.

Large Language Models NLP research long-context understanding information retrieval AI model improvement

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

ZHZisZZ/dllm

dLLM: Simple Diffusion Language Modeling

pengzhangzhi/Open-dLLM

Open diffusion language model for code generation — releasing pretraining, evaluation,...

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM...

THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

AIoT-MLSys-Lab/SVD-LLM

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

Explore Transformer Models

All categories Trending Transformer directory Insights