Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
This project offers a way to enhance large language models (LLMs) so they can better handle very long texts and recall relevant information from them. It takes a base LLM and long textual data, producing a more capable LLM that can maintain context and consistency over extended interactions. AI/ML researchers and practitioners working on advanced language understanding applications would find this useful.
822 stars. No commits in the last 6 months.
Use this if you are developing or fine-tuning large language models and need them to process, understand, and generate content effectively over extremely long documents or conversational histories.
Not ideal if you are looking for an off-the-shelf application to use directly without deep technical understanding of language model development.
Stars
822
Forks
74
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Victorwz/LongMem"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jncraton/languagemodels
Explore large language models in 512MB of RAM
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haizelabs/verdict
Inference-time scaling for LLMs-as-a-judge.
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
bytedance/Sa2VA
Official Repo For Pixel-LLM Codebase