THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
If you're building or fine-tuning large language models (LLMs) and need them to handle very long texts, LongAlign provides a complete toolkit. It improves an LLM's ability to understand and accurately answer queries grounded in documents tens of thousands of words long. It is aimed at AI engineers and researchers developing LLMs for real-world applications.
259 stars. No commits in the last 6 months.
Use this if you are developing or fine-tuning an LLM and want to extend its ability to process and understand very long documents and conversations, up to 100,000 tokens.
Not ideal if you are looking for a pre-trained, ready-to-use LLM without needing to train or fine-tune models yourself.
Stars: 259
Forks: 21
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 16, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/THUDM/LongAlign"
Open to everyone: 100 requests/day with no key required; a free key raises the limit to 1,000/day.
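For scripted access, the same endpoint shown in the curl command above can be called from Python's standard library. This is a minimal sketch: the URL path is taken from the example above, but the `quality_url`/`fetch_quality` helper names and the shape of the JSON response are assumptions, not documented API details.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example above; the rest is illustrative.
BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"

def quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for a given GitHub repo."""
    return f"{BASE}/{owner}/{repo}"

def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch quality data for a repo.

    The response is assumed to be a JSON object; its exact fields
    are not documented on this page.
    """
    with urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    # No API key is needed for the free tier (100 requests/day).
    print(quality_url("THUDM", "LongAlign"))
```

The keyless call is subject to the 100 requests/day limit noted above; with a free key, the limit rises to 1,000/day.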
Higher-rated alternatives
steering-vectors/steering-vectors
Steering vectors for transformer language models in Pytorch / Huggingface
jianghoucheng/AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
boyiwei/alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
jianghoucheng/AnyEdit
AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025