OpenMOSS/LongLLaDA

[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

/ 100

Emerging

This project helps AI researchers and practitioners evaluate how well large language models (LLMs) can handle very long texts. It takes different LLM architectures and various long-context benchmarks (like "Needle-In-A-Haystack" or RULER) as input, and outputs performance metrics and comparisons related to an LLM's ability to understand or retrieve information from extensive documents. It's designed for those who develop, benchmark, or work with advanced language models, particularly focusing on diffusion LLMs.

Use this if you are researching or developing large language models and need to rigorously test their performance with exceptionally long input contexts.

Not ideal if you are an end-user looking for a pre-built application or a simple API to process long documents without needing to benchmark LLM architectures.

Large Language Models AI Research LLM Benchmarking Context Window Extension Natural Language Processing

No License No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 7 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

ZHZisZZ/dllm

dLLM: Simple Diffusion Language Modeling

pengzhangzhi/Open-dLLM

Open diffusion language model for code generation — releasing pretraining, evaluation,...

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM...

THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

AIoT-MLSys-Lab/SVD-LLM

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

Explore Transformer Models

All categories Trending Transformer directory Insights