Diffusion Language Models LLM Tools

Tools and techniques for training, optimizing, and decoding diffusion-based language models. Includes memory enhancement, length extrapolation, constrained decoding, and inference acceleration for diffusion LLMs. Does NOT include standard autoregressive LLMs, general diffusion models for image generation, or non-diffusion-based language model architectures.

There are 8 diffusion language models tools tracked. The highest-rated is zhuhanqing/APOLLO at 42/100 with 271 stars.

Get all 8 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=diffusion-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 zhuhanqing/APOLLO

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper...

42
Emerging
2 zhenye234/xcodec

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec...

42
Emerging
3 HITESHLPATEL/Mamba-Papers

Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogs

30
Emerging
4 Y-Research-SBU/CSRv2

Official Repository for CSRv2 - ICLR 2026

30
Emerging
5 psychofict/llm-effective-context-length

Investigating Why the Effective Context Length of LLMs Falls Short (Based on...

21
Experimental
6 hrlics/CoPE

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

18
Experimental
7 rishikksh20/mamba3-pytorch

Readable implementation of Mamba 3 SSM model

18
Experimental
8 Ghost---Shadow/diff-bleu

A fully vectorized PyTorch implementation of BLEU scores optimized for...

13
Experimental