Diffusion Language Models LLM Tools

Tools and techniques for training, optimizing, and decoding diffusion-based language models. Includes memory enhancement, length extrapolation, constrained decoding, and inference acceleration for diffusion LLMs. Does NOT include standard autoregressive LLMs, general diffusion models for image generation, or non-diffusion-based language model architectures.

There are 8 diffusion language models tools tracked. The highest-rated is zhuhanqing/APOLLO at 42/100 with 271 stars.

Get all 8 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=diffusion-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	zhuhanqing/APOLLO APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper...	42	Emerging	271	Python
2	zhenye234/xcodec AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec...	42	Emerging	294	Python
3	HITESHLPATEL/Mamba-Papers Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogs	30	Emerging	26	—
4	Y-Research-SBU/CSRv2 Official Repository for CSRv2 - ICLR 2026	30	Emerging	10	Python
5	psychofict/llm-effective-context-length Investigating Why the Effective Context Length of LLMs Falls Short (Based on...	21	Experimental	—	Python
6	hrlics/CoPE CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs	18	Experimental	10	Python
7	rishikksh20/mamba3-pytorch Readable implementation of Mamba 3 SSM model	18	Experimental	7	Python
8	Ghost---Shadow/diff-bleu A fully vectorized PyTorch implementation of BLEU scores optimized for...	13	Experimental	—	—