Diffusion Language Models LLM Tools
Tools and techniques for training, optimizing, and decoding diffusion-based language models. Includes memory enhancement, length extrapolation, constrained decoding, and inference acceleration for diffusion LLMs. Does NOT include standard autoregressive LLMs, general diffusion models for image generation, or non-diffusion-based language model architectures.
There are 8 diffusion language models tools tracked. The highest-rated is zhuhanqing/APOLLO at 42/100 with 271 stars.
Get all 8 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=diffusion-language-models&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
zhuhanqing/APOLLO
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper... |
|
Emerging |
| 2 |
zhenye234/xcodec
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec... |
|
Emerging |
| 3 |
HITESHLPATEL/Mamba-Papers
Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogs |
|
Emerging |
| 4 |
Y-Research-SBU/CSRv2
Official Repository for CSRv2 - ICLR 2026 |
|
Emerging |
| 5 |
psychofict/llm-effective-context-length
Investigating Why the Effective Context Length of LLMs Falls Short (Based on... |
|
Experimental |
| 6 |
hrlics/CoPE
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs |
|
Experimental |
| 7 |
rishikksh20/mamba3-pytorch
Readable implementation of Mamba 3 SSM model |
|
Experimental |
| 8 |
Ghost---Shadow/diff-bleu
A fully vectorized PyTorch implementation of BLEU scores optimized for... |
|
Experimental |