ZHZisZZ/dllm
dLLM: Simple Diffusion Language Modeling
This project is for AI researchers and practitioners working on language modeling. It provides a toolkit for building, training, and evaluating diffusion-based language models, which generate text by iteratively refining a whole sequence rather than predicting one token at a time from left to right. You can take existing pretrained models, such as the autoregressive GPT-2 or the masked-language-model BERT, and adapt them to the diffusion framework, producing trained models ready for text generation and evaluation.
Use this if you are developing or researching new text generation models and want to experiment with diffusion language models, or convert existing autoregressive models into diffusion models.
Not ideal if you are a non-technical user looking for a ready-to-use chatbot or a simple tool for everyday text generation tasks.
Stars: 2,193
Forks: 206
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 27, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ZHZisZZ/dllm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
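The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON, which the page does not state, and it assumes the `/<owner>/<repo>` suffix generalizes to other repositories. The helper names `quality_url` and `fetch_quality` are hypothetical and are not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside it.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# print(json.dumps(fetch_quality("ZHZisZZ", "dllm"), indent=2))
```

With the keyless limit of 100 requests/day, it may be worth caching responses locally rather than refetching on every run.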
Related models
pengzhangzhi/Open-dLLM
Open diffusion language model for code generation — releasing pretraining, evaluation,...
EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM...
THUDM/LongWriter
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
AIoT-MLSys-Lab/SVD-LLM
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2
datamllab/LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning