ZHZisZZ/dllm

dLLM: Simple Diffusion Language Modeling

/ 100

Established

This project is for AI researchers and practitioners focused on advanced language modeling. It provides a toolkit for building, training, and evaluating diffusion-based language models, which generate text differently from traditional models. You can input existing autoregressive models like GPT-2 or BERT and adapt them to this new diffusion framework, ultimately outputting trained models ready for text generation and evaluation.

2,193 stars.

Use this if you are developing or researching new text generation models and want to experiment with diffusion language models, or convert existing autoregressive models into diffusion models.

Not ideal if you are a non-technical user looking for a ready-to-use chatbot or a simple tool for everyday text generation tasks.

AI research natural-language-generation machine-learning-engineering diffusion-models large-language-models

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 20 / 25

How are scores calculated?

Stars

2,193

Forks

206

Language

Python

License

Apache-2.0

Compare

dllm and Open-dLLM

Related models

pengzhangzhi/Open-dLLM

Open diffusion language model for code generation — releasing pretraining, evaluation,...

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM...

THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

AIoT-MLSys-Lab/SVD-LLM

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

datamllab/LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Explore Transformer Models

All categories Trending Transformer directory Insights