yjyddq/EOSER-ASS-RL

Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step"

Score: 36 / 100 (Emerging)

This project offers novel techniques to enhance Masked Diffusion Language Models (MDLMs). It introduces methods like EOS Early Rejection (EOSER) decoding and an Ascending Step-Size (ASS) scheduler, along with Consistency Trajectory Group Relative Policy Optimization (CJ-GRPO). These innovations aim to make MDLMs more efficient, allowing them to achieve competitive performance with fewer decoding steps. Developers and researchers working with diffusion models for natural language generation will find this useful for improving model speed and output quality.
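The description above only names the decoding-side techniques. As a rough illustration of the ideas, here is a minimal sketch; the function names, the linear step-size schedule, and the half-horizon EOS cutoff are all assumptions for illustration, not the repository's actual implementation.

```python
def ascending_step_sizes(total_tokens: int, num_steps: int) -> list[int]:
    """Ascending Step-Size (ASS) scheduler, assumed form: split
    `total_tokens` unmasking operations across `num_steps` decoding
    steps so that later steps unmask more tokens than earlier ones
    (here, a roughly linear ascending schedule)."""
    # Linear weights 1, 2, ..., num_steps, scaled to sum to total_tokens.
    weights = list(range(1, num_steps + 1))
    scale = total_tokens / sum(weights)
    sizes = [max(1, round(w * scale)) for w in weights]
    # Absorb rounding drift into the last (largest) step.
    sizes[-1] += total_tokens - sum(sizes)
    return sizes


def suppress_eos(logits: list[float], step: int, num_steps: int,
                 eos_id: int, frac: float = 0.5) -> list[float]:
    """EOS Early Rejection (EOSER), assumed form: mask out the EOS
    logit during the first `frac` fraction of decoding steps so the
    model cannot commit to ending the sequence prematurely."""
    if step < frac * num_steps:
        out = logits.copy()
        out[eos_id] = float("-inf")
        return out
    return logits
```

With this schedule, a 32-token generation in 4 steps would unmask 3, 6, 10, and 13 tokens per step, concentrating most unmasking in the late, better-conditioned steps.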

Use this if you are a machine learning researcher or developer working with Masked Diffusion Language Models and need to reduce the number of decoding steps while maintaining or improving performance.

Not ideal if you are looking for a pre-trained model or an application-ready tool, as this project focuses on foundational algorithmic improvements for MDLMs.

natural-language-generation machine-learning-research diffusion-models language-model-optimization deep-learning-algorithms
No package published · no dependents

Score breakdown:
Maintenance: 10 / 25
Adoption: 7 / 25
Maturity: 15 / 25
Community: 4 / 25


Stars: 27
Forks: 1
Language: Python
License: Apache-2.0
Last pushed: Mar 09, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/yjyddq/EOSER-ASS-RL"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
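The same endpoint can be called programmatically. A minimal sketch using only the standard library; it assumes the response is JSON, since the exact schema is not documented here.

```python
import json
from urllib.request import urlopen

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-API URL for a repository."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode a quality report (assumes a JSON body)."""
    with urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


# URL for the repository on this page:
url = quality_url("diffusion", "yjyddq", "EOSER-ASS-RL")
```

Calling `fetch_quality("diffusion", "yjyddq", "EOSER-ASS-RL")` should return the same data as the `curl` command above.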