JarvisPei/MemDLM
MemDLM: Memory-enhanced Diffusion Language Model
This project helps AI researchers and machine learning engineers fine-tune and evaluate Diffusion Language Models (DLMs) for better understanding of long-context text. You provide existing DLM models and text datasets, and the system outputs enhanced models capable of more accurate long-context comprehension. It's designed for those working directly with advanced language model development and research.
Use this if you are an AI researcher or machine learning engineer developing or evaluating Diffusion Language Models and need to improve their performance on tasks requiring understanding of long documents or conversations.
Not ideal if you are looking for a plug-and-play solution for general text generation or summarization without deep expertise in language model architecture and training.
Stars
9
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JarvisPei/MemDLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ZHZisZZ/dllm
dLLM: Simple Diffusion Language Modeling
pengzhangzhi/Open-dLLM
Open diffusion language model for code generation — releasing pretraining, evaluation,...
EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM...
THUDM/LongWriter
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
AIoT-MLSys-Lab/SVD-LLM
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2