chikap421/catlvdm

[ICLR 2026 - ReALM-GEN] This repository accompanies the paper "Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation".

Experimental

This project helps video creators, marketers, and researchers generate high-quality video from text descriptions, even when the input text or the underlying data is imperfect or noisy. It takes a text prompt as input and produces a short video clip, which is especially useful for professionals who need reliable generation from diverse or less-than-perfect source material.

No commits in the last 6 months.

Use this if you need to reliably generate creative video content from text prompts and expect those prompts to be ambiguous, incomplete, or noisy as a result of real-world data collection.

Not ideal if you mainly need hyper-realistic video from clean, unambiguous prompts and robustness is not a concern.

video-generation creative-content synthetic-media text-to-video robust-AI

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 5 / 25

Maturity 15 / 25

Community 0 / 25

Language: Python

License: MIT

Higher-rated alternatives

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...

nateraw/stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

TheDesignFounder/DreamLayer

Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.
