shim0114/T2V-Diffusion-Search
[NeurIPS 2025] Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search
This project improves the quality of text-to-video generation at inference time, making generated videos better match the input text prompt. It takes a text prompt and an existing video generation model, and produces a higher-quality video output. It is aimed at AI researchers and machine learning engineers focused on advancing text-to-video capabilities.
Use this if you are a researcher working with text-to-video generation models like Latte, CogVideoX, or Wan 2.1 and want to enhance the alignment between your input text and the resulting video.
Not ideal if you are looking for an end-user application to generate videos without deep technical setup, as this requires familiarity with machine learning environments and model configurations.
Stars
14
Forks
—
Language
Python
License
Apache-2.0
Category
—
Last pushed
Feb 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/shim0114/T2V-Diffusion-Search"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
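The same endpoint can be called from Python. Below is a minimal sketch using only the standard library; the response schema is not documented here, so the code simply decodes and prints whatever JSON the API returns (the helper names `quality_url` and `fetch_quality` are illustrative, not part of the API):

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/diffusion"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-API URL for a given GitHub repository."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload (the schema is an assumption)."""
    with urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Keyless access is rate-limited to 100 requests/day.
    data = fetch_quality("shim0114", "T2V-Diffusion-Search")
    print(json.dumps(data, indent=2))
```

The network call is kept behind `__main__` so the URL builder can be reused without triggering a request.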
Higher-rated alternatives
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
ModelTC/LightX2V
A lightweight image/video generation inference framework
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
PKU-YuanGroup/Helios
Helios: Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators