YangLing0818/VideoTetris

[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation

/ 100

Emerging

This project helps video creators, marketers, or educators generate custom video clips from text descriptions. You provide text prompts describing objects and their positions, and it outputs a video tailored to your specifications. This is useful for anyone needing to create visual content with precise control over element placement and changes over time.

240 stars. No commits in the last 6 months.

Use this if you need to generate short to long videos where you can explicitly control the position and appearance of multiple distinct elements within the frame, ensuring they follow your creative vision.

Not ideal if you need to generate videos where sub-objects do not collectively fill the entire frame, or if you prefer a simpler, less controlled text-to-video generation experience.

video-creation content-generation digital-storytelling marketing-assets educational-content

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

240

Forks

Language

Python

License

MIT

Higher-rated alternatives

zai-org/CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

zhaorw02/DeepMesh

[ICCV 2025] Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

YangLing0818/RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with...

thu-nics/FrameFusion

[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token...

Yushi-Hu/tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Explore Diffusion Models

All categories Trending Diffusion directory Insights