YangLing0818/VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
This project helps video creators, marketers, or educators generate custom video clips from text descriptions. You provide text prompts describing objects and their positions, and it outputs a video tailored to your specifications. This is useful for anyone needing to create visual content with precise control over element placement and changes over time.
240 stars. No commits in the last 6 months.
Use this if you need to generate short to long videos where you can explicitly control the position and appearance of multiple distinct elements within the frame, ensuring they follow your creative vision.
Not ideal if you need to generate videos where sub-objects do not collectively fill the entire frame, or if you prefer a simpler, less controlled text-to-video generation experience.
Stars
240
Forks
6
Language
Python
License
MIT
Category
Last pushed
Nov 04, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/YangLing0818/VideoTetris"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
zai-org/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
zhaorw02/DeepMesh
[ICCV 2025] Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with...
thu-nics/FrameFusion
[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token...
Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering