guyyariv/TempoTokens

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

/ 100

Emerging

This project helps creators generate realistic and diverse videos that are perfectly synchronized with an input audio track. You provide an audio file, and it creates a video where the visuals align both thematically and temporally with the sounds. This is ideal for content creators, animators, or anyone needing to visualize audio.

127 stars. No commits in the last 6 months.

Use this if you need to generate high-quality videos directly from an audio input, ensuring the visual content and its timing precisely match the sound.

Not ideal if you primarily need to generate videos from text descriptions without any audio synchronization requirements.

video-generation content-creation audio-visualization digital-art animation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

127

Forks

Language

Python

License

MIT

Higher-rated alternatives

open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄:...

jdh-algo/JoyVASA

Diffusion-based Portrait and Animal Animation

haidog-yaqub/EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

404-Repo/404-gen-blender-add-on

Blender add-on for 404-GEN 3D generator running on Bittensor

linzhiqiu/t2v_metrics

Evaluating text-to-image/video/3D models with VQAScore

Explore Generative AI Tools

All categories Trending Generative AI directory Insights