guyyariv/TempoTokens
This repo contains the official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation".
This project generates diverse, realistic videos that are aligned with an input audio track. You provide an audio file, and it creates a video whose visuals match the sounds both thematically and temporally. It suits content creators, animators, and anyone who needs to visualize audio.
127 stars. No commits in the last 6 months.
Use this if you need to generate high-quality videos directly from an audio input, ensuring the visual content and its timing precisely match the sound.
Not ideal if you primarily need to generate videos from text descriptions without any audio synchronization requirements.
Stars: 127
Forks: 15
Language: Python
License: MIT
Category:
Last pushed: Feb 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/guyyariv/TempoTokens"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
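The same request can be made from a script. Below is a minimal Python sketch using only the standard library. It assumes the endpoint follows the category/owner/repo path shape seen in the curl command above and that the response body is JSON; the response fields are not documented here, so the code does not rely on any of them. The names build_url and fetch_quality are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Hypothetical helper: the category/owner/repo layout is inferred
    from the single URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body, assuming it is JSON."""
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_url("generative-ai", "guyyariv", "TempoTokens"))
```

Calling fetch_quality("generative-ai", "guyyariv", "TempoTokens") performs the network request and counts against the daily request limit.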
Higher-rated alternatives
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄:...
jdh-algo/JoyVASA
Diffusion-based Portrait and Animal Animation
haidog-yaqub/EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
404-Repo/404-gen-blender-add-on
Blender add-on for 404-GEN 3D generator running on Bittensor
linzhiqiu/t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore