Tinglok/avstyle
Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)
This project helps manipulate an image's visual style to match a specific sound. You provide an image and an audio input, and it outputs a stylized image that visually aligns with the given sound. It's useful for artists, content creators, or researchers exploring the interplay between sound and visual perception.
No commits in the last 6 months.
Use this if you want to automatically change the aesthetic or 'feel' of a static image to evoke the qualities of a particular sound, like making an image look 'rainy' when paired with rain sounds.
Not ideal if you need precise, manual control over every stylistic detail or if your primary goal is general image editing unrelated to sound.
Stars
15
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jan 26, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Tinglok/avstyle"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jolibrain/joliGEN
Generative AI Image and Video Toolset with GANs and Diffusion for Real-World Applications
zhangmozhe/Deep-Exemplar-based-Video-Colorization
The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".
naver-ai/StyleKeeper
Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual...
un1tz3r0/finetunepixelartdiffusion
Fine tune a pixelart diffusion model with isometric dataset.
lixiaowen-xw/DiffuEraser
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness...