Tinglok/avstyle

Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)

/ 100

Emerging

This project helps manipulate an image's visual style to match a specific sound. You provide an image and an audio input, and it outputs a stylized image that visually aligns with the given sound. It's useful for artists, content creators, or researchers exploring the interplay between sound and visual perception.

No commits in the last 6 months.

Use this if you want to automatically change the aesthetic or 'feel' of a static image to evoke the qualities of a particular sound, like making an image look 'rainy' when paired with rain sounds.

Not ideal if you need precise, manual control over every stylistic detail or if your primary goal is general image editing unrelated to sound.

artistic-rendering content-creation visual-effects perceptual-studies multimedia-design

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

jolibrain/joliGEN

Generative AI Image and Video Toolset with GANs and Diffusion for Real-World Applications

zhangmozhe/Deep-Exemplar-based-Video-Colorization

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".

naver-ai/StyleKeeper

Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual...

un1tz3r0/finetunepixelartdiffusion

Fine tune a pixelart diffusion model with isometric dataset.

lixiaowen-xw/DiffuEraser

DiffuEraser is a diffusion model for video inpainting, which performs great content completeness...

Explore Diffusion Models

All categories Trending Diffusion directory Insights