taegyeong-lee/Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
This project helps creative professionals, marketers, and researchers visualize abstract soundscapes or specific audio events. By inputting any "in-the-wild" sound, such as ambient noise or distinct audio cues, it generates realistic images that visually represent the sound's characteristics. This is ideal for anyone needing to create visual content inspired directly by audio.
No commits in the last 6 months.
Use this if you need to generate unique visual representations or concept art directly from diverse sound files, beyond simple sound wave visualizations.
Not ideal if you are looking for a plug-and-play solution, as it currently requires some technical setup and configuration.
Stars
12
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Aug 24, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/taegyeong-lee/Generating-Realistic-Images-from-In-the-wild-Sounds"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jolibrain/joliGEN
Generative AI Image and Video Toolset with GANs and Diffusion for Real-World Applications
zhangmozhe/Deep-Exemplar-based-Video-Colorization
The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".
naver-ai/StyleKeeper
Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual...
un1tz3r0/finetunepixelartdiffusion
Fine tune a pixelart diffusion model with isometric dataset.
lixiaowen-xw/DiffuEraser
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness...