guyyariv/AudioToken

This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

34
/ 100
Emerging

This project helps content creators and researchers generate images directly from audio recordings. You provide an audio clip, and the system creates a corresponding image based on the sound. This is ideal for artists, marketers, or researchers exploring new ways to visualize soundscapes or create multimedia content without needing descriptive text.

No commits in the last 6 months.

Use this if you need to generate visual content from sound, such as creating album art from music, visualizing sound events for research, or producing unique imagery for marketing campaigns based on audio clips.

Not ideal if you need precise control over image details or require images that are not conceptually linked to audio.

audio-visualization content-creation sound-design multimedia-art research-imaging
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

88

Forks

6

Language

Python

License

MIT

Last pushed

Jun 18, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/guyyariv/AudioToken"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.