eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

/ 100

Emerging

Turn your written ideas into stunning visuals. This tool takes your text descriptions, optionally with an initial image or style guidelines, and generates detailed images or videos, similar to a 'deepdream' effect. It's perfect for artists, content creators, or anyone looking to visualize concepts or illustrate narratives without needing to draw or sketch.

789 stars. No commits in the last 6 months.

Use this if you want to generate high-resolution images or videos from text prompts, create massive detailed textures, or illustrate lyrics with smooth, animated transitions and a 3D look.

Not ideal if you need photorealistic images with precise control over object placement and lighting, or if you prefer a traditional, sketch-based creative workflow.

digital-art content-creation visual-storytelling generative-media music-visualization

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

789

Forks

104

Language

Python

License

MIT

Higher-rated alternatives

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

Explore Diffusion Models

All categories Trending Diffusion directory Insights