amaralibey/nanoCLIP
A lightweight Text-to-Image Retrieval model [Web App]
This helps you easily find specific images within your personal photo collection by using everyday language. You input your photo gallery and natural language search queries, and it outputs the photos that best match your description. This is perfect for anyone with a large personal photo archive who struggles to locate specific memories.
No commits in the last 6 months.
Use this if you want to quickly search through thousands of personal photos using natural language, like "my dog playing in the park" or "pictures from my trip to the mountains."
Not ideal if you need an enterprise-grade image management system with advanced features like facial recognition or complex metadata tagging.
Stars
29
Forks
5
Language
Python
License
MIT
Category
Last pushed
Dec 06, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/amaralibey/nanoCLIP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model