BUAADreamer/SPN4CIR

[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

Emerging

This project helps e-commerce and marketing professionals find specific images in a large catalog by combining a reference image with a text description of the desired change. You input a reference image and a phrase describing how it should be modified (e.g., "make it red"), and the system returns the target images that best match the modified result. It is designed for anyone who needs to search visual content using both visual and textual cues.

No commits in the last 6 months.

Use this if you need to quickly locate images that are visually similar to a given example but with specific textual modifications, like finding a different color of a dress shown in a picture.

Not ideal if you search only by keywords or need exact image matching, with no text-described modification of a reference image.
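
The reference-image-plus-text query described above can be illustrated with a minimal sketch. It is not the SPN4CIR method: it assumes the image and the text have already been encoded into the same embedding space, fuses them by simple addition, and ranks catalog items by cosine similarity. The function names (`compose_query`, `rank_catalog`) and the hand-made 3-dimensional embeddings are hypothetical stand-ins for real encoder outputs.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def compose_query(image_emb, text_emb):
    """Fuse a reference-image embedding with a modification-text embedding.

    Additive fusion is an assumption made for illustration only.
    """
    fused = [a + b for a, b in zip(normalize(image_emb), normalize(text_emb))]
    return normalize(fused)


def rank_catalog(query, catalog):
    """Return catalog item names sorted by cosine similarity to the query."""
    scores = {
        name: sum(q * c for q, c in zip(query, normalize(emb)))
        for name, emb in catalog.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy embeddings; dimensions stand for [dress-ness, blue-ness, red-ness].
reference_image = [1.0, 1.0, 0.0]     # a blue dress
modification_text = [0.0, -1.0, 1.0]  # "make it red"
catalog = {
    "red_dress": [1.0, 0.0, 1.0],
    "blue_dress": [1.0, 1.0, 0.0],
    "red_shoe": [0.0, 0.0, 1.0],
}

ranking = rank_catalog(compose_query(reference_image, modification_text), catalog)
print(ranking)  # ['red_dress', 'red_shoe', 'blue_dress']
```

In this toy setup the red dress ranks first because the text embedding cancels the "blue" component of the reference image and adds a "red" one. A real system would replace the hand-made vectors with learned encoder outputs and a trained fusion step.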

e-commerce, visual search, fashion retail, content management, product discovery

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 10 / 25

Language: Python

License: MIT

Higher-rated alternatives

OFA-Sys/Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Kaushalya/medclip

A multi-modal CLIP model trained on the medical dataset ROCO

kastalimohammed1965/CLIP-fine-tune-registers-gated

Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!

clip-italian/clip-italian

CLIP (Contrastive Language–Image Pre-training) for Italian

zer0int/CLIP-fine-tune-registers-gated

Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
