emerisly/EDIS
Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)
This project offers a unique collection of 1 million news-related web images, each with a detailed text description, specifically designed for testing and improving image search systems. It helps researchers evaluate how well their algorithms can find relevant images from vast datasets using text queries. The primary users are researchers and developers working on advanced image retrieval and multimodal search technologies.
No commits in the last 6 months.
Use this if you are developing or evaluating advanced image search algorithms, especially for news content, and need a challenging, large-scale dataset with rich entity information.
Not ideal if you are looking for an out-of-the-box image search application, or if your focus is on general object recognition rather than complex cross-modal retrieval from news articles.
Stars
26
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Dec 02, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/emerisly/EDIS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ClipsAI/clipsai
Clips AI is an open-source Python library that automatically converts long videos into clips.
ai-forever/ru-clip
CLIP implementation for Russian language
patrickjohncyh/fashion-clip
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
Lednik7/CLIP-ONNX
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
suinleelab/CellCLIP
[NeurIPS 2025] CellCLIP – Learning Perturbation Effects in Cell Painting via Text-Guided...