Multimodal Image Search Vector Databases

Tools for semantic image retrieval using multimodal embeddings (text-to-image, image-to-image, or video search). Includes CLIP-based systems, vision transformers, and cross-modal ranking. Does NOT include general image classification, object detection, or single-modality text/vector search without image integration.

There are 44 multimodal image search tools tracked. The highest-rated is soulteary/simple-image-search-engine at 47/100 with 151 stars.

Get all 44 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=multimodal-image-search&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 soulteary/simple-image-search-engine

图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。

47
Emerging
2 ob-labs/image-search

Image search application built with the vector capabilities of OceanBase

44
Emerging
3 shotit/shotit-api

The ultimate brain of Shotit, in charge of task coordination.

43
Emerging
4 shotit/shotit

Shotit is a screenshot-to-video search engine tailored for TV & Film,...

43
Emerging
5 KarunyaChavan/Semantixel-Semantic_Image_Retrieval

Semantic Image Retrieval is a lightweight web-based platform that enables...

42
Emerging
6 sourav4243/sift-video

Semantic video search system that indexes audio and visual content to enable...

38
Emerging
7 EricRollei/Semantic-Search

A powerful **two-stage multimodal retrieval pipeline** for ComfyUI, enabling...

35
Emerging
8 Aaryan2304/visual-search-engine

An AI-powered visual search engine that finds visually similar fashion items...

35
Emerging
9 akashAD98/Car_ai_multimodal_search

A multimodal car search engine powered by LanceDB vector database that...

31
Emerging
10 sachink1729/intelligentgallery

Intelligent Image Gallery with Uploads, Deduplication, and Text-Based Search...

30
Emerging
11 AchrefHemissi/FoundIT-Computer-Vision-Powered-Lost-and-Found-Mobile-Application

The LostFound system is designed to facilitate the recovery of lost items...

30
Emerging
12 weaviate-tutorials/next-multimodal-search-demo

a Weaviate multimodal search demo

28
Experimental
13 shotit/shotit-media

Media broker for serving video preview for shotit

28
Experimental
14 jacobmarks/reverse-image-search-plugin

Find the images in your dataset most similar to a query image from URL or...

27
Experimental
15 bauerem/semantic-text2image-search

This repo implements a simple terminal-based semantic image search.

24
Experimental
16 shotit/shotit-frontend

The frontend of shotit, with full documentation.

24
Experimental
17 redswimmer/trail-camera-search

Multimodal vector search of images and videos taken from trail cameras. ...

23
Experimental
18 laxmanclo/pany

PostgreSQL-native semantic search engine with multi-modal capabilities. Add...

23
Experimental
19 JimmyHernandez503/oceano

Sistema de reconocimiento facial con InsightFace y Qdrant - 100% confiable

22
Experimental
20 santi1602/AnyCam2Ros

📷 Transform any camera into ROS2 image topics for seamless integration with...

22
Experimental
21 Abhics8/Lumina-AI

AI-powered visual commerce engine with semantic fashion search using OWLv2,...

21
Experimental
22 EsraaMadi/similarity-search-weaviate

Text/Image search for similar products

20
Experimental
23 oguzhantasimaz/image-similarity-search

Image Similarity Search with CLIP and Upstash Vector

19
Experimental
24 Aniket-16-S/Semantic_Video_Search

An AI powered Video Serach Engine with google's SigLIP and FAISS. It allows...

18
Experimental
25 IlyasFardaouix/VisualIndexer

Multimodal visual search engine using CLIP, OCR, and vector similarity retrieval.

17
Experimental
26 sefaburakokcu/semantic-image-search

Search for images using text and images using Milvus and OpenAI-Clip.

17
Experimental
27 aritro1011/QID

(Query Images by Description)- A simple pipeline to convert images to...

16
Experimental
28 coder-pikachu/mongo_multimodal

A powerful enterprise-grade application that enables semantic search across...

15
Experimental
29 suraj95/Whatsapp-Reel-Knowledge-Base

A small AI project that extract frames from an Instagram video to generate a...

14
Experimental
30 dschechter27875/clip_image_text_search

Multimodal semantic image search using CLIP embeddings and natural language queries.

14
Experimental
31 Sakshi3027/semantic-video-search

Production-grade semantic video search engine - search across video content...

14
Experimental
32 777reet/PhotoDiaries

Modern web photobooth with AI-powered image similarity search. Built with...

13
Experimental
33 BrandWill-ML-DS-DE/clip-faiss-product-search

End-to-end vision–language search system using CLIP + FAISS (HNSW/IVF) for...

13
Experimental
34 ecmoce/ask-gallery

Ask Gallery — Semantic photo search system powered by VLM, CLIP, and vector search

13
Experimental
35 hareshanmuhan/semantic-search

Search 1M+ images/videos with natural language — OpenAI CLIP + FAISS +...

13
Experimental
36 vaibhavhonakere/ClipQuest

Find exact moments in uploaded videos using natural-language search + timestamps.

13
Experimental
37 ejber-ozkan/local-llm-photo-scanner

A privacy-first, self-hosted photo manager powered by local LLMs (Ollama)...

13
Experimental
38 tyasemin/Data-Feature-Extraction-and-Retrieval-Pipeline

Project DART. Similarity search, SAM, CLIP, and more

13
Experimental
39 anantha119/Vector-Based-Image-Retrieval-System

This project leverages Vision Transformers (ViT) to build a scalable image...

13
Experimental
40 MeAmarP/HeyBagh

Find and Discover Visually Similar Content (Content-based image retrieval)

11
Experimental
41 ezecodes/frame-fushion

AI powered Video-Frame Analysis

11
Experimental
42 samiur-r/ImageSemantics

Semantic image clustering using CLIP and GPT-powered metadata in a Next.js +...

11
Experimental
43 shotit/.github

The README profile of Shotit.

11
Experimental
44 S-Pushkar/TagMyPic

This is a microservice based application that allows users to upload images...

11
Experimental