Multimodal Vision Language Embedding Tools

There are 3 multimodal vision language tools tracked. The highest-rated is isaaccorley/goldeneye at 39/100 with 8 stars.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=multimodal-vision-language&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	isaaccorley/goldeneye GoldenEye is a library of geospatial vision-language models -- run any...	39	Emerging	8	Python
2	BIGBALLON/UME-Search Toward Universal Multimodal Embedding	35	Emerging	74	Python
3	mariyahendriksen/ecir2022_category_to_image_retrieval This repository contains the code for the paper "Extending CLIP for...	22	Experimental	6	Jupyter Notebook