Multimodal Vision Language Embedding Tools
This category tracks 3 multimodal vision-language tools. The highest-rated is isaaccorley/goldeneye, which scores 39/100 and has 8 stars.
Get all 3 projects as JSON:
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=multimodal-vision-language&limit=20"
The API is open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
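The same request can be made from a script. The following is a minimal Python sketch that uses only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and because the response schema is not documented here, the parsed JSON is returned as-is with no assumptions about its fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings",
              subcategory="multimodal-vision-language",
              limit=20):
    """Build the query URL (hypothetical helper; defaults mirror the curl example)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and return the parsed JSON body.

    The shape of the response is an unknown here, so nothing is
    extracted from it; callers should inspect the result themselves.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` issues one request, which counts against the daily request limit.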
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | isaaccorley/goldeneye: GoldenEye is a library of geospatial vision-language models -- run any... | | Emerging |
| 2 | BIGBALLON/UME-Search: Toward Universal Multimodal Embedding | | Emerging |
| 3 | mariyahendriksen/ecir2022_category_to_image_retrieval: This repository contains the code for the paper "Extending CLIP for... | | Experimental |