Vision Language Models Computer Vision Tools

There are 3 vision language model tools tracked. The highest-rated is LeapLabTHU/Pseudo-Q, which scores 37/100 and has 153 stars.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=computer-vision&subcategory=vision-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
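The same request can be made from a script. Below is a minimal Python sketch using only the standard library. The endpoint and query parameters are copied from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented on this page, so the sketch returns the decoded body without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain, subcategory, limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(domain, subcategory, limit=20, timeout=10):
    """Send a keyless GET request and return the decoded JSON body.

    The response schema is an unknown here, so the value is returned as-is.
    """
    with urlopen(build_url(domain, subcategory, limit), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_projects("computer-vision", "vision-language-models")` performs the network request and counts against the daily request limit, so inspect the returned structure once before writing code that depends on particular keys.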

| # | Tool | Description | Score (/100) | Tier |
|---|------|-------------|--------------|------|
| 1 | LeapLabTHU/Pseudo-Q | [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding | 37 | Emerging |
| 2 | Hardhik-Poosa/Drone_Swarm | AI-powered drone swarm simulator that converts images into optimized 2D and... | 13 | Experimental |
| 3 | Gtothemoon/Contrastive-VisionVAE-Follower | Contrastive-VisionVAE-Follower is a model used for multi-modal task called... | 11 | Experimental |