rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
This project helps computer vision practitioners accurately identify both familiar and entirely new types of objects within images. Given input images, it outputs bounding boxes and labels for all detected objects, including categories it was not specifically trained on. It is aimed at researchers and developers building computer vision systems for tasks such as autonomous inspection or content moderation, where unexpected items are common.
Use this if you need to detect a wide variety of objects in images, including novel or previously unseen categories, without needing to retrain your entire system for every new object.
Not ideal if you only need to detect a fixed, predefined set of common objects and do not anticipate encountering any new categories.
Stars: 84
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Jan 02, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/rohit901/cooperative-foundational-models"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
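The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names build_quality_url and fetch_quality are hypothetical, and the response is assumed to be JSON, which this page does not confirm. Only the URL pattern is taken from the curl command above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repository given as 'owner/name'."""
    owner, name = repo.split("/", 1)
    parts = [quote(part, safe="") for part in (category, owner, name)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record (assumption: the body is JSON)."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("computer-vision", "rohit901/cooperative-foundational-models")
# print(data)
```

Keeping the URL construction separate from the network call makes it easy to check the request target without spending any of the 100 keyless requests per day.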
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching