YuqingWang1029/VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

/ 100

Emerging

This project helps computer vision researchers and developers analyze videos by automatically identifying and outlining every distinct object across multiple frames. You provide video frames and their corresponding annotations, and it outputs a JSON file with instance segmentation results, specifying what each object is and where it is located in each frame. This is primarily for those working on advanced video analysis systems.
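
As a rough illustration of working with that JSON output, the sketch below lists the frames in which each confidently predicted object appears. The record layout is an assumption modeled on YouTube-VIS-style results (one record per object, with `video_id`, `category_id`, `score`, and a per-frame `segmentations` list holding `null` where the object is absent); the listing itself does not confirm the schema. The helper name `summarize_instances`, the `min_score` threshold, and the sample data are hypothetical.

```python
import json


def summarize_instances(results, min_score=0.5):
    """List, for each instance at or above min_score, the frames with a mask.

    Assumes each record has 'video_id', 'category_id', 'score', and
    'segmentations' (one entry per frame, None when the object is absent).
    This layout is an assumption, not a documented VisTR schema.
    """
    summary = []
    for inst in results:
        if inst["score"] < min_score:
            continue  # skip low-confidence predictions
        frames = [
            idx for idx, seg in enumerate(inst["segmentations"])
            if seg is not None
        ]
        summary.append({
            "video_id": inst["video_id"],
            "category_id": inst["category_id"],
            "score": inst["score"],
            "frames": frames,
        })
    return summary


# Hypothetical stand-in for the contents of a results file.
sample_json = """
[
  {"video_id": 1, "category_id": 3, "score": 0.9,
   "segmentations": [{"size": [2, 2], "counts": "x"}, null,
                     {"size": [2, 2], "counts": "y"}]},
  {"video_id": 1, "category_id": 7, "score": 0.2,
   "segmentations": [null, {"size": [2, 2], "counts": "z"}, null]}
]
"""

results = json.loads(sample_json)
summary = summarize_instances(results)
for row in summary:
    print(row)
```

With a real file, `json.load(open(path))` would replace the inline sample, and the mask entries would need a decoder suited to whatever encoding the project actually writes.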

757 stars. No commits in the last 6 months.

Use this if you need to precisely track and segment individual objects throughout an entire video, such as for advanced behavior analysis or autonomous system perception.

Not ideal if you only need to detect static objects in single images or perform general object classification without detailed instance tracking across frames.

video-analysis object-tracking computer-vision motion-analysis instance-segmentation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25


Stars

757

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

alibaba/EasyCV

An all-in-one toolkit for computer vision

qanastek/HugsVision

HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision

EMalagoli92/GCViT-TensorFlow

TensorFlow 2.X reimplementation of Global Context Vision Transformers, Ali Hatamizadeh, Hongxu...

kode-git/vfer

Building a real-time environment using webcam frame division in OpenCV and classify cropped...

RubenCasal/owl_vit_detector

NanoOWL Detection System enables real-time open-vocabulary object detection in ROS 2 using a...
