HarborYuan/PolyphonicFormer

[ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation

/ 100

Experimental

This project helps self-driving car engineers and robotics researchers analyze sensor data by precisely identifying and categorizing every object in a video stream while also estimating its distance from the camera. It takes raw video and depth sensor data as input and outputs a detailed, segmented view of the scene, with each object (like cars, pedestrians, or road surfaces) clearly delineated and assigned a depth value. This is used by engineers developing autonomous navigation systems.

No commits in the last 6 months.

Use this if you need to understand both what objects are present in a video scene and how far away they are, for applications like autonomous driving or advanced robotics.

Not ideal if your primary goal is simple object detection without needing fine-grained segmentation or depth information, or if you only work with still images.

autonomous-driving robotics scene-understanding computer-vision depth-perception

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

isl-org/Open3D

Open3D: A Modern Library for 3D Data Processing

cvg/Hierarchical-Localization

Visual localization made easy with hloc

gmberton/CosPlace

Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"

Vincentqyw/image-matching-webui

🤗 image matching webui

cvg/glue-factory

Training library for local feature detection and matching

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights