HarborYuan/PolyphonicFormer
[ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
This project helps self-driving car engineers and robotics researchers analyze sensor data by precisely identifying and categorizing every object in a video stream while also estimating its distance from the camera. It takes raw video and depth sensor data as input and outputs a detailed, segmented view of the scene, with each object (like cars, pedestrians, or road surfaces) clearly delineated and assigned a depth value. This is used by engineers developing autonomous navigation systems.
No commits in the last 6 months.
Use this if you need to understand both what objects are present in a video scene and how far away they are, for applications like autonomous driving or advanced robotics.
Not ideal if your primary goal is simple object detection without needing fine-grained segmentation or depth information, or if you only work with still images.
Stars
56
Forks
4
Language
Python
License
—
Category
Last pushed
Dec 22, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/HarborYuan/PolyphonicFormer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching