ruohaoguo/pavsodr

Official Implementation of "Instance-Level Panoramic Audio-Visual Saliency Detection and Ranking" [ACM MM 2024].

21
/ 100
Experimental

This project helps video analysts and content creators automatically identify the most captivating objects in 360-degree panoramic videos that have sound. You input a panoramic video with audio, and it outputs segmented visual regions corresponding to individual salient objects, along with a ranking of how much each object attracts attention. This is ideal for anyone working with immersive video content who needs to understand viewer focus.

No commits in the last 6 months.

Use this if you need to pinpoint and rank specific attention-grabbing objects within a 360-degree video, especially when sound plays a role.

Not ideal if you are working with standard (non-panoramic) videos or if your primary interest is in general scene understanding rather than specific object saliency.

360-video-analysis content-saliency immersive-media audience-attention video-segmentation
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

11

Forks

Language

Python

License

MIT

Last pushed

Jul 28, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/ruohaoguo/pavsodr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.