OpenGVLab/perception_test_iccv2023

Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.

21
/ 100
Experimental

This project helps computer vision researchers and AI developers identify specific actions or sounds within video recordings. By inputting video and audio data, it precisely outputs the start and end times of events like a dog barking or a person waving. Researchers working on video analysis, surveillance, or content moderation would find this useful.

No commits in the last 6 months.

Use this if you need to accurately pinpoint the exact moments when specific actions or sounds occur in video content, especially for research or developing new AI models.

Not ideal if you are looking for a ready-to-use application with a graphical interface, as this project provides core models and code for integration by developers.

Video Analysis Temporal Localization Audio Event Detection Action Recognition Computer Vision Research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

14

Forks

Language

Python

License

MIT

Last pushed

Oct 18, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/OpenGVLab/perception_test_iccv2023"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.