sakibreza/ECCV24-HAT
Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"
This project helps researchers and engineers analyze continuous video streams and identify specific actions as they happen. It takes pre-extracted video features as input and outputs the start and end times of each action in real time. It is aimed at researchers working in computer vision, robotics, and surveillance systems.
No commits in the last 6 months.
Use this if you need to detect and localize actions within video streams online, as the video is being processed, rather than after the entire video has been recorded.
Not ideal if you are looking for a general-purpose video processing tool that handles raw video inputs or if you need to detect objects rather than actions.
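To make the online setting concrete, here is a minimal sketch of emitting start and end times while a stream is still being read. It is not HAT's history-augmented anchor transformer: it assumes a hypothetical upstream model has already turned the pre-extracted features into one action score per frame, and it uses plain thresholding as a stand-in. The names `ActionSegment`, `localize_online`, `fps`, and `threshold` are illustrative and do not come from the repository.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass
class ActionSegment:
    start: float  # segment start, in seconds
    end: float    # segment end, in seconds


def localize_online(
    scores: Iterable[float],
    fps: float = 1.0,
    threshold: float = 0.5,
) -> Iterator[ActionSegment]:
    """Yield a segment as soon as an action ends, reading one score per frame.

    Assumption: `scores` are per-frame action confidences produced by some
    upstream model. Thresholding here is a stand-in, not the HAT method.
    """
    start_idx: Optional[int] = None
    idx = -1
    for idx, score in enumerate(scores):
        active = score >= threshold
        if active and start_idx is None:
            # First frame above the threshold opens a segment.
            start_idx = idx
        elif not active and start_idx is not None:
            # First frame below the threshold closes the open segment.
            yield ActionSegment(start_idx / fps, idx / fps)
            start_idx = None
    if start_idx is not None:
        # The stream ended while an action was still in progress.
        yield ActionSegment(start_idx / fps, (idx + 1) / fps)
```

Because the function is a generator, each segment becomes available as soon as its last frame has been seen, so the caller never has to wait for the whole video. An offline method could instead inspect every frame before committing to any boundary.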
Stars: 19
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Aug 23, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/sakibreza/ECCV24-HAT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
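The same request can be made from Python using only the standard library. This is a sketch under assumptions: the `category/owner/repo` URL pattern is inferred from the single curl example above, the response format is not documented here (the code tries JSON and falls back to raw text), and the helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from one example."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the endpoint returns JSON. If the body does not parse as
    JSON, the raw text is returned instead.
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request):
# print(fetch_quality("computer-vision", "sakibreza/ECCV24-HAT"))
```

Keyless access is limited to 100 requests per day, so a script that polls many repositories should cache responses rather than refetch them.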
Higher-rated alternatives
cvg/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
shariqfarooq123/AdaBins
Official implementation of AdaBins: Depth Estimation Using Adaptive Bins
amusi/CVPR2026-Papers-with-Code
A collection of CVPR 2026 papers and open-source projects
isl-org/ZoeDepth
Metric depth estimation from a single image
daviddmc/NeSVoR
NeSVoR is a package for GPU-accelerated slice-to-volume reconstruction.