wjun0830/QD-DETR

Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

/ 100

Emerging

This project helps video content creators, marketers, or media analysts pinpoint specific moments within long videos and automatically detect highlights. You provide a video and a text query (like "goal shot" or "product launch"), and it outputs the exact time segments in the video that match your query or stand out as highlights. This tool is for anyone needing to efficiently extract key information or create summaries from video footage.

246 stars. No commits in the last 6 months.

Use this if you need to quickly find specific events or automatically identify important segments within a video based on a textual description.

Not ideal if you're looking to analyze static images, process only audio, or if your primary goal is general video editing rather than content retrieval or highlight detection.

video-editing content-creation media-analysis digital-marketing video-search

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

246

Forks

Language

Python

License

—

Higher-rated alternatives

BR-IDL/PaddleViT

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

pathak22/unsupervised-video

[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web

IBM/CrossViT

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

NVlabs/GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

ViTAE-Transformer/ViTDet

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights