xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
This project helps machine learning engineers and researchers to pretrain video transformers for classifying complex video content. It takes large datasets of unlabelled videos and images as input, and outputs a highly accurate model capable of recognizing actions or events within new videos. This is ideal for those working in computer vision research or developing video analysis applications.
161 stars. No commits in the last 6 months.
Use this if you need to train a robust video recognition model and want to leverage state-of-the-art self-supervised pretraining techniques to achieve high accuracy on various video tasks.
Not ideal if you are a beginner looking for a simple, out-of-the-box solution without deep understanding of PyTorch, transformers, and large-scale model training.
Stars
161
Forks
19
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 19, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/xyzforever/BEVT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
patrick-llgc/Learning-Deep-Learning
Paper reading notes on Deep Learning and Machine Learning
eric-yyjau/pytorch-superpoint
Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629
magicleap/SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
changhao-chen/deep-learning-localization-mapping
A collection of deep learning based localization models
lucasb-eyer/pydensecrf
Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs with gaussian edge potentials.