xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

/ 100

Emerging

This project helps machine learning engineers and researchers to pretrain video transformers for classifying complex video content. It takes large datasets of unlabelled videos and images as input, and outputs a highly accurate model capable of recognizing actions or events within new videos. This is ideal for those working in computer vision research or developing video analysis applications.

161 stars. No commits in the last 6 months.

Use this if you need to train a robust video recognition model and want to leverage state-of-the-art self-supervised pretraining techniques to achieve high accuracy on various video tasks.

Not ideal if you are a beginner looking for a simple, out-of-the-box solution without deep understanding of PyTorch, transformers, and large-scale model training.

video-classification action-recognition machine-learning-engineering computer-vision deep-learning-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

161

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

patrick-llgc/Learning-Deep-Learning

Paper reading notes on Deep Learning and Machine Learning

eric-yyjau/pytorch-superpoint

Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629

magicleap/SuperGluePretrainedNetwork

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

changhao-chen/deep-learning-localization-mapping

A collection of deep learning based localization models

lucasb-eyer/pydensecrf

Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs with gaussian edge potentials.

Explore ML Frameworks

All categories Trending ML Framework directory Insights