d3tk/REOrder
Does patch ordering affect context-limited vision transformers?
REOrder improves the accuracy of vision transformers by intelligently reordering the sequence in which image patches are processed. It takes your image dataset and a trained vision transformer model, and outputs a more accurate version of that model. It is aimed at machine learning engineers and researchers building and fine-tuning advanced computer vision systems.
No commits in the last 6 months.
Use this if you are working with vision transformers and want to boost their performance on specific image classification or analysis tasks by optimizing how image patches are sequenced.
Not ideal if you are looking for a plug-and-play solution for basic image processing or if you do not work with vision transformer architectures.
Stars
17
Forks
1
Language
Python
License
MIT
Category
Computer Vision
Last pushed
Oct 10, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/d3tk/REOrder"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
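For scripted access, the same endpoint can be called from Python instead of curl. This is a minimal sketch: the URL pattern is taken from the curl example above, but the shape of the JSON response is an assumption, so inspect the payload before depending on specific fields.

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_endpoint(category: str, owner: str, repo: str) -> str:
    """Build the per-repository quality endpoint URL."""
    return f"{BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload for one repository.

    The response schema is not documented here; this just returns
    whatever JSON object the API sends back.
    """
    with urllib.request.urlopen(quality_endpoint(category, owner, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(quality_endpoint("computer-vision", "d3tk", "REOrder"))
```

With a free API key (1,000 requests/day), you would attach it per the service's documentation; the no-key tier shown in the curl example needs no extra headers.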
Higher-rated alternatives
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...