vision_transformers and convolution-vision-transformers

These are ecosystem siblings—one provides a general framework for applying Vision Transformers across multiple computer vision tasks, while the other implements a specific architectural variant (CvT) that could be integrated into or compared against such frameworks.

Maintenance 6/25
Adoption 8/25
Maturity 25/25
Community 14/25
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 19/25
Stars: 65
Forks: 9
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 226
Forks: 34
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Dependents
Stale 6m No Package No Dependents

About vision_transformers

sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object detection.

This project helps computer vision practitioners train models to automatically identify objects, classify images, or segment images into meaningful regions. You provide it with images or video data, and it outputs a trained model capable of performing these tasks or shows the detected objects/classifications on your input. It's designed for machine learning engineers, data scientists, and researchers working with visual data.

image-classification object-detection image-segmentation computer-vision deep-learning-models

About convolution-vision-transformers

rishikksh20/convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

This project offers a PyTorch implementation of the Convolutional Vision Transformer (CvT) neural network architecture. It takes image data as input and produces classifications, enabling advanced computer vision tasks. This is for researchers and machine learning engineers who need to experiment with or apply state-of-the-art image recognition models.

deep-learning image-classification computer-vision neural-networks machine-learning-research

Scores updated daily from GitHub, PyPI, and npm data. How scores work