FrancescoSaverioZuppichini/ViT

Implementing Vi(sion)T(transformer)

37
/ 100
Emerging

This project offers a guide to implementing a Vision Transformer (ViT) model, which is a powerful tool for image recognition tasks. It takes an input image, breaks it into smaller patches, and then processes these patches to classify or understand the image's content. Data scientists and machine learning engineers working on computer vision problems would use this to build and deploy image recognition systems.

453 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer who needs to implement a Vision Transformer model for image classification, understanding, or other computer vision tasks.

Not ideal if you are looking for a plug-and-play solution without diving into the underlying implementation details.

image-recognition computer-vision deep-learning machine-learning-engineering
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 19 / 25

How are scores calculated?

Stars

453

Forks

63

Language

License

Last pushed

Mar 19, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/FrancescoSaverioZuppichini/ViT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.