amazon-science/glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
This project automatically detects and reads text in complex images, even when the text is small, rotated, or blended into the background. It takes an image as input and outputs the recognized text along with its location in the image. This tool is useful for anyone working with visual data that contains varied and challenging text elements.
102 stars. No commits in the last 6 months.
Use this if you need to accurately extract text from diverse real-world images where text size, orientation, and context vary significantly.
Not ideal if you primarily work with clean, high-contrast images where text is always horizontal and easily legible.
Stars
102
Forks
13
Language
Python
License
Apache-2.0
Category
Computer Vision
Last pushed
Jun 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/amazon-science/glass-text-spotting"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
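The same endpoint can be queried from code. A minimal Python sketch, assuming the endpoint returns JSON (the exact response fields are not documented here, so only the URL construction is shown as certain; `fetch_quality` is a hypothetical helper):

```python
import json
import urllib.request

# Base path taken from the curl command above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the API URL for a repository, mirroring the curl example."""
    return f"{API_BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Hypothetical helper: fetch and decode the JSON response.

    Free tier: 100 requests/day without a key; the JSON shape is an
    assumption -- inspect the actual response before relying on fields.
    """
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)

print(quality_url("computer-vision", "amazon-science", "glass-text-spotting"))
```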
Higher-rated alternatives
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...