ViTAE-Transformer/SAMText

The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"

/ 100

Experimental

This project helps content analysts, data annotators, or AI model trainers to accurately outline and track text in video frames. It takes existing video clips with text (like captions, signs, or logos) and produces precise, pixel-level mask annotations for each text instance. This significantly improves the detail compared to traditional rectangular boxes.

No commits in the last 6 months.

Use this if you need to create highly accurate, detailed mask annotations for text appearing in videos to train or evaluate advanced video text spotting models.

Not ideal if you only need basic bounding box annotations for text, as this tool focuses on more granular, pixel-level detail.

video-annotation text-recognition data-labeling computer-vision content-analysis

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

—

License

—

Higher-rated alternatives

gradio-app/gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

opengeos/segment-geospatial

A Python package for segmenting geospatial data with the Segment Anything Model (SAM)

wkentaro/osam

Get up and running with SAM1-3, EfficientSAM, YOLO-World, and other promptable vision models locally.

juglab/EmbedSeg

Code Implementation for EmbedSeg, an Instance Segmentation Method for Microscopy Images

lartpang/awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

Explore ML Frameworks

All categories Trending ML Framework directory Insights