ViTAE-Transformer/SAMText
The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
This project helps content analysts, data annotators, or AI model trainers to accurately outline and track text in video frames. It takes existing video clips with text (like captions, signs, or logos) and produces precise, pixel-level mask annotations for each text instance. This significantly improves the detail compared to traditional rectangular boxes.
No commits in the last 6 months.
Use this if you need to create highly accurate, detailed mask annotations for text appearing in videos to train or evaluate advanced video text spotting models.
Not ideal if you only need basic bounding box annotations for text, as this tool focuses on more granular, pixel-level detail.
Stars
16
Forks
—
Language
—
License
—
Category
Last pushed
May 03, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ViTAE-Transformer/SAMText"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
opengeos/segment-geospatial
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
wkentaro/osam
Get up and running with SAM1-3, EfficientSAM, YOLO-World, and other promptable vision models locally.
juglab/EmbedSeg
Code Implementation for EmbedSeg, an Instance Segmentation Method for Microscopy Images
lartpang/awesome-segmentation-saliency-dataset
A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile: