hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
This tool helps video editors, content creators, and VFX artists automatically separate specific objects from their backgrounds in long video sequences. You provide a video and mark the object you care about in the first frame, and it outputs a segmentation mask for that object in every subsequent frame, even through occlusions or complex movements. It is designed for anyone who needs to isolate elements for visual effects, compositing, or detailed analysis.
1,962 stars. No commits in the last 6 months.
Use this if you need to accurately track and segment a specific object across an entire video, especially very long ones, with minimal manual effort.
Not ideal if you need multiple objects segmented automatically without marking them first, or if real-time object detection on live streams is your primary goal.
Stars: 1,962
Forks: 207
Language: Python
License: MIT
Category:
Last pushed: Nov 15, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/hkchengrex/XMem"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
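The same request can be made from Python. The sketch below is a minimal example using only the standard library. It rests on two assumptions that this page does not confirm: that the endpoint path follows a category/owner/repo pattern (inferred from the single URL in the curl command), and that the response body is JSON. The helper names quality_url and fetch_quality are hypothetical and not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path layout is <category>/<owner>/<repo>, inferred
    from the one example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the response is JSON; the page does not document the format.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the request.
    print(quality_url("computer-vision", "hkchengrex", "XMem"))
```

Without a key, requests count against the 100 requests/day limit stated above, so a script that polls this endpoint may want to cache results locally.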
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching