XunshanMan/MVGFormer

This is the official implementation of the work presented at CVPR 2024, titled Multiple View Geometry Transformers for 3D Human Pose Estimation (MVGFormer).

/ 100

Emerging

This project helps researchers and engineers working with human motion capture to accurately estimate 3D human poses from multiple camera views. You provide 2D video feeds from different angles, and it outputs precise 3D joint locations and body poses, even when camera setups change. This is ideal for those analyzing human movement in research, sports science, or robotics.

No commits in the last 6 months.

Use this if you need robust and generalizable 3D human pose estimations from multi-camera video footage, particularly in varied or changing camera environments.

Not ideal if you only have single-camera video input or are not working with human pose estimation.

human-motion-capture 3D-pose-estimation multi-camera-systems computer-vision biomechanics-analysis

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

NVlabs/MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

sign-language-translator/sign-language-translator

Python library & framework to build custom translators for the hearing-impaired and translate...

kyegomez/Jamba

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

autonomousvision/transfuser

[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving;...

kyegomez/MultiModalMamba

A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance...

Explore Transformer Models

All categories Trending Transformer directory Insights