FORTH-ModelBasedTracker/MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance
This project helps animators and 3D artists convert standard video footage of people into 3D character animations. You feed it a color video of a person, and it outputs a Bio Vision Hierarchy (BVH) file, which is a common format for motion capture data. This tool is for anyone creating animated characters, especially in fields like game development, virtual reality, or film.
925 stars.
Use this if you need to quickly generate realistic 3D human pose data from a 2D video to animate characters in your 3D software.
Not ideal if you require extremely precise medical-grade motion analysis or need to capture complex object interactions beyond basic human pose.
Stars
925
Forks
143
Language
C++
License
—
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/FORTH-ModelBasedTracker/MocapNET"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
alicevision/AliceVision
3D Computer Vision Framework
colmap/colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
ANTsX/ANTs
Advanced Normalization Tools (ANTs)
alicevision/Meshroom
Node-based Visual Programming Toolbox
MOLAorg/mola
A Modular Optimization framework for Localization and mApping (MOLA)