Real-time Pose Estimation using Tree Structures Built from Skeletonised Volume Sequences

Pose estimation in the context of human motion analysis is the process of approximating the body configuration in each frame of a motion sequence. We propose a novel pose estimation method based on constructing tree structures from skeletonised visual hulls reconstructed from multi-view video. The pose is estimated independently in each frame, so the method can recover from errors in previous frames, which overcomes the problems of tracking. Publically available datasets were used to evaluate the method. On real data the method performs at a framerate of 15–64 fps depending on the resolution of the volume. Using synthetic data the positions of the extremities were determined with a mean error of 47–53 mm depending on the resolution.

[1]  Deborah Silver,et al.  Curve-Skeleton Properties, Applications, and Algorithms , 2007, IEEE Trans. Vis. Comput. Graph..

[2]  Irfan A. Essa,et al.  Novel Skeletal Representation for Articulated Creatures , 2004, ECCV.

[3]  Michel Couprie,et al.  Generic Initialization for Motion Capture from 3D Shape , 2010, ICIAR.

[4]  Andrea Fusiello,et al.  Tracking Human Motion with Multiple Cameras Using an Articulated Model , 2009, MIRAGE.

[5]  Gabriella Sanniti di Baja,et al.  Curve skeletonization of surface-like objects in 3D images guided by voxel classification , 2002, Pattern Recognit. Lett..

[6]  Lars Moland Eliassen,et al.  A Comparison of Learning Based Background Subtraction Techniques Implemented in CUDA , 2009 .

[7]  Hans-Peter Seidel,et al.  Marker-free kinematic skeleton estimation from sequences of volume data , 2004, VRST '04.

[8]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[9]  Bruno Raffin,et al.  3D Skeleton-Based Body Pose Recovery , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[10]  Adrian Hilton,et al.  The Multiple-Camera 3-D Production Studio , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Yen-Lin Chen,et al.  3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video , 2009, ACCV.

[12]  Toby Howard,et al.  Real-time 3-D human body tracking using learnt models of behaviour , 2008, Comput. Vis. Image Underst..

[13]  Rama Chellappa,et al.  Model Driven Segmentation of Articulating Humans in Laplacian Eigenspace , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Maja J. Mataric,et al.  Markerless kinematic model and motion capture from volume sequences , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[15]  Gilles Bertrand,et al.  A New 3D Parallel Thinning Scheme Based on Critical Kernels , 2006, DGCI.

[16]  Ronald Poppe,et al.  Vision-based human motion analysis: An overview , 2007, Comput. Vis. Image Underst..

[17]  A. Laurentini,et al.  The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Saïda Bouakaz,et al.  Real-Time and Markerless 3D Human Motion Capture Using Multiple Views , 2007, Workshop on Human Motion.

[19]  Ioannis Pitas,et al.  The i3DPost Multi-View and 3D Human Action/Interaction Database , 2009, 2009 Conference for Visual Media Production.