Human body pose recognition from a single-view depth camera

We propose a model-based approach for human body pose recognition from a single-view depth camera. The proposed algorithm applies an articulated cylinder model to detect human pose and track them based on a particle filter without numerous training data or heuristic detectors. To reduce high degrees of freedom, we adopt a hierarchical method that detects torso and limbs successively. Moreover, we take the advantage of a particle filter to track complex human motion and the results show that the proposed system is robust in human motion tracking. The qualitative evaluation shows that our method can deal with self-occlusion problem and ambiguous human motion effectively, and the quantitative evaluation shows that the average tracking error is 0.06 meters with a standard deviation of 0.04 meters. The proposed method tracks human poses successfully at the speed of 18 frames per second on a laptop with Intel Core i3-2100 CPU and without graphic processing unit.

[1]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[2]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Reinhard Koch,et al.  Nonlinear Body Pose Estimation from Depth Images , 2005, DAGM-Symposium.

[4]  Sebastian Thrun,et al.  Real time motion capture using a single time-of-flight camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Maja J. Mataric,et al.  Markerless kinematic model and motion capture from volume sequences , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6]  Behzad Dariush,et al.  Kinematic self retargeting: A framework for human pose estimation , 2010, Comput. Vis. Image Underst..

[7]  Maoying Qiao,et al.  Model-Based Human Pose Estimation with Hierarchical ICP from Single Depth Images , 2011 .

[8]  Ruigang Yang,et al.  Accurate 3D pose estimation from a single depth image , 2011, 2011 International Conference on Computer Vision.

[9]  Behzad Dariush,et al.  Controlled human pose estimation from depth image streams , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[10]  Hans-Peter Seidel,et al.  A data-driven approach for real-time full body pose reconstruction from a depth camera , 2011, 2011 International Conference on Computer Vision.

[11]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[12]  Stefano Soatto,et al.  Classification and Recognition of Dynamical Models: The Role of Phase, Independent Components, Kernels and Optimal Transport , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Stephan Hussmann,et al.  A Performance Review of 3D TOF Vision Systems in Comparison to Stereo Vision Systems , 2008 .

[14]  Jake K. Aggarwal,et al.  Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[15]  Jake K. Aggarwal,et al.  Human motion analysis: a review , 1997, Proceedings IEEE Nonrigid and Articulated Motion Workshop.

[16]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[17]  Gérard G. Medioni,et al.  Human pose estimation from a single view point, real-time range sensor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.