Augmented poselets for human body pose inference by a probabilistic graphical model

Human body pose estimation is a challenging task which, depending on the context of application, degree of accuracy and availability of image frames, is being faced with different state-of-the-art approaches. In this paper we propose a part-based detection approach combined with a probabilistic graphical model framework for structural constraints on monocular single images, which offers several benefits: human body joints localization inference (rather than a holistic body detection), low computational cost, and robustness against unknown poses as long as antropomorphic constraints are preserved. These outcomes make this approach feasible for applications related to portable devices or multimedia applications which need to be aware of the presence of people in real time at a low cost, and can take advantage of the knowledge about body poses. The presented approach is built by taking into account the existing "Poselets" architecture and one of its foundations, the "H3D" dataset. On top of this, we "augment" the prior knowledge about human body structure and parts appearance in order to learn spatial probability distributions on body natural constraints, which will be used afterwards by the probabilistic graphical model.

[1]  Jitendra Malik,et al.  Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2]  Michael J. Black,et al.  Learning the Statistics of People in Images and Video , 2003, International Journal of Computer Vision.

[3]  Camillo J. Taylor,et al.  Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image , 2000, Comput. Vis. Image Underst..

[4]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Fatih Murat Porikli,et al.  Pedestrian Detection via Classification on Riemannian Manifolds , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[8]  William T. Freeman,et al.  Correctness of Belief Propagation in Gaussian Graphical Models of Arbitrary Topology , 1999, Neural Computation.

[9]  Subhransu Maji,et al.  Object detection using a max-margin Hough transform , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.