Robustness of Input Features from Noisy Silhouettes in Human Pose Estimation

Silhouettes are frequently extracted and described to compose inputs for learning methods in solving human pose estimation problem. Although silhouettes extracted from background subtraction methods are usually noisy, the effect of noisy inputs to pose estimation accuracies is seldom studied. In this paper, we explore this problem. First, We compare performances of several image features widely used for human pose estimation and explore their performances against each other and select one with best performance. Second, iterative closest point algorithm is introduced for a new quantitative measurement of noisy inputs. The proposed measurement is able to automatically discard noise, like camouflage from the background or shadows. With the proposed measurement, we split inputs into different noise levels and assess their pose estimation accuracies. Furthermore, we explore performances of silhouette samples of different noise levels and compare with the selected feature on a public dataset: Human Eva dataset.

[1]  A. Broggi,et al.  Pedestrian Detection using Infrared images and Histograms of Oriented Gradients , 2006, 2006 IEEE Intelligent Vehicles Symposium.

[2]  Cristian Sminchisescu,et al.  Structured output-associative regression , 2009, CVPR.

[3]  Fabio Tozeto Ramos,et al.  Multi-Kernel Gaussian Processes , 2011, IJCAI.

[4]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[5]  Alain Rakotomamonjy,et al.  A Pedestrian Detector Using Histograms of Oriented Gradients and a Support Vector Machine Classifier , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  Serge J. Belongie,et al.  Matching with shape contexts , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[9]  Jitendra Malik,et al.  Estimating Human Body Configurations Using Shape Context Matching , 2002, ECCV.

[10]  Edwin V. Bonilla,et al.  Explorer Multitask Gaussian Process Prediction , 2008 .

[11]  Ankur Agarwal,et al.  Recovering 3D human pose from monocular images , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Michael Arens,et al.  Modeling vs. learning approaches for monocular 3D human pose estimation , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[13]  Ahmed M. Elgammal,et al.  Regression from local features for viewpoint and pose estimation , 2011, 2011 International Conference on Computer Vision.

[14]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[15]  Liefeng Bo,et al.  Structured output-associative regression , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Daniel Grest,et al.  Gradient-Enhanced Particle Filter for Vision-Based Motion Capture , 2007, Workshop on Human Motion.

[17]  Zhengyou Zhang,et al.  Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[18]  Jordi Gonzàlez,et al.  A coarse-to-fine approach for fast deformable object detection , 2011, CVPR 2011.

[19]  Bingbing Ni,et al.  Articulated Object Registration Using Simulated Physical Force/Moment for 3D Human Motion Tracking , 2007, Workshop on Human Motion.

[20]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[21]  Maja Pantic,et al.  Shape-constrained Gaussian process regression for facial-point-based head-pose normalization , 2011, 2011 International Conference on Computer Vision.

[22]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.