Robust human body segmentation based on part appearance and spatial constraint

Human body segmentation in images is desirable in various practical applications, e.g., content-based image retrieval. However, it remains a challenging problem due to various body poses and confusing background. To overcome these difficulties, two properties of human body are explored in this paper, i.e., complementary property and weak structure property. Complementary property means that different human body parts always have the similar appearances. With this property, we propose to construct the Part Appearance Map (PAM). PAM can effectively represent the appearance probability of what a pixel belong to a human body, even for inaccurate human pose obtained by pictorial structure model. Afterward, robust foreground and background seeds are acquired by PAM. To utilize the structure information of human body effectively, we propose a novel graph cuts method - spatial constraint based graph cuts (SCGC), which incorporates weak structure property of human body parts into the cost function. The weak structure property constrains the arms, legs and head to appear in limited space under the condition that the location of torso is ascertained. With this property, the SCGC can successfully remove false segmentations by traditional graph cuts methods due to their similar appearances to human body. Experimental results show that the proposed method achieves promising performance and outperforms many state-of-the-art methods over publicly available challenging datasets which contain arbitrary poses.

[1]  Yiannis Kompatsiaris,et al.  TV Content Analysis: Techniques and Applications , 2011 .

[2]  Marie-Pierre Jolly,et al.  Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[3]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[4]  Aurélio J. C. Campilho,et al.  Performance Evaluation of Image Segmentation , 2006, ICIAR.

[5]  Toby Sharp,et al.  Image segmentation with a bounding box prior , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6]  King Ngi Ngan,et al.  Automatic body segmentation with graph cut and self-adaptive initialization level set (SAILS) , 2011, J. Vis. Commun. Image Represent..

[7]  Sven J. Dickinson,et al.  TurboPixels: Fast Superpixels Using Geometric Flows , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[9]  Ramakant Nevatia,et al.  Segmentation of objects in a detection window by Nonparametric Inhomogeneous CRFs , 2011, Comput. Vis. Image Underst..

[10]  Harish Bhaskar,et al.  Articulated human body parts detection based on cluster background subtraction and foreground matching , 2013, Neurocomputing.

[11]  Hong Yan,et al.  Clothing segmentation using foreground and background estimation based on the constrained Delaunay triangulation , 2008, Pattern Recognit..

[12]  Loong Fah Cheong,et al.  Active segmentation with fixation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Andrew Blake,et al.  Geodesic star convexity for interactive image segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Yongdong Zhang,et al.  Tracking Web Video Topics: Discovery, Visualization, and Monitoring , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Andrew Zisserman,et al.  OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Jitendra Malik,et al.  Shape Guided Object Segmentation , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[18]  Larry S. Davis,et al.  Closely coupled object detection and segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[19]  Mark Everingham,et al.  Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation , 2010, BMVC.

[20]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Larry S. Davis,et al.  An Interactive Approach to Pose-Assisted and Appearance-based Segmentation of Humans , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[23]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[24]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[25]  Sergio A. Velastin,et al.  Intelligent Multimedia Analysis for Security Applications , 2012, Intelligent Multimedia Analysis for Security Applications.

[26]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[29]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[30]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[31]  Huchuan Lu,et al.  Arbitrary body segmentation in static images , 2012, Pattern Recognit..

[32]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[33]  Shuicheng Yan,et al.  Inferring semantic concepts from community-contributed images and noisy tags , 2009, ACM Multimedia.

[34]  Mark Everingham,et al.  Learning effective human pose estimation from inaccurate annotation , 2011, CVPR 2011.

[35]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[36]  Heiko Wersing,et al.  Online figure-ground segmentation with adaptive metrics in generalized LVQ , 2009, Neurocomputing.

[37]  Shihong Lao,et al.  Adaptive Contour Features in oriented granular space for human detection and segmentation , 2009, CVPR.

[38]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.