Hierarchical Object Parsing from Structured Noisy Point Clouds

Object parsing and segmentation from point clouds are challenging tasks because the relevant data is available only as thin structures along object boundaries or other features, and is corrupted by large amounts of noise. To handle this kind of data, flexible shape models are desired that can accurately follow the object boundaries. Popular models such as active shape and active appearance models (AAMs) lack the necessary flexibility for this task, while recent approaches such as the recursive compositional models make model simplifications to obtain computational guarantees. This paper investigates a hierarchical Bayesian model of shape and appearance in a generative setting. The input data is explained by an object parsing layer which is a deformation of a hidden principal component analysis (PCA) shape model with Gaussian prior. The paper also introduces a novel efficient inference algorithm that uses informed data-driven proposals to initialize local searches for the hidden variables. Applied to the problem of object parsing from structured point clouds such as edge detection images, the proposed approach obtains state-of-the-art parsing errors on two standard datasets without using any intensity information.

[1]  Bjarne K. Ersbøll,et al.  FAME-a flexible appearance modeling environment , 2003, IEEE Transactions on Medical Imaging.

[2]  Mikkel B. Stegmann,et al.  Analysis and segmentation of face images using point annotations and linear subspace techniques, in , 2002 .

[3]  Timothy F. Cootes,et al.  Combining Local and Global Shape Models for Deformable Object Matching , 2009, BMVC.

[4]  Gang Song,et al.  Untangling Cycles for Contour Grouping , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[5]  Adrian Barbu,et al.  Training an Active Random Field for Real-Time Image Denoising , 2009, IEEE Transactions on Image Processing.

[6]  Andrew Zisserman,et al.  OBJ CUT , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  H. Damasio,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence: Special Issue on Perceptual Organization in Computer Vision , 1998 .

[8]  Aaron Hertzmann,et al.  Learning Non-Rigid 3D Shape from 2D Motion , 2003, NIPS.

[9]  Dorin Comaniciu,et al.  Multiple object detection by sequential monte carlo and Hierarchical Detection Network , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Zhuowen Tu,et al.  Active skeleton for non-rigid object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Hua Li,et al.  Robust Non-Frontal Face Alignment with Edge Based Texture , 2005, Journal of Computer Science and Technology.

[12]  Pushmeet Kohli,et al.  Robust Higher Order Potentials for Enforcing Label Consistency , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Song-Chun Zhu,et al.  Evaluating information contributions of bottom-up and top-down processes , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14]  David Cristinacce,et al.  Automatic feature localisation with constrained local models , 2008, Pattern Recognit..

[15]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  Michael Goesele,et al.  A shape-based object class model for knowledge transfer , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17]  Christopher Joseph Pal,et al.  Learning Conditional Random Fields for Stereo , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Anat Levin,et al.  Learning to Combine Bottom-Up and Top-Down Segmentation , 2006, International Journal of Computer Vision.

[19]  Jayaram K. Udupa,et al.  Oriented Active Shape Models , 2009, IEEE Transactions on Medical Imaging.

[20]  Nikos Komodakis,et al.  Shape priors and discrete MRFs for knowledge-based segmentation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Shimon Ullman,et al.  Class-Specific, Top-Down Segmentation , 2002, ECCV.

[22]  Cordelia Schmid,et al.  Bandit Algorithms for Tree Search , 2007, UAI.

[23]  Nikos Komodakis,et al.  Beyond pairwise energies: Efficient optimization for higher-order MRFs , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Jim Graham,et al.  Robust Active Shape Model Search , 2002, ECCV.

[25]  Anand Rangarajan,et al.  A new point matching algorithm for non-rigid registration , 2003, Comput. Vis. Image Underst..

[26]  Joshua D. Schwartz,et al.  Hierarchical Matching of Deformable Shapes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Tao Zhang,et al.  Interactive graph cut based segmentation with shape priors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  Jianbo Shi,et al.  Recognizing objects by piecing together the Segmentation Puzzle , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Yogesh Rathi,et al.  Graph Cut Segmentation with Nonlinear Shape Priors , 2007, 2007 IEEE International Conference on Image Processing.

[30]  Jayaram K. Udupa,et al.  User-Steered Image Segmentation Paradigms: Live Wire and Live Lane , 1998, Graph. Model. Image Process..

[31]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[32]  Daniel P. Huttenlocher,et al.  Learning for Optical Flow Using Stochastic Optimization , 2008, ECCV.

[33]  Manuel Menezes de Oliveira Neto,et al.  Real-time line detection through an improved Hough transform voting scheme , 2008, Pattern Recognit..

[34]  Nebojsa Jojic,et al.  LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[35]  Takeo Kanade,et al.  A robust shape model for multi-view car alignment , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[37]  BarbuAdrian Training an active random field for real-time image denoising , 2009 .

[38]  Song-Chun Zhu,et al.  Learning Active Basis Model for Object Detection and Recognition , 2010, International Journal of Computer Vision.

[39]  Andrew Blake,et al.  Multiscale Categorical Object Recognition Using Contour Fragments , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Stephen Gould,et al.  Accelerated dual decomposition for MAP inference , 2010, ICML.

[41]  Timothy F. Cootes,et al.  Active Appearance Models , 1998, ECCV.

[42]  U. Grenander,et al.  Structural Image Restoration through Deformable Templates , 1991 .

[43]  CipollaRoberto,et al.  Multiscale Categorical Object Recognition Using Contour Fragments , 2008 .

[44]  Anand Rangarajan,et al.  The Softassign Procrustes Matching Algorithm , 1997, IPMI.

[45]  Shimon Ullman,et al.  Combined Top-Down/Bottom-Up Segmentation , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Paul Suetens,et al.  Active Shape Model-Based Segmentation of Digital X-ray Images , 1999, MICCAI.

[47]  Frédéric Jurie,et al.  Groups of Adjacent Contour Segments for Object Detection , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  Long Zhu,et al.  Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Jitendra Malik,et al.  Cue Integration for Figure/Ground Labeling , 2005, NIPS.