An integrated framework for image segmentation and perceptual grouping

This paper presents an efficient algorithm for image segmentation and a framework for perceptual grouping. It makes an attempt to provide one way of combining bottom-up and top-down approaches. In image segmentation, it generalizes the Swendsen-Wang cut algorithm (SWC) by Barbu and Zhu (2003) to make both 2-way and m-way cuts, and includes topology change processes (graph repartitioning and boundary diffusion). The method directly works at a low temperature without using annealing. We show that it is much faster than the DDMCMC approach (Tu and Zhu, 2002) and more robust than the SWC method. The results are demonstrated on the Berkeley data set. In perceptual grouping, it integrates discriminative model learning/computing, a belief propagation algorithm (BP) by Yedidia et al. (2000), and SWC into a three-layer computing framework. These methods are realized as different levels of approximation to an "ideal" generative model. We demonstrate the algorithm on the problem of human body configuration.

[1]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[2]  Alan L. Yuille,et al.  Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Zhuowen Tu,et al.  AUTOMATIC RECOGNITION OF CIVIL INFRASTRUCTURE OBJECTS IN MOBILE MAPPING IMAGERY USING A MARKOV RANDOM FIELD MODEL , 2000 .

[4]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[5]  Ronen Basri,et al.  Segmentation and boundary detection using multiscale intensity measurements , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Michael Werman,et al.  Self-Organization in Vision: Stochastic Clustering for Image Segmentation, Perceptual Grouping, and Image Database Organization , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Zhuowen Tu,et al.  A Stochastic Algorithm for 3D Scene Segmentation and Reconstruction , 2002, ECCV.

[8]  Zhuowen Tu,et al.  Parsing Images into Region and Curve Processes , 2002, ECCV.

[9]  Zhuowen Tu,et al.  Image Segmentation by Data-Driven Markov Chain Monte Carlo , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Rongxing Li,et al.  A FRAMEWORK FOR AUTOMATIC RECOGNITION OF SPATIAL FEATURES FROM MOBILE MAPPING IMAGERY , 2002 .

[11]  Tomer Hertz,et al.  Learning and inferring image segmentations using the GBP typical cut algorithm , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Antonio Torralba,et al.  Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes , 2003, NIPS.

[13]  Adrian Barbu,et al.  Graph partition by Swendsen-Wang cuts , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  David A. Forsyth,et al.  Using temporal coherence to build models of animals , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Zhuowen Tu,et al.  Image Parsing: Unifying Segmentation, Detection, and Recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16]  Zhuowen Tu,et al.  Image Parsing: Segmentation, Detection, and Recognition , 2003 .

[17]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[18]  R. Collins,et al.  Representation and matching of articulated shapes , 2004, CVPR 2004.

[19]  Jitendra Malik,et al.  Learning to detect natural image boundaries using local brightness, color, and texture cues , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Jitendra Malik,et al.  Recovering human body configurations: combining segmentation and recognition , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21]  Zhuowen Tu,et al.  Shape Matching and Recognition - Using Generative Models and Informative Features , 2004, ECCV.

[22]  M. Lee,et al.  Proposal maps driven MCMC for estimating human body pose in static images , 2004, CVPR 2004.

[23]  Zhuowen Tu,et al.  Probabilistic boosting-tree: learning discriminative models for classification, recognition, and clustering , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[24]  William T. Freeman,et al.  Constructing free-energy approximations and generalized belief propagation algorithms , 2005, IEEE Transactions on Information Theory.

[25]  Zhuowen Tu,et al.  Parsing Images into Regions, Curves, and Curve Groups , 2006, International Journal of Computer Vision.

[26]  Paul M. Thompson,et al.  A Learning Based Algorithm for Automatic Extraction of the Cortical Sulci , 2006, MICCAI.

[27]  Zhuowen Tu,et al.  Supervised Learning of Edges and Object Boundaries , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[28]  Paul M. Thompson,et al.  Direct mapping of hippocampal surfaces with intrinsic shape context , 2007, NeuroImage.

[29]  Paul M. Thompson,et al.  Automated Extraction of the Cortical Sulci Based on a Supervised Learning Approach , 2007, IEEE Transactions on Medical Imaging.

[30]  Zhuowen Tu,et al.  Learning Generative Models via Discriminative Approaches , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Zhuowen Tu,et al.  Feature Mining for Image Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Arthur W. Toga,et al.  Segmentation of Sub-cortical Structures by the Graph-Shifts Algorithm , 2007, IPMI.

[33]  Paul M. Thompson,et al.  Joint Sulci Detection Using Graphical Models and Boosted Priors , 2007, IPMI.