Combining Top-Down and Bottom-Up Segmentation

In this work we show how to combine bottom-up and top-down approaches into a single figure-ground segmentation process. This process provides accurate delineation of object boundaries that cannot be achieved by either the top-down or bottom-up approach alone. The top-down approach uses object representation learned from examples to detect an object in a given input image and provide an approximation to its figure-ground segmentation. The bottom-up approach uses image-based criteria to define coherent groups of pixels that are likely to belong together to either the figure or the background part. The combination provides a final segmentation that draws on the relative merits of both approaches: The result is as close as possible to the top-down approximation, but is also constrained by the bottom-up process to be consistent with significant image discontinuities. We construct a global cost function that represents these top-down and bottom-up requirements. We then show how the global minimum of this function can be efficiently found by applying the sum-product algorithm. This algorithm also provides a confidence map that can be used to identify image regions where additional top-down or bottom-up information may further improve the segmentation. Our experiments show that the results derived from the algorithm are superior to results given by a pure top-down or pure bottom-up approach. The scheme has broad applicability, enabling the combined use of a range of existing bottom-up and top-down segmentations.

[1]  Alan L. Yuille,et al.  Deformable templates , 1993 .

[2]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[3]  William A. Barrett,et al.  Interactive Segmentation with Intelligent Scissors , 1998, Graph. Model. Image Process..

[4]  R. Baillargeon,et al.  Effects of prior experience on 4.5-month-old infants' object segregation , 1998 .

[5]  Pietro Perona,et al.  Unsupervised Learning of Models for Recognition , 2000, ECCV.

[6]  Michel Vidal-Naquet,et al.  A Fragment-Based Approach to Object Representation and Classification , 2001, IWVF.

[7]  Ronen Basri,et al.  Segmentation and boundary detection using multiscale intensity measurements , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[8]  S. Sclaroff,et al.  Region segmentation via deformable model-guided split and merge , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  Ralph Gross,et al.  Concurrent Object Recognition and Segmentation by Graph Partitioning , 2002, NIPS.

[10]  Dan Roth,et al.  Learning a Sparse Representation for Object Detection , 2002, ECCV.

[11]  Shimon Ullman,et al.  Class-Specific, Top-Down Segmentation , 2002, ECCV.

[12]  X. Jin Factor graphs and the Sum-Product Algorithm , 2002 .

[13]  Zhuowen Tu,et al.  Image Parsing: Segmentation, Detection, and Recognition , 2003 .

[14]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[15]  Bastian Leibe,et al.  Interleaved Object Categorization and Segmentation , 2003, BMVC.

[16]  Shimon Ullman,et al.  Learning to Segment , 2004, ECCV.