论文信息 - Combining Top-Down and Bottom-Up Segmentation

Combining Top-Down and Bottom-Up Segmentation

In this work we show how to combine bottom-up and top-down approaches into a single figure-ground segmentation process. This process provides accurate delineation of object boundaries that cannot be achieved by either the top-down or bottom-up approach alone. The top-down approach uses object representation learned from examples to detect an object in a given input image and provide an approximation to its figure-ground segmentation. The bottom-up approach uses image-based criteria to define coherent groups of pixels that are likely to belong together to either the figure or the background part. The combination provides a final segmentation that draws on the relative merits of both approaches: The result is as close as possible to the top-down approximation, but is also constrained by the bottom-up process to be consistent with significant image discontinuities. We construct a global cost function that represents these top-down and bottom-up requirements. We then show how the global minimum of this function can be efficiently found by applying the sum-product algorithm. This algorithm also provides a confidence map that can be used to identify image regions where additional top-down or bottom-up information may further improve the segmentation. Our experiments show that the results derived from the algorithm are superior to results given by a pure top-down or pure bottom-up approach. The scheme has broad applicability, enabling the combined use of a range of existing bottom-up and top-down segmentations.

[1] Alan L. Yuille,et al. Deformable templates , 1993 .

[2] Timothy F. Cootes,et al. Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[3] William A. Barrett,et al. Interactive Segmentation with Intelligent Scissors , 1998, Graph. Model. Image Process..

[4] R. Baillargeon,et al. Effects of prior experience on 4.5-month-old infants' object segregation , 1998 .

[5] Pietro Perona,et al. Unsupervised Learning of Models for Recognition , 2000, ECCV.

[6] Michel Vidal-Naquet,et al. A Fragment-Based Approach to Object Representation and Classification , 2001, IWVF.

[7] Ronen Basri,et al. Segmentation and boundary detection using multiscale intensity measurements , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[8] S. Sclaroff,et al. Region segmentation via deformable model-guided split and merge , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9] Ralph Gross,et al. Concurrent Object Recognition and Segmentation by Graph Partitioning , 2002, NIPS.

[10] Dan Roth,et al. Learning a Sparse Representation for Object Detection , 2002, ECCV.

[11] Shimon Ullman,et al. Class-Specific, Top-Down Segmentation , 2002, ECCV.

[12] X. Jin. Factor graphs and the Sum-Product Algorithm , 2002 .

[13] Zhuowen Tu,et al. Image Parsing: Segmentation, Detection, and Recognition , 2003 .

[14] Demetri Terzopoulos,et al. Snakes: Active contour models , 2004, International Journal of Computer Vision.

[15] Bastian Leibe,et al. Interleaved Object Categorization and Segmentation , 2003, BMVC.

[16] Shimon Ullman,et al. Learning to Segment , 2004, ECCV.