论文信息 - Cosegmentation of Image Pairs by Histogram Matching - Incorporating a Global Constraint into MRFs

Cosegmentation of Image Pairs by Histogram Matching - Incorporating a Global Constraint into MRFs

We introduce the term cosegmentation which denotes the task of segmenting simultaneously the common parts of an image pair. A generative model for cosegmentation is presented. Inference in the model leads to minimizing an energy with an MRF term encoding spatial coherency and a global constraint which attempts to match the appearance histograms of the common parts. This energy has not been proposed previously and its optimization is challenging and NP-hard. For this problem a novel optimization scheme which we call trust region graph cuts is presented. We demonstrate that this framework has the potential to improve a wide range of research: Object driven image retrieval, video tracking and segmentation, and interactive image editing. The power of the framework lies in its generality, the common part can be a rigid/non-rigid object (or scene), observed from different viewpoints or even similar objects of the same class.

[1] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[2] D J Field,et al. Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[3] D. Greig,et al. Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[4] Boris N. Pshenichnyj. The Linearization Method for Constrained Optimization , 1994 .

[5] Dimitri P. Bertsekas,et al. Nonlinear Programming , 1997 .

[6] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Jitendra Malik,et al. Recognizing surfaces using three-dimensional textons , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8] Dorin Comaniciu,et al. Mean shift analysis and applications , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9] Alexander Schrijver,et al. A Combinatorial Algorithm Minimizing Submodular Functions in Strongly Polynomial Time , 2000, J. Comb. Theory B.

[10] James Ze Wang,et al. SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11] James Ze Wang,et al. SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Marie-Pierre Jolly,et al. Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[13] Y.Y. Boykov,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14] Alan L. Yuille,et al. The Concave-Convex Procedure (CCCP) , 2001, NIPS.

[15] Olivier D. Faugeras,et al. Shape gradients for histogram segmentation using active contours , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16] Dorin Comaniciu,et al. Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[17] Hermann Ney,et al. Classification error rate for quantitative evaluation of content-based image retrieval systems , 2004, ICPR 2004.

[18] Hermann Ney,et al. Classification error rate for quantitative evaluation of content-based image retrieval systems , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[19] Vladimir Kolmogorov,et al. An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Tao Zhang,et al. Active contours for tracking distributions , 2004, IEEE Transactions on Image Processing.

[21] Andrew Blake,et al. "GrabCut" , 2004, ACM Trans. Graph..

[22] Andrew Blake,et al. Bi-layer segmentation of binocular stereo video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[23] Adrian Barbu,et al. Generalizing Swendsen-Wang to sampling arbitrary posterior probabilities , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Jeff A. Bilmes,et al. A Submodular-supermodular Procedure with Applications to Discriminative Structure Learning , 2005, UAI.

[25] Nebojsa Jojic,et al. LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.