Energy Minimization with Discontinuities

Many tasks in computer vision can be formulated as energy minimization problems. In this paper, we consider a natural class of energy functions that permits discontinuities. We show that minimizing these energy functions is NP-hard. However, the energy minimization problem can be solved by computing a minimum cost multiway cut on an associated graph. This allows the use of approximation algorithms that produce answers with quality guarantees. We present an efficient approximation algorithm that produces a local minimum even if very large moves are allowed. We apply our method to the tasks of image restoration and stereo. For both tasks we obtain promising results on data with ground truth. 1 Energy minimization in early vision Many early vision problems require estimating some spatially varying quantity (such as intensity, disparity or texture) from noisy measurements. Such quantities tend to be piecewise smooth; they vary smoothly at most points, but change dramatically at object boundaries. Every pixel p ∈ P must be assigned a label in some set L; for motion or stereo, the labels are disparities, while for image restoration they represent intensities. The goal is to find a labeling f : P → L that is both piecewise smooth and consistent with the observed data. ∗The authors are with the Computer Science Department, Cornell University, Ithaca, NY 14853; email: yura@cs.cornell.edu, olga@cs.cornell.edu, rdz@cs.cornell.edu

[1]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[2]  Richard Szeliski,et al.  Bayesian modeling of uncertainty in low-level vision , 2011, International Journal of Computer Vision.

[3]  Demetri Terzopoulos,et al.  Regularization of Inverse Visual Problems Involving Discontinuities , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  S. Birchfiled A Pixel Dissimilarity Measure That Is Insensitive to Image Sampling , 1998 .

[5]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[6]  D. R. Fulkerson,et al.  Flows in Networks. , 1964 .

[7]  A. Goldberg,et al.  A new approach to the maximum-flow problem , 1988, JACM.

[8]  Demetri Terzopoulos,et al.  Image Analysis Using Multigrid Relaxation Methods , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Olga Veksler,et al.  Markov random fields with efficient approximations , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[10]  Andrew Blake,et al.  Comparison of the Efficiency of Deterministic and Stochastic Algorithms for Visual Reconstruction , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[12]  KochChristof,et al.  Computational vision and regularization theory , 1985 .

[13]  Andrew V. Goldberg,et al.  A new approach to the maximum flow problem , 1986, STOC '86.

[14]  W. Eric L. Grimson,et al.  Discontinuity detection for visual surface reconstruction , 1985, Comput. Vis. Graph. Image Process..

[15]  Goos Kant,et al.  Regular Edge Labeling of 4-Connected Plane Graphs and Its Applications in Graph Drawing Problems , 1997, Theor. Comput. Sci..

[16]  Mihalis Yannakakis,et al.  The complexity of multiway cuts (extended abstract) , 1992, STOC '92.

[17]  T Poggio,et al.  Parallel integration of vision modules. , 1988, Science.

[18]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[19]  R. B. Potts Some generalized order-disorder transformations , 1952, Mathematical Proceedings of the Cambridge Philosophical Society.

[20]  D. Greig,et al.  Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[21]  Stephen T. Barnard,et al.  Stochastic stereo matching over scale , 1989, International Journal of Computer Vision.

[22]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[23]  Davi Geiger,et al.  Segmentation by grouping junctions , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[24]  Christopher M. Brown,et al.  The theory and practice of Bayesian image labeling , 1990, International Journal of Computer Vision.

[25]  Ramesh C. Jain,et al.  Using Dynamic Programming for Solving Variational Problems in Vision , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  David Lee,et al.  One-Dimensional Regularization with Discontinuities , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[28]  Andrew Blake,et al.  Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[29]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.