Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

In a traditional classification problem, we wish to assign one of k labels (or classes) to each of n objects, in a way that is consistent with some observed data that we have about the problem. An active line of research in this area is concerned with classification when one has information about pairwise relationships among the objects to be classified; this issue is one of the principal motivations for the framework of Markov random fields, and it arises in areas such as image processing, biometry: and document analysis. In its most basic form, this style of analysis seeks a classification that optimizes a combinatorial function consisting of assignment costs-based on the individual choice of label we make for each object-and separation costs-based on the pair of choices we make for two "related" objects. We formulate a general classification problem of this type, the metric labeling problem; we show that it contains as special cases a number of standard classification frameworks, including several arising from the theory of Markov random fields. From the perspective of combinatorial optimization, our problem can be viewed as a substantial generalization of the multiway cut problem, and equivalent to a type of uncapacitated quadratic assignment problem. We provide the first non-trivial polynomial-time approximation algorithms for a general family of classification problems of this type. Our main result is an O(log k log log k)-approximation algorithm for the metric labeling problem, with respect to an arbitrary metric on a set of k labels, and an arbitrary weighted graph of relationships on a set of objects. For the special case in which the labels are endowed with the uniform metric-all distances are the same-our methods provide a 2-approximation.

[1]  R. B. Potts Some generalized order-disorder transformations , 1952, Mathematical Proceedings of the Cambridge Philosophical Society.

[2]  John W. Woods,et al.  Two-dimensional discrete Markovian fields , 1972, IEEE Trans. Inf. Theory.

[3]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[4]  J. Laurie Snell,et al.  Markov Random Fields and Their Applications , 1980 .

[5]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[7]  M. Queyranne Performance ratio of polynomial heuristics for triangle inequality quadratic assignment problems , 1986 .

[8]  Fernand S. Cohen,et al.  Markov random fields for image modelling and analysis , 1986 .

[9]  Allan Borodin,et al.  An optimal online algorithm for metrical task systems , 1987, STOC.

[10]  Anil K. Jain,et al.  Random field models in image analysis , 1989 .

[11]  D. Greig,et al.  Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[12]  P. Erdös,et al.  Evolutionary trees: An integer multicommodity max-flow-min-cut theorem , 1992 .

[13]  Allan Borodin,et al.  An optimal on-line algorithm for metrical task system , 1992, JACM.

[14]  Anil K. Jain,et al.  Markov random fields : theory and application , 1993 .

[15]  Mihalis Yannakakis,et al.  The Complexity of Multiterminal Cuts , 1994, SIAM J. Comput..

[16]  Panos M. Pardalos,et al.  Quadratic Assignment and Related Problems , 1994 .

[17]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[18]  A. Frigessi,et al.  Fast Approximate Maximum a Posteriori Restoration of Multicolour Images , 1995 .

[19]  Yair Bartal,et al.  Probabilistic approximation of metric spaces and its algorithmic applications , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[20]  Thomas G. Dietterich Machine-Learning Research Four Current Directions , 1997 .

[21]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[22]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Ingemar J. Cox,et al.  A maximum-flow formulation of the N-camera stereo correspondence problem , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[24]  Yuval Rabani,et al.  An improved approximation algorithm for multiway cut , 1998, STOC '98.

[25]  Piotr Indyk,et al.  Enhanced hypertext categorization using hyperlinks , 1998, SIGMOD '98.

[26]  Davi Geiger,et al.  Segmentation by grouping junctions , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[27]  Alexander V. Karzanov,et al.  Minimum 0-Extensions of Graph Metrics , 1998, Eur. J. Comb..

[28]  Olga Veksler,et al.  Markov random fields with efficient approximations , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[29]  Yair Bartal,et al.  On approximating arbitrary metrices by tree metrics , 1998, STOC '98.

[30]  Mikkel Thorup,et al.  Rounding algorithms for a geometric embedding of minimum multiway cut , 1999, STOC '99.

[31]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[32]  William H. Cunningham,et al.  Optimal 3-Terminal Cuts and Linear Programming , 1999, IPCO.