Guest Editorial: Special Issue on Structured Prediction and Inference

Computer vision has been profoundly influenced by machine learning in the past two decades. Canonical papers in the field, such as [Turk and Pentland(1991)], have applied statistical methods to visual data to achieve results that are both compelling and accurate. Classification algorithms, such as support vector machines, are now commonly applied to discriminatively train vision systems. Such systems empirically minimize risk functionals that measure the expected classification error rate given an i.i.d. sampling assumption [Vapnik(1998), Scholkopf and Smola(2002)]. While many intermediate goals in computer vision can be formulated as classification, regression, or dimensionality reduction– the settings most commonly addressed in the machine learning literature–more appropriate statistical methods are needed to train vision systems that make collections of related predictions, such as in segmentation, parts based models, or scene layout analysis. In these settings, the independence assumptions made by binary classifiers no longer hold, and independent prediction may not be computationally or statistically feasible. Structured output prediction [Bakir et al(2007)Bakir, Hofmann, Scholkopf, Smola, Taskar, and Vishwanathan, Nowozin and Lampert(2011)] is the task of predicting related variables given some input, such as image or video data. Study of structured output prediction in the machine learning field has led to a number of techniques be-

[1]  Bernt Schiele,et al.  Discriminative Appearance Models for Pictorial Structures , 2011, International Journal of Computer Vision.

[2]  Rajesh P. N. Rao,et al.  Fast Structured Prediction Using Large Margin Sigmoid Belief Networks , 2012, International Journal of Computer Vision.

[3]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[6]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[7]  Gökhan BakIr,et al.  Predicting Structured Data , 2008 .

[8]  Ben Taskar,et al.  Max-Margin Markov Networks , 2003, NIPS.

[9]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[10]  NowozinSebastian,et al.  Structured Learning and Prediction in Computer Vision , 2011 .

[11]  Thomas Hofmann,et al.  Predicting Structured Data (Neural Information Processing) , 2007 .

[12]  Christopher Joseph Pal,et al.  On Learning Conditional Random Fields for Stereo , 2010, International Journal of Computer Vision.

[13]  Motoaki Kawanabe,et al.  On Taxonomies for Multi-class Image Categorization , 2012, International Journal of Computer Vision.

[14]  Sebastian Nowozin,et al.  Structured Learning and Prediction in Computer Vision , 2011, Found. Trends Comput. Graph. Vis..