CRFs for Image Classification

We use Conditional Random Fields (CRFs) to classify regions in an image. CRFs provide a discriminative framework to incorporate spatial dependencies in an image, which is more appropriate for classification tasks as opposed to a generative framework. In this paper we apply CRFs to two image classification tasks: a binary classification problem (manmade vs. natural regions in the Corel dataset), and a multiclass problem (grass, sky, tree, cow and building in the Microsoft Research, Cambridge dataset). Parameter learning is performed using Mean Field (MF) and Loopy Belief Propagation (LBP) to maximize an approximation to the conditional likelihood, and inference is done using LBP. We focus on three aspects of the classification task: feature extraction, feature aggregation, and techniques to combine binary classifiers to obtain multiclass classification. We present classification results on sample images from both datasets and provide analysis of the effects of various design choices on classification performance.

[1]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.

[2]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[3]  Martial Hebert,et al.  Discriminative Fields for Modeling Spatial Dependencies in Natural Images , 2003, NIPS.

[4]  Martial Hebert,et al.  Man-made structure detection in natural images using a causal multiscale random field , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[5]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[6]  Jianbo Shi,et al.  Object-Specific Figure-Ground Segregation , 2003, CVPR.

[7]  Hanna M. Wallach,et al.  Conditional Random Fields: An Introduction , 2004 .

[8]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Mark W. Schmidt,et al.  Accelerated training of conditional random fields with stochastic gradient methods , 2006, ICML.

[10]  Vladimir Kolmogorov,et al.  Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.