Large-scale conditional random field for natural outdoor scene labeling

We propose a novel large-scale conditional random field model with respect to the problem of natural outdoor scene labeling. The novelty of the proposed method lies in three aspects: 1. features from two neighboring regions are concatenated to form the input of the pair-wise classifier to compensate for the simultaneous feature deviation of neighboring regions; 2. the definition of a generalized neighboring system and the incorporation of direction-specific patterns in conditional random field models based on the generalized neighboring system to better simulate the visual cognition of human being; and 3. the definition of a similarity criterion based on the bags-of-words expression to facilitate the incorporation of semantic patterns. The proposed model is first evaluated over the Corel dataset. Both qualitative and quantitative results show that our model is capable of modeling large-scale spatial relationships between objects in natural outdoor scenes, and achieves better results than other existing conditional random field models. Furthermore, our model is also evaluated over several other natural datasets, which are taken from logged field tests, to further demonstrate the adaptability of our model to different lighting conditions.

[1]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[2]  Christopher K. I. Williams,et al.  Image Modeling with Position-Encoding Dynamic Trees , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[4]  Luc Van Gool,et al.  Modeling scenes with local descriptors and latent aspects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[6]  Osamu Hasegawa,et al.  Random Field Model for Integration of Local Information and Global Information , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  R. Zemel,et al.  Multiscale conditional random fields for image labeling , 2004, CVPR 2004.

[8]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[10]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11]  Larry D. Jackel,et al.  The DARPA LAGR program: Goals, challenges, methodology, and phase I results , 2006, J. Field Robotics.

[12]  Christopher K. I. Williams,et al.  Combining Belief Networks and Neural Networks for Scene Segmentation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Richard S. Zemel,et al.  Learning and Incorporating Top-Down Cues in Image Segmentation , 2006, ECCV.

[16]  Martial Hebert,et al.  Discriminative Random Fields , 2006, International Journal of Computer Vision.

[17]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.