A new gaussian mixture conditional random field model for indoor image labeling

In this paper we present a new conditional random field (CRF) model based on Gaussian mixture potentials for indoor image labeling, which is useful in interactive room decoration system. Indoor images which posses many spatial regularities can be efficiently modeled by probabilistic graphical models such as CRF. The potential functions in CRF are usually set empirically and differently for different features depending on applications. We propose a new CRF model based on a general Gaussian mixture potential for different group of features, which has the advantage of labeling accuracy and training simplicity. The new model with belief propagation inference and stochastic gradient descent training is applied to floor region labeling of indoor images in Labelme database. Simulation results and visual effects prove our analysis. Comparing to other CRF models the new approach is more efficient for indoor image labeling tasks.

[1]  Haim H. Permuter,et al.  Gaussian mixture models of texture and colour for image database retrieval , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[2]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  Alexei A. Efros,et al.  Recovering human body configurations: combining segmentation and recognition , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[4]  Ashutosh Saxena,et al.  Make3D: Learning 3D Scene Structure from a Single Still Image , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Jamie Shotton,et al.  Contour and Texture for Visual Recognition of Object Categories: Automatic Object Recognition Using Learned Patterns of Contour and Texture , 2007 .

[6]  Yuan Qi,et al.  Diagram structure recognition by Bayesian conditional random fields , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[8]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Xuming He,et al.  Learning structured prediction models for image labeling , 2008 .

[10]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[11]  Xiaofeng Wang,et al.  A new room decoration assistance system based on 3D reconstruction and integrated service , 2008, CIVR '08.

[12]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .

[13]  Mark W. Schmidt,et al.  Accelerated training of conditional random fields with stochastic gradient methods , 2006, ICML.

[14]  Honglak Lee,et al.  A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[16]  Antonio Torralba,et al.  Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[17]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.