Scene Classification Algorithm Based on Contextual Semantic Information Extended PLSA Model

In this paper, a novel scene classification algorithm based on contextual visual words combined with extended PLSA model is proposed in which the traditional visual words by contextual information are improved. First, the EICS-LBP (Edge Improved Center Symmetric Local Binary Pattern) and statistical domain color features are extracted from the edge dense sampling region and its 8 neighborhood followed by the computed normalized histogram intersection and the creation of the new feature by linear combination of local information and its contextual intersection information. The visual vocabulary is formed by clustering from dense sampling regions respectively. Then the Bag-Of-Words model is used to describe the image which gives us richer information of scene images and reduces ambiguities and errors. Finally, the potential semantics by extended PLSA model and the confusion table by KNN classifier are obtained. Experiment results show that this method could achieve a higher accuracy, especially performing well in multi-edge color images with contextual information. Besides, it does not require manual annotation on the scene.

[1]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[3]  Antonio Torralba,et al.  Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[4]  Marko Heikkilä,et al.  Description of interest regions with local binary patterns , 2009, Pattern Recognit..

[5]  N. H. C. Yung,et al.  Scene categorization via contextual visual words , 2010, Pattern Recognit..

[6]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[7]  Arnold W. M. Smeulders,et al.  Color-based object recognition , 1997, Pattern Recognit..

[8]  Kinjiro Amano,et al.  Approaching ideal observer efficiency in using color to retrieve information from natural scenes. , 2009, Journal of the Optical Society of America. A, Optics, image science, and vision.

[9]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[11]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[12]  Yap-Peng Tan,et al.  A novel multi-scale spatial-color descriptor for content-based image retrieval , 2002, 7th International Conference on Control, Automation, Robotics and Vision, 2002. ICARCV 2002..

[13]  Martial Hebert,et al.  A hierarchical field framework for unified context-based classification , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[15]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Francesca Odone,et al.  Building kernels from binary strings for image matching , 2005, IEEE Transactions on Image Processing.

[17]  Jing Liu,et al.  Expanded bag of words representation for object classification , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[18]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[19]  Bernt Schiele,et al.  Natural Scene Retrieval Based on a Semantic Modeling Step , 2004, CIVR.