Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery

Scene classification has been proved to be an effective method for high spatial resolution (HSR) remote sensing image semantic interpretation. The probabilistic topic model (PTM) has been successfully applied to natural scenes by utilizing a single feature (e.g., the spectral feature); however, it is inadequate for HSR images due to the complex structure of the land-cover classes. Although several studies have investigated techniques that combine multiple features, the different features are usually quantized after simple concatenation (CAT-PTM). Unfortunately, due to the inadequate fusion capacity of k-means clustering, the words of the visual dictionary obtained by CAT-PTM are highly correlated. In this paper, a semantic allocation level (SAL) multifeature fusion strategy based on PTM, namely, SAL-PTM (SAL-pLSA and SAL-LDA) for HSR imagery is proposed. In SAL-PTM: 1) the complementary spectral, texture, and scale-invariant-featuretransform features are effectively combined; 2) the three features are extracted and quantized separately by k-means clustering, which can provide appropriate low-level feature descriptions for the semantic representations; and 3)the latent semantic allocations of the three features are captured separately by PTM, which follows the core idea of PTM-based scene classification. The probabilistic latent semantic analysis (pLSA) and latent Dirichlet allocation (LDA) models were compared to test the effect of different PTMs for HSR imagery. A U.S. Geological Survey data set and the UC Merced data set were utilized to evaluate SAL-PTM in comparison with the conventional methods. The experimental results confirmed that SAL-PTM is superior to the single-feature methods and CAT-PTM in the scene classification of HSR imagery.

[1]  Piotr Tokarczyk,et al.  Features, Color Spaces, and Boosting: New Insights on Semantic Classification of Remote Sensing Images , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Dengxin Dai,et al.  Satellite Image Classification via Two-Layer Sparse Coding With Biased Image Representation , 2011, IEEE Geoscience and Remote Sensing Letters.

[4]  Shawn D. Newsam,et al.  Spatial pyramid co-occurrence for image classification , 2011, 2011 International Conference on Computer Vision.

[5]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[6]  Chong Wang,et al.  Exploring relations of visual codes for image classification , 2011, CVPR 2011.

[7]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Wen Yang,et al.  High-resolution satellite scene classification using a sparse coding based multiple feature combination , 2012 .

[9]  Paolo Gamba,et al.  Improved VHR Urban Area Mapping Exploiting Object Boundaries , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Tao Mei,et al.  Contextual Bag-of-Words for Visual Categorization , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Q. Mcnemar Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.

[12]  Ioannis Pratikakis,et al.  Bag of spatio-visual words for context inference in scene classification , 2013, Pattern Recognit..

[13]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Selim Aksoy,et al.  VisiMine: interactive mining in image databases , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[15]  Johan A. K. Suykens,et al.  Least squares support vector machines classifiers : a multi two-spiral benchmark problem , 2001 .

[16]  Anil M. Cheriyadat,et al.  Unsupervised Feature Learning for Aerial Scene Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[17]  Imdad Ali Rizvi,et al.  Object-Based Image Analysis of High-Resolution Satellite Images Using Modified Cloud Basis Function Neural Network and Probabilistic Relaxation Labeling Process , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Mihai Datcu,et al.  Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[19]  DeLiang Wang,et al.  Scene analysis by integrating primitive segmentation and associative memory , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[20]  Hongliang Li,et al.  Automatic Annotation of Multispectral Satellite Images Using Author–Topic Model , 2012, IEEE Geoscience and Remote Sensing Letters.

[21]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Xuelong Li,et al.  Rank Preserving Sparse Learning for Kinect Based Scene Classification , 2013, IEEE Transactions on Cybernetics.

[23]  Mihai Datcu,et al.  Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation , 2010, IEEE Geoscience and Remote Sensing Letters.

[24]  Gang Liu,et al.  A Hierarchical Scheme of Multiple Feature Fusion for High-Resolution Satellite Scene Categorization , 2013, ICVS.

[25]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[26]  Hongliang Li,et al.  Semantic Annotation of Satellite Images Using Author–Genre–Topic Model , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Mihai Datcu,et al.  Latent Dirichlet Allocation for Spatial Analysis of Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[29]  Tinne Tuytelaars,et al.  Mining Mid-level Features for Image Classification , 2014, International Journal of Computer Vision.

[30]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[31]  Yuliya Tarabalka,et al.  Best Merge Region-Growing Segmentation With Integrated Nonadjacent Region Object Aggregation , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Hong Sun,et al.  Unsupervised Satellite Image Classification Using Markov Field Topic Model , 2013, IEEE Geoscience and Remote Sensing Letters.

[33]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[34]  Jean-Marc Odobez,et al.  A Thousand Words in a Scene , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[36]  Tieniu Tan,et al.  Group encoding of local features in image classification , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[37]  Yingli Tian,et al.  Evaluating effectiveness of Latent Dirichlet Allocation model for scene classification , 2011, 2011 20th Annual Wireless and Optical Communications Conference (WOCC).

[38]  Hongqi Wang,et al.  Image Classification Based on pLSA Fusing Spatial Relationships Between Topics , 2012, IEEE Signal Processing Letters.

[39]  Dewen Hu,et al.  Scene classification using a multi-resolution bag-of-features model , 2013, Pattern Recognit..

[40]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[41]  E. Wolff,et al.  Textural and contextual land-cover classification using single and multiple classifier systems , 2002 .

[42]  Andrea Baraldi,et al.  An investigation of the textural characteristics associated with gray level cooccurrence matrix statistical parameters , 1995, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[44]  Luc Van Gool,et al.  Modeling scenes with local descriptors and latent aspects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[45]  A. Unnikrishnan,et al.  GREY LEVEL CO-OCCURRENCE MATRICES : GENERALISATION AND SOME NEW FEATURES , 2012, 1205.4831.

[46]  Jiebo Luo,et al.  Image transform bootstrapping and its applications to semantic scene classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[47]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[48]  Min Wang,et al.  Remote-sensing image retrieval by combining image visual and semantic features , 2013 .

[49]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[50]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[51]  Jonathan Cheung-Wai Chan,et al.  Improved Classification of VHR Images of Urban Areas Using Directional Morphological Profiles , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Anderson Rocha,et al.  A framework for selection and fusion of pattern classifiers in multimedia recognition , 2014, Pattern Recognit. Lett..

[53]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[54]  Sylvie Philipp-Foliguet,et al.  Multiscale Classification of Remote Sensing Images , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[55]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[56]  Xinwei Zheng,et al.  Automatic Annotation of Satellite Images via Multifeature Joint Sparse Coding With Spatial Relation Constraint , 2013, IEEE Geoscience and Remote Sensing Letters.

[57]  F. Parmiggiani,et al.  An investigation of the textural characteristics associated with gray level cooccurrence matrix statistical parameters , 1995, IEEE Transactions on Geoscience and Remote Sensing.

[58]  Selim Aksoy,et al.  Learning bayesian classifiers for scene classification with a visual grammar , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[59]  Robert Marti,et al.  Which is the best way to organize/classify images by content? , 2007, Image Vis. Comput..