Semantic and Spatial Co-Occurrence Analysis on Object Pairs for Urban Scene Classification

Urban scene classifications with very-high-resolution (VHR) satellite images play an important role in urban functional-zone investigation and landscape analysis. Most scene classification techniques utilize visual features which however are weak in identifying scenes, as scenes are usually composed of diverse objects with variant visual cues. To resolve this issue, we use spatial object relations to characterize urban scenes and recognize their categories. Accordingly, two questions are considered in this study: how to measure spatial object relations and how to use these relations to classify urban scenes? First, we propose a novel scene feature, semantic and spatial co-occurrence probability (SSCP), to measure spatial relations between objects with considering their directions, distances, and semantics, which can hopefully resolve the three key issues in measuring spatial object relations, i.e., anisotropy, scale- and semantic-dependencies. Then, a semilatent Dirichlet allocation is employed to classify scenes based on the proposed SSCP. In experiments, our method is first verified based on the UC Merced data set, and then used to generate an urban functional-zone map for Beijing, China. Both experiments indicate high accuracies and state-of-the-art performances of our method. In addition, we give a deep insight into SSCP's characteristics including convergence, high efficiency, and invariance to affine transformation, which can make our method more applicable.

[1]  Arno Schäpe,et al.  Multiresolution Segmentation : an optimization approach for high quality multi-scale image segmentation , 2000 .

[2]  M. Ushanandhini,et al.  Classification of high spatial resolution images using semantic allocation level-probabilistic topic model , 2016, 2016 International Conference on Recent Trends in Information Technology (ICRTIT).

[3]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[4]  Stan Uryasev,et al.  Value-at-risk support vector machine: stability to outliers , 2013, Journal of Combinatorial Optimization.

[5]  Liangpei Zhang,et al.  Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Dirk Tiede,et al.  ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data , 2010, Int. J. Geogr. Inf. Sci..

[7]  Ming Cui,et al.  Scene classification based on multifeature probabilistic latent semantic analysis for high spatial resolution remote sensing images , 2015 .

[8]  Anil M. Cheriyadat,et al.  Unsupervised Feature Learning for Aerial Scene Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Shihong Du,et al.  Multiscale Geoscene Segmentation for Extracting Urban Functional Zones from VHR Satellite Images , 2018, Remote. Sens..

[10]  Shihong Du,et al.  A Linear Dirichlet Mixture Model for decomposing scenes: Application to analyzing urban functional zonings , 2015 .

[11]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[13]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[14]  Shawn D. Newsam,et al.  Spatial pyramid co-occurrence for image classification , 2011, 2011 International Conference on Computer Vision.

[15]  Liangpei Zhang,et al.  Automatic Labelling and Selection of Training Samples for High-Resolution Remote Sensing Image Classification over Urban Areas , 2015, Remote. Sens..

[16]  Brian P. Salmon,et al.  Multiview Deep Learning for Land-Use Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[17]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[18]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[19]  Shihong Du,et al.  Semantic Classification of Heterogeneous Urban Scenes Using Intrascene Feature Similarity and Interscene Semantic Dependency , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[20]  Gabriele Moser,et al.  Classification of urban structural types with multisource data and structured models , 2015, 2015 Joint Urban Remote Sensing Event (JURSE).

[21]  Yan Wang,et al.  Bag-of-Words model based image classification and evaluation of image sample quality in remote sensing , 2013, 2013 IEEE International Conference of IEEE Region 10 (TENCON 2013).

[22]  Nizar Bouguila,et al.  Unsupervised learning of a finite mixture model based on the Dirichlet distribution and its application , 2004, IEEE Transactions on Image Processing.

[23]  Xi Chen,et al.  Measuring the Effectiveness of Various Features for Thematic Information Extraction From Very High Resolution Remote Sensing Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[24]  David Picard,et al.  Evaluation of second-order visual features for land-use classification , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).

[25]  Mihai Datcu,et al.  Contextual Descriptors for Scene Classes in Very High Resolution SAR Images , 2012, IEEE Geoscience and Remote Sensing Letters.

[26]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[27]  Hong Sun,et al.  Unsupervised Feature Learning Via Spectral Clustering of Multidimensional Patches for Remotely Sensed Scene Classification , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[28]  Chong-Wah Ngo,et al.  Evaluating bag-of-visual-words representations in scene classification , 2007, MIR '07.

[29]  William J. Emery,et al.  Object-Based Convolutional Neural Network for High-Resolution Imagery Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[30]  Yuan Yan Tang,et al.  Simultaneous Spectral-Spatial Feature Selection and Extraction for Hyperspectral Images , 2019, IEEE Transactions on Cybernetics.

[31]  Luisa Verdoliva,et al.  Land Use Classification in Remote Sensing Images by Convolutional Neural Networks , 2015, ArXiv.

[32]  Timothy A. Warner,et al.  Forest Type Mapping using Object-specific Texture Measures from Multispectral Ikonos Imagery: Segmentation Quality and Image Classification Issues , 2009 .

[33]  Paolo Napoletano,et al.  Visual descriptors for content-based retrieval of remote-sensing images , 2016, ArXiv.

[34]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[35]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[36]  Geoffrey H. Ball,et al.  ISODATA, A NOVEL METHOD OF DATA ANALYSIS AND PATTERN CLASSIFICATION , 1965 .

[37]  Jefersson Alex dos Santos,et al.  Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[38]  Shihong Du,et al.  Hierarchical semantic cognition for urban functional zones with VHR satellite images and POI data , 2017 .

[39]  Bei Zhao,et al.  Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects , 2017, Remote. Sens..

[40]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Mihai Datcu,et al.  Latent Dirichlet Allocation for Spatial Analysis of Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[42]  L. S. Davis,et al.  An assessment of support vector machines for land cover classi(cid:142) cation , 2002 .

[43]  Jiebo Luo,et al.  Probabilistic spatial context models for scene content understanding , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[44]  Jungho Im,et al.  ISPRS Journal of Photogrammetry and Remote Sensing , 2022 .

[45]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[46]  Mihai Datcu,et al.  Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation , 2010, IEEE Geoscience and Remote Sensing Letters.

[47]  W. Eric L. Grimson,et al.  Spatial Latent Dirichlet Allocation , 2007, NIPS.

[48]  Michael I. Jordan,et al.  Probabilistic models of text and images , 2004 .

[49]  A. M. Qamar,et al.  RELIEF Algorithm and Similarity Learning for k-NN , 2011 .

[50]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Swati Jain,et al.  Weighted fusion of LBP and CCH features for effective content based image retrieval , 2016, 2016 International Conference on Signal Processing and Communications (SPCOM).

[52]  Rong Yan,et al.  On predicting rare classes with SVM ensembles in scene classification , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[53]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[54]  Wagner Coelho A. Pereira,et al.  Analysis of Co-Occurrence Texture Statistics as a Function of Gray-Level Quantization for Classifying Breast Ultrasound , 2012, IEEE Transactions on Medical Imaging.

[55]  Liangpei Zhang,et al.  On Combining Multiple Features for Hyperspectral Remote Sensing Image Classification , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[56]  Andrew Y. Ng,et al.  Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.

[57]  Di Wang,et al.  Semi-Supervised Latent Dirichlet Allocation and Its Application for Document Classification , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[58]  Yang Wang,et al.  Semi-Latent Dirichlet Allocation: A Hierarchical Model for Human Action Recognition , 2007, Workshop on Human Motion.

[59]  Yingli Tian,et al.  Pyramid of Spatial Relatons for Scene-Level Land Use Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[60]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[61]  Jitendra Malik,et al.  Geometric blur for template matching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[62]  Uwe Stilla,et al.  Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[63]  Liang-Chih Yu,et al.  Topic model allocation of conversational dialogue records by Latent Dirichlet Allocation , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.