Using multi-level fusion of local features for land-use scene classification with high spatial resolution images in urban coastal zones

Abstract Monitoring scene-level land-use in urban coastal zones has become a critical and challenging task, due to the rising risk of marine disasters and the greater number of scene classes such as harbors. Since street blocks are physical containers of different classes of land-use in urban zones, some scene classification methods based on high-spatial-resolution remote sensing images take blocks segmented by roads as classification units. However, these methods extract handcrafted low-level features from remote sensing images, limiting their ability to represent street blocks. To extract semantically meaningful representations of street blocks, the sparse auto-encoder (SAE) model was employed for local feature extraction in this paper and a multi-level method based on the fusion of local features was proposed for block-based land-use scene classification in urban coastal zones. First, convolved feature maps of street blocks were extracted by taking the hidden layer of the SAE as convolution kernels. Then, the local features were fused at three levels to generate more robust and discriminative representations of patches in convolved feature maps. The combination patterns and the absolute relationship of local features were captured at the first and second level, respectively. A convolution neural network was utilized to make the local features more discriminative to semantic information at the third level. Finally, the bag-of-visual-words model was employed to generate global features for street blocks. The proposed method was tested for Qingdao, China using Gaofen-2 (GF-2) satellite images and an overall accuracy of 83.80% was achieved in the study area. The classification results indicate that the proposed method in concert with GF-2 images has potential for accurately monitoring land-use scenes in urban coastal zones.

[1]  Peijun Du,et al.  Mid-Level Feature Representation via Sparse Autoencoder for Remotely Sensed Scene Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Geoffrey E. Hinton,et al.  Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Shihong Du,et al.  A Linear Dirichlet Mixture Model for decomposing scenes: Application to analyzing urban functional zonings , 2015 .

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  M. Herold,et al.  Spatial Metrics and Image Texture for Mapping Urban Land Use , 2003 .

[6]  N. Wood,et al.  Stakeholder perspectives on land-use strategies for adapting to climate-change-enhanced coastal hazards: Sarasota, Florida , 2010 .

[7]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[8]  Andrew Y. Ng,et al.  The Importance of Encoding Versus Training with Sparse Coding and Vector Quantization , 2011, ICML.

[9]  Jun Li,et al.  Discriminative Low-Rank Gabor Filtering for Spectral–Spatial Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Yanfei Zhong,et al.  A spectral–structural bag-of-features scene classifier for very high spatial resolution remote sensing imagery , 2016 .

[11]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Jianzhong Guo,et al.  Gabor-Filtering-Based Completed Local Binary Patterns for Land-Use Scene Classification , 2015, 2015 IEEE International Conference on Multimedia Big Data.

[13]  K. Tempfli,et al.  Hierarchical image-object based structural analysis toward urban land use classification using high-resolution imagery and airborne LIDAR data , 2002 .

[14]  Gui-Song Xia,et al.  The bag-of-visual-words scene classifier combining local and global features for high spatial resolution imagery , 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[15]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[16]  Christian Berger,et al.  From land cover-graphs to urban structure types , 2014, Int. J. Geogr. Inf. Sci..

[17]  Bei Zhao,et al.  Scene classification based on a hierarchical convolutional sparse auto-encoder for high spatial resolution imagery , 2017 .

[18]  Lei Guo,et al.  Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images , 2015, IET Comput. Vis..

[19]  Chong-Wah Ngo,et al.  Evaluating bag-of-visual-words representations in scene classification , 2007, MIR '07.

[20]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[21]  Shihong Du,et al.  Hierarchical semantic cognition for urban functional zones with VHR satellite images and POI data , 2017 .

[22]  Bei Zhao,et al.  Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects , 2017, Remote. Sens..

[23]  Tao Chen,et al.  Unsupervised Feature Learning for Land-Use Scene Recognition , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25]  Alfred Stein,et al.  Incorporating Open Source Data for Bayesian Classification of Urban Land Use From VHR Stereo Images , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[26]  Qian Du,et al.  Local Binary Patterns and Extreme Learning Machine for Hyperspectral Imagery Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[28]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[29]  Shougeng Hu,et al.  Automated urban land-use classification with remote sensing , 2013 .

[30]  Shihong Du,et al.  Semantic Classification of Heterogeneous Urban Scenes Using Intrascene Feature Similarity and Interscene Semantic Dependency , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[31]  Michael B. Lowry,et al.  Comparing spatial metrics that quantify urban form , 2014, Comput. Environ. Urban Syst..

[32]  Mihai Datcu,et al.  Latent Dirichlet Allocation for Spatial Analysis of Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[34]  Honglak Lee,et al.  Sparse deep belief net model for visual area V2 , 2007, NIPS.

[35]  Jing Guo,et al.  Storm surge risk assessment for Yuhuan County in Taizhou City , 2016, Natural Hazards.

[36]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[37]  Qingming Zhan,et al.  Urban land use extraction from Very High Resolution remote sensing imagery using a Bayesian network , 2016 .

[38]  Marc'Aurelio Ranzato,et al.  Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[39]  A. Marcomini,et al.  Assessing storm surge risk under future sea-level rise scenarios: a case study in the North Adriatic coast , 2017, Journal of Coastal Conservation.

[40]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[41]  Véronique Prinet,et al.  Mixture Distributions for Weakly Supervised Classification in Remote Sensing Images , 2008, BMVC.

[42]  Y. C. Pati,et al.  Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[43]  Alexis J. Comber,et al.  Community detection in spatial networks: Inferring land use from a planar graph of land cover objects , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[44]  Shihong Du,et al.  Scene classification using multi-scale deeply described visual words , 2016 .

[45]  Andrea Taramelli,et al.  A GIS-based approach for hurricane hazard and vulnerability assessment in the Cayman Islands , 2015 .