Buildings Detection in VHR SAR Images Using Fully Convolution Neural Networks

This paper addresses the highly challenging problem of automatically detecting man-made structures especially buildings in very high-resolution (VHR) synthetic aperture radar (SAR) images. In this context, this paper has two major contributions. First, it presents a novel and generic workflow that initially classifies the spaceborne SAR tomography (TomoSAR) point clouds—generated by processing VHR SAR image stacks using advanced interferometric techniques known as TomoSAR—into buildings and nonbuildings with the aid of auxiliary information (i.e., either using openly available 2-D building footprints or adopting an optical image classification scheme) and later back project the extracted building points onto the SAR imaging coordinates to produce automatic large-scale benchmark labeled (buildings/nonbuildings) SAR data sets. Second, these labeled data sets (i.e., building masks) have been utilized to construct and train the state-of-the-art deep fully convolution neural networks with an additional conditional random field represented as a recurrent neural network to detect building regions in a single VHR SAR image. Such a cascaded formation has been successfully employed in computer vision and remote sensing fields for optical image classification but, to our knowledge, has not been applied to SAR images. The results of the building detection are illustrated and validated over a TerraSAR-X VHR spotlight SAR image covering approximately 39 km 2—almost the whole city of Berlin— with the mean pixel accuracies of around 93.84%.

[1]  Cuizhen Wang,et al.  Improved Building Extraction With Integrated Decomposition of Time-Frequency and Entropy-Alpha Using Polarimetric SAR Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[3]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[4]  Uwe Soergel,et al.  Combining High-Resolution Optical and InSAR Features for Height Estimation of Buildings With Flat Roofs , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Zhipeng Liu,et al.  Adaptive boosting for SAR automatic target recognition , 2007, IEEE Transactions on Aerospace and Electronic Systems.

[6]  Shanshan Chen,et al.  Automatic Recognition of Isolated Buildings on Single-Aspect SAR Image Using Range Detector , 2015, IEEE Geoscience and Remote Sensing Letters.

[7]  L. Cascini,et al.  Detection and monitoring of facilities exposed to subsidence phenomena via past and current generation SAR sensors , 2013 .

[8]  Diego Reale,et al.  Empirical fragility and vulnerability curves for buildings exposed to slow-moving landslides at medium and large scales , 2017, Landslides.

[9]  Shunjun Wu,et al.  SAR Image Segmentation Based on Mixture Context and Wavelet Hidden-Class-Label Markov Random Field , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[10]  Yongfeng Cao,et al.  Detecting the number of buildings in a single high-resolution SAR image , 2014 .

[11]  Gangyao Kuang,et al.  Building detection from urban SAR image using building characteristics and contextual information , 2013, EURASIP J. Adv. Signal Process..

[12]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[13]  H. Scott Clouse,et al.  Convolutional neural networks for synthetic aperture radar classification , 2016, SPIE Defense + Security.

[14]  Heng Zhang,et al.  Building extraction from high-resolution SAR imagery based on deep neural networks , 2017 .

[15]  Richard Bamler,et al.  Ray-Tracing Simulation Techniques for Understanding High-Resolution SAR Images , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Yue Li,et al.  Multiscale convolutional neural network for the detection of built-up areas in high-resolution SAR images , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[17]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[18]  Xiao Xiang Zhu,et al.  Vehicle Instance Segmentation From Aerial Image and Video Using a Multitask Learning Residual Fully Convolutional Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Qun Zhao,et al.  Support vector machines for SAR automatic target recognition , 2001 .

[20]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[21]  Yu Zhou,et al.  Polarimetric SAR Image Classification Using Deep Convolutional Neural Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[22]  Vibhav Vineet,et al.  Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[23]  Xiao Xiang Zhu,et al.  Very High Resolution Tomographic SAR Inversion for Urban Infrastructure Monitoring — A Sparse and Nonlinear Tour , 2011 .

[24]  Irwin Sobel,et al.  An Isotropic 3×3 image gradient operator , 1990 .

[25]  Hongwei Liu,et al.  Convolutional Neural Network With Data Augmentation for SAR Target Recognition , 2016, IEEE Geoscience and Remote Sensing Letters.

[26]  Pascal Neis,et al.  Quality assessment for building footprints data on OpenStreetMap , 2014, Int. J. Geogr. Inf. Sci..

[27]  Xiaoxiang Zhu,et al.  Joint Sparsity in SAR Tomography for Urban Mapping , 2015, IEEE Journal of Selected Topics in Signal Processing.

[28]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[29]  Shiyong Cui,et al.  Convolutional Neural Network for SAR image classification at patch level , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[30]  Mihai Datcu,et al.  Stochastic geometrical modeling for built-up area understanding from a single SAR intensity image with meter resolution , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Lorenzo Bruzzone,et al.  Building Height Retrieval From VHR SAR Imagery Based on an Iterative Simulation and Matching Technique , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Peter Reinartz,et al.  AUTOMATIC INTERPRETATION OF HIGH RESOLUTION SAR IMAGES: FIRST RESULTS OF SAR IMAGE SIMULATION FOR SINGLE BUILDINGS , 2012 .

[33]  Xiao Xiang Zhu,et al.  Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN , 2018, IEEE Geoscience and Remote Sensing Letters.

[34]  Richard Bamler,et al.  Automatic SAR Simulation Technique for Object Identification in Complex Urban Scenarios , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[35]  Richard Bamler,et al.  Very High Resolution Spaceborne SAR Tomography in Urban Environment , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Horst Bischof,et al.  Alternating Decision Forests , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[39]  Gianfranco Fornaro,et al.  Three-dimensional multipass SAR focusing: experiments with long-term spaceborne data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Uwe Soergel,et al.  Building Detection From One Orthophoto and High-Resolution InSAR Data Using Conditional Random Fields , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[41]  Xiao Xiang Zhu,et al.  Fusing Meter-Resolution 4-D InSAR Point Clouds and Optical Images for Semantic Urban Infrastructure Monitoring , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[42]  Florence Tupin,et al.  Extraction and Three-Dimensional Reconstruction of Isolated Buildings in Urban Scenes From High-Resolution Optical and SAR Spaceborne Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[43]  David Malmgren-Hansen,et al.  Convolutional neural networks for SAR image segmentation , 2015, 2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[44]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[45]  Maoguo Gong,et al.  Change Detection in Synthetic Aperture Radar Images Based on Deep Neural Networks , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[46]  Xiao Xiang Zhu,et al.  Tomo-GENESIS: DLR's tomographic SAR processing system , 2013, Joint Urban Remote Sensing Event 2013.

[47]  Lorenzo Bruzzone,et al.  Automatic Detection and Reconstruction of Building Radar Footprints From Single VHR SAR Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[48]  Xiao Xiang Zhu,et al.  Extraction of Buildings in VHR SAR Images Using Fully Convolution Neural Networks , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[49]  Xiao Xiang Zhu,et al.  Nonlocal InSAR Filtering for DEM generation and Addressing the Staircasing Effect , 2016 .

[50]  Roberto Cipolla,et al.  Semantic texton forests for image categorization and segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Francesco Soldovieri,et al.  Three-dimensional focusing with multipass SAR data , 2003, IEEE Trans. Geosci. Remote. Sens..