Disaster damage detection through synergistic use of deep learning and 3D point cloud features derived from very high resolution oblique aerial images, and multiple-kernel-learning

Oblique aerial images show both building roofs and facades, and have therefore been recognized as a potential source for detecting severe building damage caused by destructive events such as earthquakes. This makes them an important source of information for first responders and other stakeholders involved in post-disaster response. Several automated methods based on supervised learning have already been demonstrated for damage detection using oblique airborne images. However, they often do not generalize well when data from new, unseen sites need to be processed, which hampers their practical use. Reasons for this limitation include image and scene characteristics, though the most prominent one relates to the image features used for training the classifier. Recently, features based on deep learning approaches, such as convolutional neural networks (CNNs), have been shown to be more effective than conventional hand-crafted features, and have become the state of the art in many domains, including remote sensing. Moreover, oblique images are often captured with high block overlap, which facilitates the generation of dense 3D point clouds, an ideal source from which to derive geometric characteristics. We hypothesized that CNN features, either independently or in combination with 3D point cloud features, would yield improved damage detection performance. To test this, we evaluated CNN and 3D features, both independently and in combination, on images from manned and unmanned aerial platforms over several geographic locations that vary significantly in image and scene characteristics. A multiple-kernel-learning framework, an effective way of integrating features from different modalities, was used to combine the two feature sets for classification.
The results are encouraging: CNN features alone produced an average classification accuracy of about 91%, and integrating 3D point cloud features added about 3 percentage points (i.e., an average classification accuracy of 94%). The contribution of 3D point cloud features becomes more evident in the model transferability scenario (i.e., training and testing samples from different sites that vary slightly in the aforementioned characteristics), where integrating CNN and 3D point cloud features significantly improved transferability accuracy, by up to 7% compared with CNN features alone. Overall, an average accuracy of 85% was achieved for the model transferability scenario across all experiments. Our main conclusion is that such an approach qualifies for practical use.
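
To make the feature-combination step more concrete, the following is a minimal sketch of one simple multiple-kernel-learning variant. It is not the authors' implementation. It assumes one RBF kernel per modality, kernel weights set heuristically from kernel-target alignment, and a kernel ridge classifier standing in for whatever classifier the paper used. The arrays `X_cnn` and `X_3d` are synthetic stand-ins for CNN and 3D point cloud feature vectors, and all function names are hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def alignment(K, y):
    """Kernel-target alignment: cosine similarity between K and y y^T."""
    Y = np.outer(y, y)
    return (K * Y).sum() / (np.linalg.norm(K) * np.linalg.norm(Y))

def mkl_weights(kernels, y):
    """Heuristic weights: clipped alignments normalised to sum to 1."""
    a = np.array([max(alignment(K, y), 0.0) for K in kernels])
    if a.sum() == 0.0:  # fall back to uniform weights
        return np.full(len(kernels), 1.0 / len(kernels))
    return a / a.sum()

def fit_kernel_ridge(K, y, lam=1e-2):
    """Solve (K + lam * I) alpha = y for labels in {-1, +1}."""
    return np.linalg.solve(K + lam * np.eye(len(y)), y)

# Synthetic data (assumption): +1 = damaged, -1 = undamaged.
rng = np.random.default_rng(0)
n = 200
y = np.where(rng.random(n) < 0.5, -1.0, 1.0)
X_cnn = rng.normal(size=(n, 16)) + 0.8 * y[:, None]  # stand-in CNN features
X_3d = rng.normal(size=(n, 4)) + 0.8 * y[:, None]    # stand-in 3D features

train, test = np.arange(0, 150), np.arange(150, n)
modalities = [X_cnn, X_3d]
gammas = [1.0 / X.shape[1] for X in modalities]  # one bandwidth per modality

# One training kernel per modality, then a weighted sum of kernels.
K_train = [rbf_kernel(X[train], X[train], g) for X, g in zip(modalities, gammas)]
weights = mkl_weights(K_train, y[train])
K_combined = sum(w * K for w, K in zip(weights, K_train))
alpha = fit_kernel_ridge(K_combined, y[train])

# Apply the same weights to the test-versus-train kernels.
K_test = sum(
    w * rbf_kernel(X[test], X[train], g)
    for w, X, g in zip(weights, modalities, gammas)
)
pred = np.sign(K_test @ alpha)
accuracy = float((pred == y[test]).mean())
print("kernel weights:", weights, "test accuracy:", accuracy)
```

A convex combination of valid kernels is itself a valid kernel, so each modality can keep its own similarity measure and bandwidth while a single classifier is trained on the combined kernel. Full MKL formulations typically learn the weights jointly with the classifier rather than fixing them beforehand as this sketch does.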
