A transferable remote sensing approach to classify building structural types for seismic risk analyses: the case of Val d'Agri area (Italy)

This study proposes a methodology based on machine learning (ML) algorithms for rapid and robust classification of building structural types (STs) in multispectral remote sensing imagery aiming to assess buildings’ seismic vulnerability. The seismic behavior of buildings is strongly affected by the ST, including material, age, height, and other main structural features. Previous works deployed in situ data integrated with remote sensing information to statistically infer STs through supervised ML methods. We propose a transferable methodology with specific focus on situations with imbalanced in situ data (i.e., the number of available labeled samples for model learning differs largely between different STs). We learn a transferable model by selecting features from an exhaustive set. The transferability relies on deploying geometric features characterizing individual buildings; thus, the model is less sensitive to domain adaption problems frequently induced by e.g., changes in acquisition parameters of remotely sensed imagery. Thereby, we show that few geometry features enable generalization capabilities similar to models learned with a large number of features describing spectral, geometrical or contextual building properties. We rely on an extensive geodatabase containing almost 18,000 building footprints. We follow a Random Forest (RF)-based feature selection strategy to objectively identify most valuable features for prediction. Furthermore, the problem of unbalanced classes is addressed by adopting two approaches: downsampling the majority class and modifying the classifier internally (weighted RF). The implemented model is transferred on the challenging urban morphology of the Val d’Agri area (Italy). Results confirm the statistical robustness of the model and the importance of the geometry features, allowing for reliable identification of STs.

[1]  Qian Du,et al.  Multi-Modal and Multi-Temporal Data Fusion: Outcome of the 2012 GRSS Data Fusion Contest , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Vaishali Ganganwar,et al.  An overview of classification algorithms for imbalanced datasets , 2012 .

[3]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[4]  Qihao Weng,et al.  Population Estimation of Urban Residential Communities Using Remotely Sensed Morphologic Data , 2015, IEEE Geoscience and Remote Sensing Letters.

[5]  Angelo Masi,et al.  Fragility curves of gravity-load designed RC buildings with regularity in plan , 2015 .

[6]  Angelo Masi,et al.  Cyclic Tests on External RC Beam-Column Joints: Role of Seismic Design Level and Axial Load Value on the Ultimate Capacity , 2013 .

[7]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[8]  Angelo Masi,et al.  SURVEY OF DWELLING BUILDINGS FOR SEISMIC LOSS ASSESSMENT AT URBAN SCALE: THE CASE STUDY OF 18 VILLAGES IN VAL D'AGRI, ITALY , 2014 .

[9]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[10]  H. S. Sheshadri,et al.  On the Classification of Imbalanced Datasets , 2012 .

[11]  Robert R. Freimuth,et al.  A weighted random forests approach to improve predictive performance , 2013, Stat. Anal. Data Min..

[12]  Hannes Taubenböck,et al.  A conceptual vulnerability and risk framework as outline to identify capabilities of remote sensing , 2008 .

[13]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[14]  Jorge Gil,et al.  On the discovery of urban typologies: data mining the many dimensions of urban form , 2011, Urban Morphology.

[15]  Mohammad Khalilia,et al.  Predicting disease risks from highly imbalanced data using random forest , 2011, BMC Medical Informatics Decis. Mak..

[16]  Hannes Taubenböck,et al.  Estimation of Seismic Vulnerability Levels of Urban Structures With Multisensor Remote Sensing , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[17]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[18]  Robin Genuer,et al.  Random Forests: some methodological insights , 2008, 0811.3619.

[19]  Lorenzo Bruzzone,et al.  Domain Adaptation for the Classification of Remote Sensing Data: An Overview of Recent Advances , 2016, IEEE Geoscience and Remote Sensing Magazine.

[20]  Giovanna Cultrera,et al.  Building damage scenarios based on exploitation of Housner intensity derived from finite faults ground motion simulations , 2012, Bulletin of Earthquake Engineering.

[21]  Thomas Esch,et al.  Object-based image information fusion using multisensor earth observation data over urban areas , 2011 .

[22]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[23]  T. Esch,et al.  Settlement detection and impervious surface estimation in the Mekong Delta using optical and SAR remote sensing data , 2011 .

[24]  K. Chung The Strong Law of Large Numbers , 1951 .

[25]  Sara Casciati,et al.  Synergy of monitoring and security , 2016 .

[26]  Hannes Taubenböck,et al.  How good is the map? A multi-scale cross-comparison framework for global settlement layers: Evidence from Central Europe , 2016 .

[27]  Thomas Blaschke,et al.  Ontology-Based Classification of Building Types Detected from Airborne Laser Scanning Data , 2014, Remote. Sens..

[28]  Hannes Taubenböck,et al.  Building Types’ Classification Using Shape-Based Features and Linear Discriminant Functions , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[29]  Christiane Schmullius,et al.  Object-based land cover mapping and comprehensive feature calculation for an automated derivation of urban structure types at block level , 2014 .

[30]  Hannes Taubenböck,et al.  Multi-sensor feature fusion for very high spatial resolution built-up area extraction in temporary settlements , 2018 .

[31]  Fabio Casciati,et al.  Real-time identification of disaster areas by an open-access vision-based tool , 2015, Adv. Eng. Softw..

[32]  M. Simonnet The Strong Law of Large Numbers , 1996 .

[33]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[34]  Lorenzo Bruzzone,et al.  A Multilevel Context-Based System for Classification of Very High Spatial Resolution Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[35]  G. Cardani,et al.  Behaviour of Historic Masonry Buildings in Seismic Areas: Lessons Learned from the Umbria-Marche Earthquake , 2000 .

[36]  Will Perkins ON THE STRONG LAW OF LARGE NUMBERS , 2004 .

[37]  Nitesh V. Chawla,et al.  Editorial: special issue on learning from imbalanced data sets , 2004, SKDD.

[38]  Angelo Masi,et al.  Seismic Vulnerability Assessment of Gravity Load Designed R/C Frames , 2003 .

[39]  William J. Emery,et al.  A neural network approach using multi-scale textural metrics from very high-resolution panchromatic imagery for urban land-use classification , 2009 .

[40]  Anil M. Cheriyadat,et al.  Unsupervised Feature Learning for Aerial Scene Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Michael N. Fardis,et al.  Seismic Design, Assessment and Retrofitting of Concrete Buildings , 2009 .

[42]  Luis Ángel Ruiz Fernández,et al.  Using street based metrics to characterize urban typologies , 2014, Comput. Environ. Urban Syst..

[43]  Hannes Taubenböck,et al.  Assessment of Seismic Building Vulnerability from Space , 2014 .

[44]  Robert Weibel,et al.  An Approach for the Classification of Urban Building Structures Based on Discriminant Analysis Techniques , 2008, Trans. GIS.

[45]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[46]  Angelo Masi,et al.  VULNERABILITY ASSESSMENT OF GRAVITY-LOAD DESIGNED RC BUILDINGS: EVALUATION OF SEISMIC CAPACITY THROUGH NON LINEAR DYNAMIC ANALYSES , 2012 .

[47]  Hannes Taubenböck,et al.  Unsupervised change detection in VHR remote sensing imagery - an object-based clustering approach in a dynamic urban environment , 2017, Int. J. Appl. Earth Obs. Geoinformation.

[48]  M. Dolce,et al.  Earthquake Damage Scenarios of the Building Stock of Potenza (Southern Italy) Including Site Effects , 2003 .

[49]  Abdesselam Bouzerdoum,et al.  A supervised learning approach for imbalanced data sets , 2008, 2008 19th International Conference on Pattern Recognition.

[50]  Liangpei Zhang,et al.  A pixel shape index coupled with spectral information for classification of high spatial resolution remotely sensed imagery , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[51]  Marina Mueller,et al.  Potential of High-Resolution Satellite Data in the Context of Vulnerability of Buildings , 2006 .

[52]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[53]  Hannes Taubenböck,et al.  Assessing building vulnerability using synergistically remote sensing and civil engineering , 2009 .

[54]  Chao Chen,et al.  Using Random Forest to Learn Imbalanced Data , 2004 .

[55]  Aurélien Géron,et al.  Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems , 2017 .

[56]  Hannes Taubenböck,et al.  Estimation of seismic building structural types using multi-sensor remote sensing and machine learning techniques , 2015 .

[57]  Fernando De la Torre,et al.  Facing Imbalanced Data--Recommendations for the Use of Performance Metrics , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[58]  Guido Magenes,et al.  DEVELOPMENT OF SEISMIC VULNERABILITY ASSESSMENT METHODOLOGIES OVER THE PAST 30 YEARS , 2006 .

[59]  Jianping Wu,et al.  Automated derivation of urban building density information using airborne LiDAR data and object-based method , 2010 .

[60]  Hannes Taubenböck,et al.  On the Effect of Spatially Non-Disjoint Training and Test Samples on Estimated Model Generalization Capabilities in Supervised Classification With Spatial Features , 2017, IEEE Geoscience and Remote Sensing Letters.

[61]  Arun Kumar On the Classification of Imbalanced Datasets , 2012 .

[62]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[63]  C. Krishnaveni,et al.  On the Classification of Imbalanced Datasets , 2022 .

[64]  C. Briese,et al.  A NEW METHOD FOR BUILDING EXTRACTION IN URBAN AREAS FROM HIGH-RESOLUTION LIDAR DATA , 2002 .