Multivariate geotechnical zonation of seismic site effects with clustering-blended model for a city area, South Korea

Abstract The site classification system in the seismic design code and its dependent zonation should be guaranteed to represent the local spatial uncertainty of subsurface features, but have been uniformly used based on the site response parameters. Spatial interpolation-based zonation is only practically feasible if there are clear-cut stochastic/spatial correlations in geotechnical/geophysical measurements. The geology and terrain features can be substituted as an influential proxy for site amplification. To develop cluster-oriented zonation considering the spatial heterogeneity of the different site response parameters focusing on an uninvestigated area, this study proposes a new approach for multivariate site classification blended with geographic information system (GIS)-based spatial clustering and machine learning (ML)-based clustering ensemble technologies. GIS-based clustering characterizes a hot spot cluster with statistical and spatial correlation values of the site response parameters and defines the relative weight using the Gi∗ Z-score as the index of spatial heterogeneity. ML-based clustering ensembles aim to combine the clustering model in terms of consistency and performance, and are designed for optimization through a consensus function by comparing the fitness with the site classification system to obtain better results than individual clustering algorithms. The novelty of the proposed workflow is the stepwise improvement of the proposed models compared with the zonation phases and practical methods.

[1]  Jennifer L. Dungan,et al.  A balanced view of scale in spatial statistical analysis , 2002 .

[2]  Fabian J Theis,et al.  Deep learning: new computational modelling techniques for genomics , 2019, Nature Reviews Genetics.

[3]  Philip H. Stauffer,et al.  Heterogeneity-assisted carbon dioxide storage in marine sediments , 2018, Applied Energy.

[4]  D. Wald,et al.  Review Article Topographic Slope as a Proxy for Seismic Site Conditions and Amplification , 2007 .

[5]  Choong-Ki Chung,et al.  Geo-spatial data integration for subsurface stratification of dam site with outlier analyses , 2016, Environmental Earth Sciences.

[6]  Mahdi Hashemi,et al.  A GIS-based earthquake damage assessment and settlement methodology , 2011 .

[7]  Chang-Guk Sun,et al.  Correlations Between Shear Wave Velocity and In-Situ Penetration Test Results for Korean Soil Deposits , 2013, Pure and Applied Geophysics.

[8]  Amol P. Bhagat,et al.  Penalty Parameter Selection for Hierarchical Data Stream Clustering , 2016 .

[9]  Chang-Guk Sun,et al.  GIS-based regional assessment of seismic site effects considering the spatial uncertainty of site-specific geotechnical characteristics in coastal and inland urban areas , 2017 .

[10]  Chang-Guk Sun,et al.  GIS-Based Optimum Geospatial Characterization for Seismic Site Effect Assessment in an Inland Urban Area, South Korea , 2020, Applied Sciences.

[11]  Brendon A. Bradley,et al.  Critical Parameters Affecting Bias and Variability in Site‐Response Analyses Using KiK‐net Downhole Array Data , 2013 .

[12]  A. Scolobig,et al.  Using reasoned imagination to learn about cascading hazards: a pilot study , 2016 .

[13]  Saburoh Midorikawa,et al.  AVERAGE SHEAR-WAVE VELOCITY MAPPING USING JAPAN ENGINEERING GEOMORPHOLOGIC CLASSIFICATION MAP , 2005 .

[14]  Thomas A. Wettergren,et al.  A ROC-Based Approach for Developing Optimal Strategies in UUV Search Planning , 2018, IEEE Journal of Oceanic Engineering.

[15]  Randal S. Olson,et al.  Relief-Based Feature Selection: Introduction and Review , 2017, J. Biomed. Informatics.

[16]  Chang-Dong Wang,et al.  Locally Weighted Ensemble Clustering , 2016, IEEE Transactions on Cybernetics.

[17]  Oy Leuangthong,et al.  Minimum Acceptance Criteria for Geostatistical Realizations , 2004 .

[18]  Raymond B. Seed,et al.  New Site Coefficients and Site Classification System Used in Recent Building Seismic Code Provisions , 2000 .

[19]  A. Santo,et al.  Seismic soil classification of Italy based on surface geology and shear-wave velocity measurements , 2019, Soil Dynamics and Earthquake Engineering.

[20]  Canlong Zhang,et al.  Combined constraint-based with metric-based in semi-supervised clustering ensemble , 2017, International Journal of Machine Learning and Cybernetics.

[21]  Choong-Ki Chung,et al.  A Three-Dimensional Geotechnical Spatial Modeling Method for Borehole Dataset Using Optimization of Geostatistical Approaches , 2020 .

[22]  T. Houle,et al.  Statistical Evaluation of a Biomarker , 2010, Anesthesiology.

[23]  V. Prasannakumar,et al.  Spatio-Temporal Clustering of Road Accidents: GIS Based Analysis and Assessment , 2011 .

[24]  Ioan Ianos,et al.  Applications of Principal Component Analysis Integrated with GIS , 2012 .

[25]  S. Moon,et al.  Geochemical evidence for K-metasomatism related to uranium enrichment in Daejeon granitic rocks near the central Ogcheon Metamorphic Belt, Korea , 2018, Geosciences Journal.

[26]  Hossam Faris,et al.  Evolutionary static and dynamic clustering algorithms based on multi-verse optimizer , 2018, Eng. Appl. Artif. Intell..

[27]  Rich Caruana,et al.  Consensus Clusterings , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[28]  Chang-Guk Sun,et al.  Geospatial Assessment of the Post-Earthquake Hazard of the 2017 Pohang Earthquake Considering Seismic Site Effects , 2018, ISPRS Int. J. Geo Inf..

[29]  Yafit Cohen,et al.  A weighted multivariate spatial clustering model to determine irrigation management zones , 2019, Comput. Electron. Agric..

[30]  Choong-Ki Chung,et al.  Application of statistical geo-spatial information technology to soil stratification in the Seoul metropolitan area , 2012 .

[31]  Chang-Guk Sun,et al.  Terrain Proxy-Based Site Classification for Seismic Zonation in North Korea within a Geospatial Data-Driven Workflow , 2021, Remote. Sens..

[32]  M. Moscatelli,et al.  A new Vs30 map for Italy based on the seismic microzonation dataset , 2020 .

[33]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[34]  Aleksandra Nenko,et al.  Urban Data and Spatial Segregation: Analysis of Food Services Clusters in St. Petersburg, Russia , 2018, ICCS.

[35]  Dylan Wood,et al.  Bayesian calibration of multi-response systems via multivariate Kriging: Methodology and geological and geotechnical case studies , 2019, Engineering Geology.

[36]  Masashi Matsuoka,et al.  Spatial Prediction of Aftershocks Triggered by a Major Earthquake: A Binary Machine Learning Perspective , 2019, ISPRS Int. J. Geo Inf..

[37]  Jonathan P. Stewart,et al.  Compilation of a Local VS Profile Database and Its Application for Inference of VS30 from Geologic‐ and Terrain‐Based Proxies , 2014 .

[38]  Shuqi Ma,et al.  Comparison of two optimized machine learning models for predicting displacement of rainfall-induced landslide: A case study in Sichuan Province, China , 2017 .

[39]  Berthold K. P. Horn,et al.  Hill shading and the reflectance map , 1981, Proceedings of the IEEE.

[40]  H. Suito,et al.  DETECTION OF HOTSPOTS FOR THREE-DIMENSIONAL SPATIAL DATA AND ITS APPLICATION TO ENVIRONMENTAL POLLUTION DATA , 2007 .

[41]  Jian Wang,et al.  Deep learning and its application in geochemical mapping , 2019, Earth-Science Reviews.

[42]  Tinghuai Ma,et al.  A comparative study of clustering ensemble algorithms , 2018, Comput. Electr. Eng..

[43]  Zhang Kun,et al.  Comparison between General Moran's Index and Getis-Ord General G of Spatial Autocorrelation , 2007 .

[44]  Robert Y. Liang,et al.  A semi-supervised clustering-based approach for stratification identification using borehole and cone penetration test data , 2019, Engineering Geology.

[45]  Arthur Getis,et al.  Spatial Analysis and Modeling in a GIS Environment , 2004 .

[46]  Balakrushna Tripathy,et al.  Performance Analysis of Clustering Algorithm in Data Mining in R Language , 2018 .

[47]  Sylvain Chartier,et al.  The k-means clustering technique: General considerations and implementation in Mathematica , 2013 .

[48]  Kyriazis Pitilakis,et al.  New code site classification, amplification factors and normalized response spectra based on a worldwide ground-motion database , 2013, Bulletin of Earthquake Engineering.

[49]  J. DeCoster Overview of Factor Analysis , 1998 .

[50]  William A. Bryant,et al.  A Site-Conditions Map for California Based on Geology and Shear-Wave Velocity , 2000 .

[51]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[52]  R. Bilonick An Introduction to Applied Geostatistics , 1989 .

[53]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[54]  Xiaodong Liu,et al.  A Pearson's correlation coefficient based decision tree and its parallel implementation , 2018, Inf. Sci..

[55]  Chin-Teng Lin,et al.  A review of clustering techniques and developments , 2017, Neurocomputing.

[56]  Tonglin Zhang,et al.  A decomposition of Moran's I for clustering detection , 2007, Comput. Stat. Data Anal..

[57]  Chang-Guk Sun Determination of mean shear wave velocity to 30 m depth for site classification using shallow depth shear wave velocity profile in Korea , 2015 .

[58]  Timothy D. Ancheta,et al.  NGA-West2 Site Database , 2014 .

[59]  D. Wald,et al.  On the Use of High-Resolution Topographic Data as a Proxy for Seismic Site Conditions (VS30) , 2009 .

[60]  Willibald Loiskandl,et al.  Using sequential Gaussian simulation to assess the field-scale spatial uncertainty of soil water content , 2009 .

[61]  Young K. Chang,et al.  Current and future applications of statistical machine learning algorithms for agricultural machine vision systems , 2019, Comput. Electron. Agric..

[62]  Philippe De Smedt,et al.  Towards a three-dimensional cost-effective registration of the archaeological heritage , 2013 .

[63]  Kyriazis Pitilakis,et al.  Erratum to: New code site classification, amplification factors and normalized response spectra based on a worldwide ground-motion database , 2013, Bulletin of Earthquake Engineering.

[64]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[65]  Dookie Kim,et al.  Seismic Vulnerability of Cabinet Facility with Tuned Mass Dampers Subjected to High- and Low-Frequency Earthquakes , 2020, Applied Sciences.

[66]  Mohamed S. Kamel,et al.  Cumulative Voting Consensus Method for Partitions with Variable Number of Clusters , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[67]  L. Wotherspoon,et al.  Influence of soil aging on SPT-Vs correlation and seismic site classification , 2020 .

[68]  Chang-Guk Sun,et al.  Geo-Proxy-Based Site Classification for Regional Zonation of Seismic Site Effects in South Korea , 2018 .

[69]  Mrityunjoy Jana,et al.  Modeling of hotspot detection using cluster outlier analysis and Getis-Ord Gi* statistic of educational development in upper-primary level, India , 2016, Modeling Earth Systems and Environment.

[70]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[71]  J. Stewart,et al.  Taiwan-Specific Model for V S30 Prediction Considering Between-Proxy Correlations , 2018, Earthquake Spectra.

[72]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..