Dimension Reduction for Gaussian Process Emulation: An Application to the Influence of Bathymetry on Tsunami Heights

High accuracy complex computer models, or simulators, require large resources in time and memory to produce realistic results. Statistical emulators are computationally cheap approximations of such simulators. They can be built to replace simulators for various purposes, such as the propagation of uncertainties from inputs to outputs or the calibration of some internal parameters against observations. However, when the input space is of high dimension, the construction of an emulator can become prohibitively expensive. In this paper, we introduce a joint framework merging emulation with dimension reduction in order to overcome this hurdle. The gradient-based kernel dimension reduction technique is chosen due to its ability to drastically decrease dimensionality with little loss in information. The Gaussian process emulation technique is combined with this dimension reduction approach. Our proposed approach provides an answer to the dimension reduction issue in emulation for a wide range of simulation problems that cannot be tackled using existing methods. The efficiency and accuracy of the proposed framework is demonstrated theoretically, and compared with other methods on an elliptic partial differential equation (PDE) problem. We finally present a realistic application to tsunami modeling. The uncertainties in the bathymetry (seafloor elevation) are modeled as high-dimensional realizations of a spatial process using a geostatistical approach. Our dimension-reduced emulation enables us to compute the impact of these uncertainties on resulting possible tsunami wave heights near-shore and on-shore. We observe a significant increase in the spread of uncertainties in the tsunami heights due to the contribution of the bathymetry uncertainties. These results highlight the need to include the effect of uncertainties in the bathymetry in tsunami early warnings and risk assessments.

[1]  M. Wainwright,et al.  Sampled forms of functional PCA in reproducing kernel Hilbert spaces , 2011, 1109.3336.

[2]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[3]  J. Landes,et al.  Strictly Proper Scoring Rules , 2014 .

[4]  R. Cook On the Interpretation of Regression Plots , 1994 .

[5]  B. Li,et al.  On a Projective Resampling Method for Dimension Reduction With Multivariate Responses , 2008 .

[6]  Xiangrong Yin,et al.  Sequential sufficient dimension reduction for large p, small n problems , 2015 .

[7]  R. Cook,et al.  Principal fitted components for dimension reduction in regression , 2008, 0906.3953.

[8]  R. Christensen,et al.  Fisher Lecture: Dimension Reduction in Regression , 2007, 0708.3774.

[9]  G. Leoni A First Course in Sobolev Spaces , 2009 .

[10]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach , 2011 .

[11]  Anthony O'Hagan,et al.  Kendall's Advanced Theory of Statistics, volume 2B: Bayesian Inference, second edition , 2004 .

[12]  H. Tong,et al.  Article: 2 , 2002, European Financial Services Law.

[13]  T. J. Mitchell,et al.  Bayesian design and analysis of computer experiments: Use of derivatives in surface prediction , 1993 .

[14]  Kofi P. Adragni,et al.  ldr: An R Software Package for Likelihood-Based Su?cient Dimension Reduction , 2014 .

[15]  R. H. Moore,et al.  Regression Graphics: Ideas for Studying Regressions Through Graphics , 1998, Technometrics.

[16]  Bertrand Iooss,et al.  Global sensitivity analysis of computer models with functional inputs , 2008, Reliab. Eng. Syst. Saf..

[17]  Denys Dutykh,et al.  The VOLNA code for the numerical modeling of tsunami waves: Generation, propagation and inundation , 2010, 1002.4553.

[18]  Do Hyang Kim,et al.  Partial sufficient dimension reduction in regression. , 2011 .

[19]  Ilias Bilionis,et al.  Gaussian processes with built-in dimensionality reduction: Applications in high-dimensional uncertainty propagation , 2016, 1602.04550.

[20]  Peter Challenor,et al.  Computational Statistics and Data Analysis the Effect of the Nugget on Gaussian Process Emulators of Computer Models , 2022 .

[21]  K. Fukumizu,et al.  Gradient-Based Kernel Dimension Reduction for Regression , 2014 .

[22]  Ker-Chau Li Sliced inverse regression for dimension reduction (with discussion) , 1991 .

[23]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[24]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[25]  Serge Guillas,et al.  Sequential Design with Mutual Information for Computer Experiments (MICE): Emulation of a Tsunami Model , 2014, SIAM/ASA J. Uncertain. Quantification.

[26]  Ming-Jun Lai,et al.  Efficient Spatial Modeling Using the SPDE Approach With Bivariate Splines , 2015, 1503.03761.

[27]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[28]  Bertrand Iooss,et al.  Global sensitivity analysis of stochastic computer models with joint metamodels , 2008, Statistics and Computing.

[29]  Georg Stadler,et al.  Discretely Exact Derivatives for Hyperbolic PDE-Constrained Optimization Problems Discretized by the Discontinuous Galerkin Method , 2013, Journal of Scientific Computing.

[30]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[31]  Michael I. Jordan,et al.  Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces , 2004, J. Mach. Learn. Res..

[32]  D. Gleich,et al.  Computing active subspaces with Monte Carlo , 2014, 1408.0545.

[33]  Qiqi Wang,et al.  Erratum: Active Subspace Methods in Theory and Practice: Applications to Kriging Surfaces , 2013, SIAM J. Sci. Comput..

[34]  Darren J. Wilkinson,et al.  Bayesian Emulation and Calibration of a Stochastic Computer Model of Mitochondrial DNA Deletions in Substantia Nigra Neurons , 2009 .