Emulation of Multivariate Simulators Using Thin-Plate Splines with Application to Atmospheric Dispersion

It is often desirable to build a statistical emulator of a complex computer simulator in order to perform analysis which would otherwise be computationally infeasible. We propose methodology to model multivariate output from a computer simulator taking into account output structure in the responses. The utility of this approach is demonstrated by applying it to a chemical and biological hazard prediction model. Predicting the hazard area that results from an accidental or deliberate chemical or biological release is imperative in civil and military planning and also in emergency response. The hazard area resulting from such a release is highly structured in space and we therefore propose the use of a thin-plate spline to capture the spatial structure and fit a Gaussian process emulator to the coefficients of the resultant basis functions. We compare and contrast four different techniques for emulating multivariate output: dimension-reduction using (i) a fully Bayesian approach with a principal component basis, (ii) a fully Bayesian approach with a thin-plate spline basis, assuming that the basis coefficients are independent, and (iii) a "plug-in" Bayesian approach with a thin-plate spline basis and a separable covariance structure; and (iv) a functional data modeling approach using a tensor-product (separable) Gaussian process. We develop methodology for the two thin-plate spline emulators and demonstrate that these emulators significantly outperform the principal component emulator. Further, the separable thin-plate spline emulator, which accounts for the dependence between basis coefficients, provides substantially more realistic quantification of uncertainty, and is also computationally more tractable, allowing fast emulation. For high resolution output data, it also offers substantial predictive and computational advantages over the tensor-product Gaussian process emulator.

[1]  Peter Z. G. Qian,et al.  Gaussian Process Models for Computer Experiments With Qualitative and Quantitative Factors , 2008, Technometrics.

[2]  S. Wood Thin plate regression splines , 2003 .

[3]  Jeremy E. Oakley,et al.  Multivariate Gaussian Process Emulators With Nonseparable Covariance Structures , 2013, Technometrics.

[4]  Holger Dette,et al.  Optimal design for additive partially nonlinear models , 2011 .

[5]  A. O'Hagan,et al.  Bayesian emulation of complex multi-output and dynamic computer models , 2010 .

[6]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[7]  Henry P. Wynn,et al.  [Design and Analysis of Computer Experiments]: Rejoinder , 1989 .

[8]  James A. Smith,et al.  Bayesian updating of atmospheric dispersion models for use after an accidental release of radioactivity , 1993 .

[9]  Brian J. Williams,et al.  Sensitivity analysis when model outputs are functions , 2006, Reliab. Eng. Syst. Saf..

[10]  Astrid Maute,et al.  Expert Knowledge and Multivariate Emulation: The Thermosphere–Ionosphere Electrodynamics General Circulation Model (TIE-GCM) , 2009, Technometrics.

[11]  H. F. Stripling,et al.  Spline-Based Emulators for Radiative Shock Experiments With Measurement Error , 2013 .

[12]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[13]  A. O'Hagan,et al.  Bayesian calibration of computer models , 2001 .

[14]  M. J. Bayarri,et al.  Computer model validation with functional output , 2007, 0711.3271.

[15]  F. Graybill,et al.  Matrices with Applications in Statistics. , 1984 .

[16]  Lennart Robertson,et al.  Bayesian updating of atmospheric dispersion after a nuclear accident , 2004 .

[17]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.

[18]  J. Rougier Efficient Emulators for Multivariate Deterministic Functions , 2008 .

[19]  Andy J. Keane,et al.  Engineering Design via Surrogate Modelling - A Practical Guide , 2008 .

[20]  Robert B. Gramacy,et al.  Cases for the nugget in modeling computer experiments , 2010, Statistics and Computing.

[21]  Runze Li,et al.  Design and Modeling for Computer Experiments , 2005 .

[22]  T. J. Mitchell,et al.  Exploratory designs for computational experiments , 1995 .

[23]  D. Higdon,et al.  Computer Model Calibration Using High-Dimensional Output , 2008 .

[24]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[25]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[26]  Stefan M. Wild,et al.  Bayesian Calibration and Uncertainty Analysis for Computationally Expensive Models Using Optimization and Radial Basis Function Approximation , 2008 .

[27]  David C. Woods,et al.  Weighted space-filling designs , 2013, J. Simulation.

[28]  P. Robins,et al.  Realtime sequential inference of static parameters with expensive likelihood calculations , 2009 .