Multivariate Gaussian Process Emulators With Nonseparable Covariance Structures

The Gaussian process regression model is a popular type of “emulator” used as a fast surrogate for computationally expensive simulators (deterministic computer models). For simulators with multivariate output, common practice is to specify a separable covariance structure for the Gaussian process. Though computationally convenient, this can be too restrictive, leading to poor performance of the emulator, particularly when the different simulator outputs represent different physical quantities. Also, treating the simulator outputs as independent can lead to inappropriate representations of joint uncertainty. We develop nonseparable covariance structures for Gaussian process emulators, based on the linear model of coregionalization and convolution methods. Using two case studies, we compare the performance of these covariance structures both with standard separable covariance structures and with emulators that assume independence between the outputs. In each case study, we find that only emulators with nonseparable covariances structures have sufficient flexibility both to give good predictions and to represent joint uncertainty about the simulator outputs appropriately. This article has supplementary material online.

[1]  Richard J. Beckman,et al.  A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code , 2000, Technometrics.

[2]  Anthony O'Hagan,et al.  Probabilistic sensitivity analysis , 2009 .

[3]  Stanley H. Cohen,et al.  Design and Analysis , 2010 .

[4]  Hendriek C. Boshuizen,et al.  Probabilistic sensitivity analysis: be a Bayesian. , 2009, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[5]  A. O'Hagan,et al.  Predicting the output from a complex computer code when fast approximations are available , 2000 .

[6]  N. Urban,et al.  Probabilistic hindcasts and projections of the coupled climate, carbon cycle and Atlantic meridional overturning circulation system: a Bayesian fusion of century-scale observations with a simple model , 2010 .

[7]  Ronald P. Barry,et al.  Constructing and fitting models for cokriging and multivariable spatial prediction , 1998 .

[8]  J. Oakley Eliciting Gaussian process priors for complex computer codes , 2002 .

[9]  J. Rougier,et al.  Bayes Linear Calibrated Prediction for Complex Systems , 2006 .

[10]  Peter Z. G. Qian,et al.  Gaussian Process Models for Computer Experiments With Qualitative and Quantitative Factors , 2008, Technometrics.

[11]  Multivariate Geostatistics , 2004 .

[12]  D. Higdon,et al.  Computer Model Calibration Using High-Dimensional Output , 2008 .

[13]  A. Saltelli,et al.  On the Relative Importance of Input Factors in Mathematical Models , 2002 .

[14]  Jim Berger,et al.  Special Issue on Computer Modeling , 2009, Technometrics.

[15]  A. O'Hagan,et al.  Quantifying uncertainty in the biospheric carbon flux for England and Wales , 2007 .

[16]  Chris D. Jones,et al.  Climate-carbon cycle feedbacks under stabilization: uncertainty and observational constraints , 2006 .

[17]  D. Moens,et al.  An interval finite element approach for the calculation of envelope frequency response functions , 2004 .

[18]  Vicente J. Romero,et al.  Calibration and Uncertainty Analysis for Computer Simulations with Multivariate Output , 2008 .

[19]  M. J. Bayarri,et al.  Predicting Vehicle Crashworthiness: Validation of Computer Models for Functional and Hierarchical Data , 2009 .

[20]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[21]  J. Rougier Efficient Emulators for Multivariate Deterministic Functions , 2008 .

[22]  Sourabh Bhattacharya,et al.  A simulation approach to Bayesian emulation of complex dynamic computer models , 2007 .

[23]  A. O. Hagan A Markov Property for Covariance Structures , 2006 .

[24]  Marcus R. Frean,et al.  Dependent Gaussian Processes , 2004, NIPS.

[25]  A. O'Hagan,et al.  Bayesian calibration of computer models , 2001 .

[26]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[27]  Nathan M. Urban,et al.  Probabilistic hindcasts and projections of the coupled climate, carbon cycle, and Atlantic meridional overturning circulation systems , 2009 .

[28]  Astrid Maute,et al.  Expert Knowledge and Multivariate Emulation: The Thermosphere–Ionosphere Electrodynamics General Circulation Model (TIE-GCM) , 2009, Technometrics.

[29]  M. E. Johnson,et al.  Minimax and maximin distance designs , 1990 .

[30]  Etienne Balmes,et al.  GARTEUR GROUP ON GROUND VIBRATION TESTING. RESULTS FROM THE TEST OF A SINGLE STRUCTURE BY 12 LABORATORIES IN EUROPE. , 1997 .

[31]  A. O'Hagan,et al.  Probabilistic sensitivity analysis of complex models: a Bayesian approach , 2004 .

[32]  T. J. Mitchell,et al.  Bayesian Prediction of Deterministic Functions, with Applications to the Design and Analysis of Computer Experiments , 1991 .

[33]  Alan E. Gelfand,et al.  Multivariate Spatial Modeling for Geostatistical Data Using Convolved Covariance Functions , 2007 .

[34]  A. O'Hagan,et al.  Bayesian emulation of complex multi-output and dynamic computer models , 2010 .

[35]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[36]  By NATHAN M. URBAN,et al.  Probabilistic hindcasts and projections of the coupled climate , carbon cycle , and Atlantic meridional overturning circulation system : A Bayesian fusion of century-scale observations with a simple model , 2008 .

[37]  Neil D. Lawrence,et al.  Sparse Convolved Gaussian Processes for Multi-output Regression , 2008, NIPS.

[38]  M. Goulard,et al.  Linear coregionalization model: Tools for estimation and choice of cross-variogram matrix , 1992 .

[39]  C. F. Sirmans,et al.  Nonstationary multivariate process modeling through spatially varying coregionalization , 2004 .

[40]  David B. Dunson,et al.  Gaussian process models , 2013 .

[41]  M. J. Bayarri,et al.  Computer model validation with functional output , 2007, 0711.3271.

[42]  D. Higdon Space and Space-Time Modeling using Process Convolutions , 2002 .

[43]  Anthony O'Hagan,et al.  Diagnostics for Gaussian Process Emulators , 2009, Technometrics.