An efficient methodology for modeling complex computer codes with Gaussian processes

Complex computer codes are often too time expensive to be directly used to perform uncertainty propagation studies, global sensitivity analysis or to solve optimization problems. A well known and widely used method to circumvent this inconvenience consists in replacing the complex computer code by a reduced model, called a metamodel, or a response surface that represents the computer code and requires acceptable calculation time. One particular class of metamodels is studied: the Gaussian process model that is characterized by its mean and covariance functions. A specific estimation procedure is developed to adjust a Gaussian process model in complex cases (non-linear relations, highly dispersed or discontinuous output, high-dimensional input, inadequate sampling designs, etc.). The efficiency of this algorithm is compared to the efficiency of other existing algorithms on an analytical test case. The proposed methodology is also illustrated for the case of a complex hydrogeological computer code, simulating radionuclide transport in groundwater.

[1]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[2]  Eric Walter,et al.  Intrinsic Kriging and prior information , 2005 .

[3]  R. Ababou,et al.  On the condition number of covariance matrices in kriging, estimation, and simulation of random fields , 1994 .

[4]  Ken R. McNaught,et al.  A comparison of experimental designs in the development of a neural network simulation metamodel , 2004, Simul. Model. Pract. Theory.

[5]  A. OHagan,et al.  Bayesian analysis of computer code outputs: A tutorial , 2006, Reliab. Eng. Syst. Saf..

[6]  Céline Scheidt,et al.  Assessing Uncertainty and Optimizing Production Schemes – Experimental Designs for Non-Linear Production Response Modeling an Application to Early Water Breakthrough Prevention , 2004 .

[7]  Phillip S. Kott,et al.  Wiley Series in Probability and Mathematical Statistics , 1995 .

[8]  Runze Li,et al.  Design and Modeling for Computer Experiments , 2005 .

[9]  Jack P. C. Kleijnen,et al.  An Overview of the Design and Analysis of Simulation Experiments for Sensitivity Analysis , 2005, Eur. J. Oper. Res..

[10]  T. J. Mitchell,et al.  Bayesian Prediction of Deterministic Functions, with Applications to the Design and Analysis of Computer Experiments , 1991 .

[11]  Søren Nymand Lophaven,et al.  DACE - A Matlab Kriging Toolbox, Version 2.0 , 2002 .

[12]  Jon C. Helton,et al.  Survey of sampling-based methods for uncertainty and sensitivity analysis , 2006, Reliab. Eng. Syst. Saf..

[13]  Henry P. Wynn,et al.  Screening, predicting, and computer experiments , 1992 .

[14]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.

[15]  Richard J. Beckman,et al.  A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code , 2000, Technometrics.

[16]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[17]  M. S. Bazaraa,et al.  Nonlinear Programming , 1979 .

[18]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[19]  Jack P. C. Kleijnen,et al.  A methodology for fitting and validating metamodels in simulation , 2000, Eur. J. Oper. Res..

[20]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[21]  Jennifer A Hoeting,et al.  Model selection for geostatistical models. , 2006, Ecological applications : a publication of the Ecological Society of America.

[22]  B. Iooss,et al.  Global sensitivity analysis for a numerical model of radionuclide migration from the RRC “Kurchatov Institute” radwaste disposal site , 2008 .

[23]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[24]  Bertrand Iooss,et al.  Response surfaces and sensitivity analyses for an environmental model of dose calculations , 2006, Reliab. Eng. Syst. Saf..

[25]  Cristina H. Amon,et al.  An engineering design methodology with multistage Bayesian surrogates and optimal sampling , 1996 .

[26]  George E. P. Box,et al.  Empirical Model‐Building and Response Surfaces , 1988 .

[27]  Harald Bergstriim Mathematical Theory of Probability and Statistics , 1966 .

[28]  Isabelle Zabalza-Mezghani,et al.  Response Surface Designs for Scenario Management and Uncertainty Quantification in Reservoir Production , 2004 .

[29]  J. Chilès,et al.  Geostatistics: Modeling Spatial Uncertainty , 1999 .

[30]  Jack P. C. Kleijnen,et al.  Sensitivity analysis and related analyses: A review of some statistical techniques , 1997 .