Surrogate modeling of multifidelity data for large samples

The problem of construction of a surrogate model based on available lowand high-fidelity data is considered. The low-fidelity data can be obtained, e.g., by performing the computer simulation and the high-fidelity data can be obtained by performing experiments in a wind tunnel. A regression model based on Gaussian processes proves to be convenient for modeling variable-fidelity data. Using this model, one can efficiently reconstruct nonlinear dependences and estimate the prediction accuracy at a specified point. However, if the sample size exceeds several thousand points, direct use of the Gaussian process regression becomes impossible due to a high computational complexity of the algorithm. We develop new algorithms for processing multifidelity data based on Gaussian process model, which are efficient even for large samples. We illustrate application of the developed algorithms by constructing surrogate models of a complex engineering system.

[1]  Ashok Srivastava,et al.  Stable and Efficient Gaussian Process Calculations , 2009, J. Mach. Learn. Res..

[2]  Andy J. Keane,et al.  Engineering Design via Surrogate Modelling - A Practical Guide , 2008 .

[3]  Petros Drineas,et al.  On the Nyström Method for Approximating a Gram Matrix for Improved Kernel-Based Learning , 2005, J. Mach. Learn. Res..

[4]  Carl E. Rasmussen,et al.  A Unifying View of Sparse Approximate Gaussian Process Regression , 2005, J. Mach. Learn. Res..

[5]  Alexander I. J. Forrester,et al.  Multi-fidelity optimization via surrogate modelling , 2007, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[6]  Tom Dhaene,et al.  Cost-efficient electromagnetic-simulation-driven antenna design using co-Kriging , 2012 .

[7]  François Bachoc,et al.  Cross Validation and Maximum Likelihood estimations of hyper-parameters of Gaussian processes with model misspecification , 2013, Comput. Stat. Data Anal..

[8]  A. O'Hagan,et al.  Predicting the output from a complex computer code when fast approximations are available , 2000 .

[9]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[10]  J. Butcher Numerical methods for ordinary differential equations , 2003 .

[11]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[12]  Zoubin Ghahramani,et al.  Sparse Gaussian Processes using Pseudo-inputs , 2005, NIPS.

[13]  Evgeny Burnaev,et al.  Properties of the Bayesian Parameter Estimation of a Regression Based on Gaussian Processes , 2014 .

[14]  Noel A. C. Cressie,et al.  Statistics for Spatial Data: Cressie/Statistics , 1993 .

[15]  Dervis Karaboga,et al.  A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm , 2007, J. Glob. Optim..

[16]  Sasan C. Armand Structural Optimization Methodology for Rotating Disks of Aircraft Engines , 1995 .

[17]  Jeong‐Soo Park Optimal Latin-hypercube designs for computer experiments , 1994 .