Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics, using the minimum number of function evaluations. This problem can be seen in the context of active learning or optimal experimental design. We employ Bayesian regression to represent the derived model uncertainty due to finite and small number of input–output pairs. In this context we evaluate existing methods for optimal sample selection, such as model error minimization and mutual information maximization. We show that for the case of known output variance, the commonly employed criteria in the literature do not take into account the output values of the existing input–output pairs, while for the case of unknown output variance this dependence can be very weak. We introduce a criterion that takes into account the values of the output for the existing samples and adaptively selects inputs from regions of the parameter space which have an important contribution to the output. The new method allows for application to high-dimensional inputs, paving the way for optimal experimental design in high dimensions.

[1]  A. Majda,et al.  Predicting fat-tailed intermittent probability distributions in passive scalar turbulence with imperfect models through empirical information theory , 2016 .

[2]  I. Verdinelli,et al.  Bayesian designs for maximizing information and outcome , 1992 .

[3]  D. Peter,et al.  Kernel estimation of a distribution function , 1985 .

[4]  Themistoklis P. Sapsis,et al.  Sequential sampling strategy for extreme event statistics in nonlinear dynamical systems , 2018, Proceedings of the National Academy of Sciences.

[5]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[6]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[7]  Michael Ortiz,et al.  A hysteretic cohesive-law model of fatigue-crack nucleation , 2005 .

[8]  Petros Koumoutsakos,et al.  Machine Learning for Fluid Mechanics , 2019, Annual Review of Fluid Mechanics.

[9]  Themistoklis P. Sapsis,et al.  A probabilistic decomposition-synthesis method for the quantification of rare events due to internal instabilities , 2015, J. Comput. Phys..

[10]  Mardavij Roozbehani,et al.  Robustness Sensitivities in Large Networks , 2018 .

[11]  Man Kam Kwong,et al.  Norm Inequalities for Derivatives and Differences , 1993 .

[12]  D. Fan,et al.  A robotic Intelligent Towing Tank for learning complex fluid-structure dynamics , 2019, Science Robotics.

[13]  T. Sapsis,et al.  A variational approach to probing extreme events in turbulent dynamical systems , 2017, Science Advances.

[14]  Jitesh H. Panchal,et al.  Bayesian Optimal Design of Experiments for Inferring the Statistical Expectation of Expensive Black-Box Functions , 2019, Journal of Mechanical Design.

[15]  K. Chaloner,et al.  Bayesian Experimental Design: A Review , 1995 .

[16]  Gm Gero Walter,et al.  Bayesian linear regression , 2009 .

[17]  A. C. Rencher Linear models in statistics , 1999 .

[18]  Themistoklis P. Sapsis,et al.  New perspectives for the prediction and statistical quantification of extreme events in high-dimensional dynamical systems , 2018, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[19]  Piyush Pandita,et al.  Bayesian Optimal Design of Experiments For Inferring The Statistical Expectation Of A Black-Box Function , 2018 .

[20]  X. Huan,et al.  GRADIENT-BASED STOCHASTIC OPTIMIZATION METHODS IN BAYESIAN EXPERIMENTAL DESIGN , 2012, 1212.2228.

[21]  Mohammad Farazmand,et al.  Are extreme dissipation events predictable in turbulent fluid flows? , 2018, Physical Review Fluids.

[22]  Chandler Squires,et al.  ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery , 2019, AISTATS.

[23]  T. Sapsis,et al.  Reduced-order precursors of rare events in unidirectional nonlinear water waves , 2015, Journal of Fluid Mechanics.

[24]  A. Majda,et al.  Statistical dynamical model to predict extreme events and anomalous features in shallow water waves with abrupt depth change , 2019, Proceedings of the National Academy of Sciences.