A deep surrogate approach to efficient Bayesian inversion in PDE and integral equation models

We propose a novel deep learning approach to efficiently perform Bayesian inference in partial differential equation (PDE) and integral equation models over potentially high-dimensional parameter spaces. The contributions of this paper are two-fold; the first is the introduction of a neural network approach to approximating the solutions of Fredholm and Volterra integral equations of the first and second kind. The second is the description of a deep surrogate model which allows for efficient sampling from a Bayesian posterior distribution in which the likelihood depends on the solutions of PDEs or integral equations. For the latter, our method relies on the approximate representation of parametric solutions by neural networks. This deep learning approach allows the accurate and efficient approximation of parametric solutions in significantly higher dimensions than is possible using classical techniques. Since the approximated solutions are very cheap to evaluate, the solutions of Bayesian inverse problems over large parameter spaces are tractable using Markov chain Monte Carlo. We demonstrate the efficiency of our method using two real-world examples; these include Bayesian inference in the PDE and integral equation case for an example from electrochemistry, and Bayesian inference of a function-valued heat-transfer parameter with applications in aviation.

[1]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[2]  J. Michael Owen,et al.  Use of fin equation to calculate Nusselt numbers for rotating disks , 2015 .

[3]  Richard A. Davis,et al.  Remarks on Some Nonparametric Estimates of a Density Function , 2011 .

[4]  Simon Cotter,et al.  Surrogate accelerated Bayesian inversion for the determination of the thermal diffusivity of a material , 2018, Metrologia.

[5]  Arnulf Jentzen,et al.  Analysis of the generalization error: Empirical risk minimization over deep artificial neural networks overcomes the curse of dimensionality in the numerical approximation of Black-Scholes partial differential equations , 2018, SIAM J. Math. Data Sci..

[6]  Paris Perdikaris,et al.  Physics Informed Deep Learning (Part I): Data-driven Solutions of Nonlinear Partial Differential Equations , 2017, ArXiv.

[7]  E Weinan,et al.  Deep Learning-Based Numerical Methods for High-Dimensional Parabolic Partial Differential Equations and Backward Stochastic Differential Equations , 2017, Communications in Mathematics and Statistics.

[8]  Arnulf Jentzen,et al.  A proof that artificial neural networks overcome the curse of dimensionality in the numerical approximation of Black-Scholes partial differential equations , 2018, Memoirs of the American Mathematical Society.

[9]  Paris Perdikaris,et al.  Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data , 2019, J. Comput. Phys..

[10]  T. J. Dodwell,et al.  A Hierarchical Multilevel Markov Chain Monte Carlo Algorithm with Applications to Uncertainty Quantification in Subsurface Flow , 2013, SIAM/ASA J. Uncertain. Quantification.

[11]  Andrew M. Stuart,et al.  Inverse problems: A Bayesian perspective , 2010, Acta Numerica.

[12]  Dario Izzo,et al.  Interplanetary transfers via deep representations of the optimal policy and/or of the value function , 2019, GECCO.

[13]  Y. Marzouk,et al.  A stochastic collocation approach to Bayesian inference in inverse problems , 2009 .

[14]  Adrian F. M. Smith,et al.  Bayesian computation via the gibbs sampler and related markov chain monte carlo methods (with discus , 1993 .

[15]  D. Xiu,et al.  Modeling Uncertainty in Steady State Diffusion Problems via Generalized Polynomial Chaos , 2002 .

[16]  Nicholas J. Hills,et al.  HEAT TRANSFER IN HIGH-PRESSURE COMPRESSOR GAS TURBINE INTERNAL AIR SYSTEMS: A ROTATING DISC-CONE CAVITY WITH AXIAL THROUGHFLOW , 2000 .

[17]  Maziar Raissi,et al.  Forward-Backward Stochastic Neural Networks: Deep Learning of High-dimensional Partial Differential Equations , 2018, ArXiv.

[18]  Yalchin Efendiev,et al.  An efficient two‐stage Markov chain Monte Carlo method for dynamic data integration , 2005 .

[19]  M. D. Rooij,et al.  Electrochemical Methods: Fundamentals and Applications , 2003 .

[20]  Yalchin Efendiev,et al.  Preconditioning Markov Chain Monte Carlo Simulations Using Coarse-Scale Models , 2006, SIAM J. Sci. Comput..

[21]  Arnulf Jentzen,et al.  A proof that deep artificial neural networks overcome the curse of dimensionality in the numerical approximation of Kolmogorov partial differential equations with constant diffusion and nonlinear drift coefficients , 2018, Communications in Mathematical Sciences.

[22]  R. Ghanem,et al.  Stochastic Finite Elements: A Spectral Approach , 1990 .

[23]  Clemens Elster,et al.  A surrogate model enables a Bayesian approach to the inverse problem of scatterometry , 2014 .

[24]  J. Michael Owen,et al.  Use of Fin Equation to Calculate Nusselt Numbers for Rotating Discs , 2015 .

[25]  A. Abdel-azim Fundamentals of Heat and Mass Transfer , 2011 .

[26]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[27]  Paris Perdikaris,et al.  Physics Informed Deep Learning (Part II): Data-driven Discovery of Nonlinear Partial Differential Equations , 2017, ArXiv.

[28]  Nicholas Zabaras,et al.  An efficient Bayesian inference approach to inverse problems based on an adaptive sparse grid collocation method , 2009 .

[29]  Sebastian Becker,et al.  Deep Optimal Stopping , 2018, J. Mach. Learn. Res..

[30]  Qi Gong,et al.  Adaptive Deep Learning for High Dimensional Hamilton-Jacobi-Bellman Equations , 2019, SIAM J. Sci. Comput..

[31]  Hessam Babaee,et al.  Deep Learning of Turbulent Scalar Mixing , 2018, Physical Review Fluids.

[32]  Liangliang Zhang,et al.  A General Framework of Optimal Investment , 2019, Journal of Mathematical Finance.

[33]  Justin A. Sirignano,et al.  DGM: A deep learning algorithm for solving partial differential equations , 2017, J. Comput. Phys..

[34]  M. Rahman,et al.  Integral Equations and Their Applications , 2007 .

[35]  Arnulf Jentzen,et al.  Solving high-dimensional partial differential equations using deep learning , 2017, Proceedings of the National Academy of Sciences.

[36]  Gianluca Iaccarino,et al.  A Surrogate Accelerated Bayesian Inverse Analysis of the HyShot II Flight Data , 2011 .

[37]  Hadi Meidani,et al.  Physics-Informed Regularization of Deep Neural Networks , 2018, J. Comput. Inf. Sci. Eng..