ConvPDE-UQ: Convolutional neural networks with quantified uncertainty for heterogeneous elliptic partial differential equations on varied domains

Abstract In this work, we introduce the ConvPDE-UQ framework for constructing light-weight numerical solvers for partial differential equations (PDEs) using convolutional neural networks. A theoretical justification for the neural network approximation to partial differential equation solvers on varied domains is established based on the existence and properties of Green's functions. These solvers are able to effectively reduce the computational demands of traditional numerical methods into a single forward-pass of a convolutional network. The network architecture is also designed to predict pointwise Gaussian posterior distributions, with weights trained to minimize the associated negative log-likelihood of the observed solutions. This setup facilitates simultaneous training and uncertainty quantification for the network's solutions, allowing the solver to provide pointwise uncertainties for its predictions. The associated training procedure avoids the computationally expensive Bayesian inference steps used by other state-of-the-art uncertainty models and allows training to be scaled to the large data sets required for learning on varied problem domains. The performance of the framework is demonstrated on three distinct classes of PDEs consisting of two linear elliptic problem setups and a nonlinear Poisson problem. After a single offline training procedure for each class, the proposed networks are capable of accurately predicting the solutions to linear and nonlinear elliptic problems with heterogeneous source terms defined on any specified two-dimensional domain using just a single forward-pass of a convolutional neural network. Additionally, an analysis of the predicted pointwise uncertainties is presented with experimental evidence establishing the validity of the network's uncertainty quantification schema.

[1]  Anders Logg,et al.  Unified form language: A domain-specific language for weak formulations of partial differential equations , 2012, TOMS.

[2]  Søren Hauberg,et al.  Probabilistic Solutions to Differential Equations and their Application to Riemannian Statistics , 2013, AISTATS.

[3]  Garth N. Wells,et al.  Optimizations for quadrature representations of finite element tensors through automated code generation , 2011, TOMS.

[4]  D. Kundu Discriminating Between Normal and Laplace Distributions , 2005 .

[5]  Robert C. Kirby,et al.  FIAT: numerical construction of finite element basis functions , 2012 .

[6]  Chao Xu,et al.  Sparsity-promoting elastic net method with rotations for high-dimensional nonlinear inverse problem , 2019, Computer Methods in Applied Mechanics and Engineering.

[7]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[8]  Yukio Kosugi,et al.  Neural network representation of finite element method , 1994, Neural Networks.

[9]  Alex Kendall,et al.  What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[10]  Anders Logg,et al.  DOLFIN: Automated finite element computing , 2010, TOMS.

[11]  Dimitris G. Papageorgiou,et al.  Neural-network methods for boundary value problems with irregular boundaries , 2000, IEEE Trans. Neural Networks Learn. Syst..

[12]  Anders Logg,et al.  A compiler for variational forms , 2006, TOMS.

[13]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[14]  Ilias Bilionis,et al.  Deep UQ: Learning deep neural network surrogate models for high dimensional uncertainty quantification , 2018, J. Comput. Phys..

[15]  A. Logg Automating the Finite Element Method , 2007, 1112.0433.

[16]  Anders Logg,et al.  Benchmarking Domain-Specific Compiler Optimizations for Variational Forms , 2008, TOMS.

[17]  Anders Logg,et al.  FFC: the FEniCS Form Compiler , 2012 .

[18]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[19]  Anders Logg,et al.  The FEniCS Project Version 1.5 , 2015 .

[20]  Kent-André Mardal,et al.  On the efficiency of symbolic computations combined with code generation for finite element methods , 2010, TOMS.

[21]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[22]  Niclas Jansson,et al.  Unicorn Parallel adaptive finite element simulation of turbulent flow and fluid-structure interaction for deforming domains and complex geometry , 2013 .

[23]  Nicholas Zabaras,et al.  Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate Modeling and Uncertainty Quantification , 2018, J. Comput. Phys..

[24]  Anders Logg,et al.  Automated Code Generation for Discontinuous Galerkin Methods , 2008, SIAM J. Sci. Comput..

[25]  Kun Hu,et al.  Optimal observations-based retrieval of topography in 2D shallow water equations using PC-EnKF , 2019, J. Comput. Phys..

[26]  NICLAS JANSSON,et al.  Framework for Massively Parallel Adaptive Finite Element Computational Fluid Dynamics on Tetrahedral Meshes , 2012, SIAM J. Sci. Comput..

[27]  Dimitrios I. Fotiadis,et al.  Artificial neural networks for solving ordinary and partial differential equations , 1997, IEEE Trans. Neural Networks.

[28]  D. Gilbarg,et al.  Elliptic Partial Differential Equa-tions of Second Order , 1977 .

[29]  Matthew G. Knepley,et al.  Optimizing the Evaluation of Finite Element Matrices , 2005, SIAM J. Sci. Comput..

[30]  Claes Johnson,et al.  Turbulent flow and fluid–structure interaction , 2012 .

[31]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[32]  Kent-Andre Mardal,et al.  SyFi and SFC: symbolic finite elements and form compilation , 2012 .

[33]  Andrew T. T. McRae,et al.  Automating the solution of PDEs on the sphere and other manifolds in FEniCS 1.2 , 2013 .

[34]  M. Girolami,et al.  Bayesian Solution Uncertainty Quantification for Differential Equations , 2013 .

[35]  Robert C. Kirby,et al.  Geometric Optimization of the Evaluation of Finite Element Matrices , 2007, SIAM J. Sci. Comput..

[36]  Anders Logg,et al.  Automated Solution of Differential Equations by the Finite Element Method: The FEniCS Book , 2012 .

[37]  Anders Logg,et al.  Efficient compilation of a class of variational forms , 2007, TOMS.

[38]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[39]  E. W. Ng,et al.  A table of integrals of the error functions. , 1969 .

[40]  Yuepeng Wang,et al.  Calibration of reduced-order model for a coupled Burgers equations based on PC-EnKF , 2018 .

[41]  Hyuk Lee,et al.  Neural algorithm for solving differential equations , 1990 .

[42]  Robert C. Kirby,et al.  Algorithm 839: FIAT, a new paradigm for computing finite element basis functions , 2004, TOMS.

[43]  Anders Logg,et al.  Unified framework for finite element assembly , 2009, Int. J. Comput. Sci. Eng..

[44]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[45]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Anders Logg,et al.  Efficient Assembly of $H(\mathrmdiv) and H(\mathrmcurl)$ Conforming Finite Elements , 2009, SIAM J. Sci. Comput..

[47]  Anders Logg,et al.  Efficient representation of computational meshes , 2009, Int. J. Comput. Sci. Eng..

[48]  J. B. Copas,et al.  On the unimodality of the likelihood for the Cauchy distribution , 1975 .

[49]  Anders Logg,et al.  DOLFIN: a C++/Python Finite Element Library , 2012 .

[50]  N. Phan-Thien,et al.  Neural-network-based approximations for solving partial differential equations , 1994 .

[51]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[52]  Anders Logg,et al.  UFC: a Finite Element Code Generation Interface , 2012 .

[53]  Martin Sandve Alnæs,et al.  UFL: a finite element form language , 2012 .

[54]  E Weinan,et al.  Deep Learning-Based Numerical Methods for High-Dimensional Parabolic Partial Differential Equations and Backward Stochastic Differential Equations , 2017, Communications in Mathematics and Statistics.

[55]  Justin A. Sirignano,et al.  DGM: A deep learning algorithm for solving partial differential equations , 2017, J. Comput. Phys..

[56]  Andy R. Terrel,et al.  Topological Optimization of the Evaluation of Finite Element Matrices , 2006, SIAM J. Sci. Comput..

[57]  Alaeddin Malek,et al.  Numerical solution for high order differential equations using a hybrid neural network - Optimization method , 2006, Appl. Math. Comput..

[58]  Paris Perdikaris,et al.  Physics Informed Deep Learning (Part II): Data-driven Discovery of Nonlinear Partial Differential Equations , 2017, ArXiv.