AdjointNet: Constraining machine learning models with physics-based codes

Physics-informed Machine Learning has recently become attractive for learning physical parameters and features from simulation and observation data. However, most existing methods do not ensure that the physics, such as balance laws (e.g., mass, momentum, energy conservation), are constrained. Some recent works (e.g., physics-informed neural networks) softly enforce physics constraints by including partial differential equation (PDE)-based loss functions but need re-discretization of the PDEs using auto-differentiation. Training these neural nets on observational data showed that one could solve forward and inverse problems in one shot. They evaluate the state variables and the parameters in a PDE. This re-discretization of PDEs is not necessarily an attractive option for domain scientists that work with physics-based codes that have been developed for decades with sophisticated discretization techniques to solve complex process models and advanced equations of state. This paper proposes a physics constrained machine learning framework, AdjointNet, allowing domain scientists to embed their physics code in neural network training workflows. This embedding ensures that physics is constrained everywhere in the domain. Additionally, the mathematical properties such as consistency, stability, and convergence vital to the numerical solution of a PDE are still satisfied. We show that the proposed AdjointNet framework can be used for parameter estimation (and uncertainty quantification by extension) and experimental design using active learning. The applicability of our framework is demonstrated for four cases – (1) flow in a homogeneous porous medium, (2) data assimilation for homogeneous porous media flow, (3) flow in a heterogeneous porous medium, and (4) cavity flow using the Navier-Stokes equation. Results show that AdjointNet-based inversion can estimate process model parameters with reasonable accuracy. These examples demonstrate the applicability of using existing software with no changes in source code to perform accurate and reliable inversion of model parameters.

[1]  Alan Edelman,et al.  A Differentiable Programming System to Bridge Machine Learning and Scientific Computing , 2019, ArXiv.

[2]  Satish Karra,et al.  PFLOTRAN User Manual A Massively Parallel Reactive Flow and Transport Model for Describing Surface and Subsurface Processes , 2015 .

[3]  Uwe Naumann,et al.  MPI-Parallel Discrete Adjoint OpenFOAM , 2015, ICCS.

[4]  Zheng Ma,et al.  A type of generalization error induced by initialization in deep neural networks , 2019, MSML.

[5]  Thomas J. R. Hughes,et al.  The Continuous Galerkin Method Is Locally Conservative , 2000 .

[6]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .

[7]  Siddharth Krishna Kumar,et al.  On weight initialization in deep neural networks , 2017, ArXiv.

[8]  O. C. Zienkiewicz,et al.  The Finite Element Method: Its Basis and Fundamentals , 2005 .

[9]  Ankush Khandelwal,et al.  Physics Guided Machine Learning Methods for Hydrology , 2020, ArXiv.

[10]  Paris Perdikaris,et al.  Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , 2019, J. Comput. Phys..

[11]  Rajesh J. Pawar,et al.  On the feasibility of using physics-informed machine learning for underground reservoir pressure management , 2021, Expert Syst. Appl..

[12]  Amina Adadi,et al.  Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) , 2018, IEEE Access.

[13]  Paul Messina,et al.  The Exascale Computing Project , 2017, Comput. Sci. Eng..

[14]  Bailian Chen,et al.  A Physics-Constrained Deep Learning Model for Simulating Multiphase Flow in 3D Heterogeneous Porous Media , 2021, Fuel.

[15]  H. Tchelepi,et al.  LIMITATIONS OF PHYSICS INFORMED MACHINE LEARNING FOR NONLINEAR TWO-PHASE TRANSPORT IN POROUS MEDIA , 2020 .

[16]  Aleksandar Jemcov,et al.  OpenFOAM: A C++ Library for Complex Physics Simulations , 2007 .

[17]  Nagiza F. Samatova,et al.  Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[18]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[19]  Vijay P. Singh,et al.  The Precipitation-Runoff Modeling System - PRMS. , 1995 .

[20]  M. Rumpfkeil,et al.  The optimal control of unsteady flows with a discrete adjoint method , 2010 .

[21]  G E Hammond,et al.  Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN , 2014, Water resources research.

[22]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[23]  Philip W. Jones,et al.  The DOE E3SM Coupled Model Version 1: Overview and Evaluation at Standard Resolution , 2019, Journal of Advances in Modeling Earth Systems.

[24]  X. Jia,et al.  Predicting lake surface water phosphorus dynamics using process-guided machine learning , 2020, Ecological Modelling.

[25]  Antony Jameson,et al.  Optimum Shape Design for Unsteady Flows with Time-Accurate Continuous and Discrete Adjoint Methods , 2007 .

[26]  Paris Perdikaris,et al.  Physics‐Informed Deep Neural Networks for Learning Parameters and Constitutive Relationships in Subsurface Flow Problems , 2020, Water Resources Research.

[27]  Paris Perdikaris,et al.  Understanding and mitigating gradient pathologies in physics-informed neural networks , 2020, ArXiv.

[28]  Javier E. Santos,et al.  Modeling nanoconfinement effects using active learning , 2020, The Journal of Physical Chemistry C.

[29]  Kurama Okubo,et al.  HOSS: an implementation of the combined finite-discrete element method , 2020, Computational Particle Mechanics.

[30]  J. Zico Kolter,et al.  Generalization in Deep Networks: The Role of Distance from Initialization , 2019, ArXiv.

[31]  Satish Karra,et al.  Physics-Informed Machine Learning for Real-time Reservoir Management , 2020, AAAI Spring Symposium: MLPS.

[32]  Long Chen FINITE VOLUME METHODS , 2011 .

[33]  Zhiping Mao,et al.  DeepXDE: A Deep Learning Library for Solving Differential Equations , 2019, AAAI Spring Symposium: MLPS.

[34]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[35]  Raghavan Srinivasan,et al.  SWAT: Model Use, Calibration, and Validation , 2012 .

[36]  Wojciech Samek,et al.  Explainable AI: Interpreting, Explaining and Visualizing Deep Learning , 2019, Explainable AI.

[37]  Satish Karra,et al.  Unsupervised Machine Learning Based on Non-Negative Tensor Factorization for Analyzing Reactive-Mixing , 2018, J. Comput. Phys..

[38]  Prabhat,et al.  Physics-informed machine learning: case studies for weather and climate modelling , 2021, Philosophical Transactions of the Royal Society A.

[39]  Alexander Binder,et al.  Explaining nonlinear classification decisions with deep Taylor decomposition , 2015, Pattern Recognit..

[40]  Mukund Sundararajan,et al.  The many Shapley values for model explanation , 2019, ICML.

[41]  Carlos Guestrin,et al.  Model-Agnostic Interpretability of Machine Learning , 2016, ArXiv.

[42]  G. Karniadakis,et al.  Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems , 2020 .

[43]  Filip Karlo Dosilovic,et al.  Explainable artificial intelligence: A survey , 2018, 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[44]  Eric Darve,et al.  Inverse Modeling of Viscoelasticity Materials using Physics Constrained Learning , 2020, ArXiv.

[45]  Barry F. Smith,et al.  PETSc TSAdjoint: a discrete adjoint ODE solver for first-order and second-order sensitivity analysis , 2019, SIAM J. Sci. Comput..

[46]  Klaus-Robert Müller,et al.  Layer-Wise Relevance Propagation: An Overview , 2019, Explainable AI.

[47]  M. Giles,et al.  Algorithm Developments for Discrete Adjoint Methods , 2003 .

[48]  Eric Darve,et al.  Physics Constrained Learning for Data-driven Inverse Modeling from Sparse Observations , 2020, J. Comput. Phys..