Deep Learning Approaches to Surrogates for Solving the Diffusion Equation for Mechanistic Real-World Simulations

In many mechanistic medical, biological, physical, and engineered spatiotemporal dynamic models, the numerical solution of partial differential equations (PDEs), especially for diffusion, fluid flow, and mechanical relaxation, can make simulations impractically slow. Biological models of tissues and organs often require simultaneous calculation of the spatial variation in concentration of dozens of diffusing chemical species. One clinical example in which rapid calculation of a diffusing field is useful is the estimation of oxygen gradients in the retina, based on imaging of the retinal vasculature, to guide surgical interventions in diabetic retinopathy. Furthermore, the ability to predict blood perfusion and oxygenation may one day guide clinical interventions in diverse settings, from stent placement in treating heart disease to BOLD fMRI interpretation in evaluating cognitive function (Xie et al., 2019; Lee et al., 2020). Since the quasi-steady-state solutions required for fast-diffusing chemical species like oxygen are particularly computationally costly, we consider the use of a neural network to provide an approximate solution to the steady-state diffusion equation. Machine learning surrogates, neural networks trained to provide approximate solutions to such complicated numerical problems, can often provide speed-ups of several orders of magnitude compared to direct calculation. Surrogates of PDEs could enable the use of larger and more detailed models than are possible with direct calculation and could make it practical to include such simulations in real-time or near-real-time workflows. Creating a surrogate requires running the direct calculation tens of thousands of times to generate training data and then training the neural network, both of which are computationally expensive.
However, practical applications of such models often require thousands to millions of replica simulations, for example for parameter identification and uncertainty quantification. Each replica gains speed from the surrogate, so the up-front cost of surrogate generation is rapidly recovered. We use a convolutional neural network (CNN) to approximate the stationary solution to the diffusion equation in the case of two equal-diameter, circular, constant-value sources located at random positions in a two-dimensional square domain with absorbing boundary conditions. Such a configuration caricatures the chemical concentration field of a fast-diffusing species like oxygen in a tissue with two parallel blood vessels, viewed in a cross section perpendicular to the vessels. To improve convergence during training, we apply a training approach that uses roll-back to reject stochastic changes to the network that increase the loss function. The trained neural network approximation is about 1000 times faster than the direct calculation for individual replicas. Because different applications will have different criteria for acceptable approximation accuracy, we discuss a variety of loss functions and accuracy estimators that can help select the best network for a particular application. We briefly discuss some of the issues we encountered with overfitting, mismapping of the field values, and the geometrical conditions that lead to large absolute and relative errors in the approximate solution.
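
To make the two-source configuration concrete, the following is a minimal sketch of one possible direct calculation, not the solver used in this work. It assumes a plain Laplace equation (no decay or reaction term) on a square lattice, relaxed by Jacobi iteration, with the source sites held at a constant value and the outer edge held at zero to represent the absorbing boundary. The lattice size, source radius, source value, tolerance, and the function names `disk_mask` and `steady_state_field` are all illustrative assumptions.

```python
import numpy as np


def disk_mask(shape, center, radius):
    """Boolean mask of lattice sites inside a circle with (row, col) center."""
    rows, cols = np.ogrid[:shape[0], :shape[1]]
    return (rows - center[0]) ** 2 + (cols - center[1]) ** 2 <= radius ** 2


def steady_state_field(n, centers, radius, source_value=1.0,
                       tol=1e-6, max_iter=50_000):
    """Relax Laplace's equation on an n x n lattice by Jacobi iteration.

    Assumed setup: source sites are held at source_value and the outer
    edge is held at zero (absorbing boundary). Returns (field, source mask).
    """
    source = np.zeros((n, n), dtype=bool)
    for center in centers:
        source |= disk_mask((n, n), center, radius)

    field = np.zeros((n, n))
    field[source] = source_value
    for _ in range(max_iter):
        new = field.copy()  # edge sites keep their zero value
        # Each interior site becomes the mean of its four neighbours.
        new[1:-1, 1:-1] = 0.25 * (field[:-2, 1:-1] + field[2:, 1:-1]
                                  + field[1:-1, :-2] + field[1:-1, 2:])
        new[source] = source_value  # re-impose the constant-value sources
        change = np.max(np.abs(new - field))
        field = new
        if change < tol:
            break
    return field, source


rng = np.random.default_rng(0)
n, radius = 64, 4
# Random source centers, kept far enough from the edge that both disks fit.
centers = rng.integers(radius + 1, n - radius - 1, size=(2, 2))
field, source = steady_state_field(n, centers, radius)
```

Pairs of a source mask and the corresponding relaxed field, generated for many random source positions, are the kind of input and target data on which a surrogate network could be trained. Jacobi iteration is chosen here only for brevity; it converges slowly, which illustrates why the direct calculation becomes costly when repeated tens of thousands of times.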

[1]  Jia-Shu Zhang,et al.  Predicting Chaotic Time Series Using Recurrent Neural Network, 2000.

[2]  Mohsen Guizani,et al.  Deep Feature Learning for Medical Image Analysis with Convolutional Autoencoder Neural Network , 2017, IEEE Transactions on Big Data.

[3]  Jay Pathak,et al.  An unsupervised learning approach to solving heat equations on chip based on Auto Encoder and Image Gradient , 2020, ArXiv.

[4]  J Sundnes,et al.  Improved discretisation and linearisation of active tension in strongly coupled cardiac electro-mechanics simulations , 2014, Computer methods in biomechanics and biomedical engineering.

[5]  G. Karniadakis,et al.  Physics-Informed Neural Networks for Heat Transfer Problems , 2021, Journal of Heat Transfer.

[6]  E. Viezzer,et al.  Up to two billion times acceleration of scientific simulations with deep neural architecture search , 2020, ArXiv.

[7]  R. Phillips Membranes by the Numbers , 2017, 1703.02066.

[8]  Nassir Navab,et al.  Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study , 2020, Medical Image Anal..

[9]  M. Ghalambaz,et al.  Using deep learning to learn physics of conduction heat transfer , 2020, Journal of Thermal Analysis and Calorimetry.

[10]  Diederik P. Kingma,et al.  An Introduction to Variational Autoencoders , 2019, Found. Trends Mach. Learn..

[11]  Mike Innes,et al.  Flux: Elegant machine learning with Julia , 2018, J. Open Source Softw..

[12]  Paris Perdikaris,et al.  Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , 2019, J. Comput. Phys..

[13]  Fabio Pietrucci,et al.  Machine learning force fields and coarse-grained variables in molecular dynamics: application to materials and biological systems. , 2020, Journal of chemical theory and computation.

[14]  Daniel A. Roberts Why is AI hard and Physics simple? , 2021, ArXiv.

[15]  Qing Nie,et al.  DifferentialEquations.jl – A Performant and Feature-Rich Ecosystem for Solving Differential Equations in Julia , 2017, Journal of Open Research Software.

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Laurent Planckaert,et al.  Data-driven predictions of the Lorenz system , 2020, Physica D: Nonlinear Phenomena.

[18]  Geoffrey C. Fox,et al.  Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation , 2019, 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[19]  Frank Noé,et al.  Machine learning for molecular simulation , 2019, Annual review of physical chemistry.

[20]  Franco Scarselli,et al.  On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Hao Wu,et al.  Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning , 2018, Science.

[22]  T. Buonassisi,et al.  Machine Learning for Novel Thermal-Materials Discovery: Early Successes, Opportunities, and Challenges. , 2019, 1901.05801.

[23]  Amir Barati Farimani,et al.  Weakly-Supervised Deep Learning of Heat Transport via Physics Informed Loss , 2018, ArXiv.

[24]  Geoffrey C. Fox,et al.  Learning Everywhere: A Taxonomy for the Integration of Machine Learning and Simulations , 2019, 2019 15th International Conference on eScience (eScience).

[25]  V. S. Vladimirov,et al.  Equations of mathematical physics, 1972.

[26]  W. Schiesser The Numerical Method of Lines: Integration of Partial Differential Equations, 1991.

[27]  Amir Barati Farimani,et al.  Deep Learning the Physics of Transport Phenomena , 2017, ArXiv.

[28]  Walter Herzog,et al.  Model-based estimation of muscle forces exerted during movements. , 2007, Clinical biomechanics.

[29]  Roy C. P. Kerckhoffs,et al.  Coupling of a 3D Finite Element Model of Cardiac Ventricular Mechanics to Lumped Systems Models of the Systemic and Pulmonic Circulation , 2006, Annals of Biomedical Engineering.

[30]  Kamyar Azizzadenesheli,et al.  Fourier Neural Operator for Parametric Partial Differential Equations , 2021, ICLR.

[31]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30, 1995.

[32]  Y. Zhang,et al.  Reaction diffusion system prediction based on convolutional neural network , 2020, Scientific Reports.

[33]  Timothy W Secomb,et al.  A Green's function method for simulation of time-dependent solute transport and reaction in realistic microvascular geometries. , 2016, Mathematical medicine and biology : a journal of the IMA.

[34]  Michael Innes,et al.  Fashionable Modelling with Flux , 2018, ArXiv.

[35]  M. Jeong,et al.  Deep Learning in Personalization of Cardiovascular Stents , 2020, Journal of cardiovascular pharmacology and therapeutics.

[36]  Steven L. Brunton,et al.  Data-driven discovery of coordinates and governing equations , 2019, Proceedings of the National Academy of Sciences.

[37]  Wei Chen,et al.  Molecular enhanced sampling with autoencoders: On‐the‐fly collective variable discovery and accelerated free energy landscape exploration , 2017, J. Comput. Chem..

[38]  Vaibhav Dixit,et al.  DiffEqFlux.jl - A Julia Library for Neural Differential Equations , 2019, ArXiv.

[39]  Hao F. Zhang,et al.  Retinal oxygen: from animals to humans , 2017, Progress in Retinal and Eye Research.

[40]  Yiran Li,et al.  BOLD fMRI-Based Brain Perfusion Prediction Using Deep Dilated Wide Activation Networks , 2019, MLMI@MICCAI.

[41]  Jin Y. Jin,et al.  Progress and Opportunities to Advance Clinical Cancer Therapeutics Using Tumor Dynamic Models , 2019, Clinical Cancer Research.