General solutions for nonlinear differential equations: a rule-based self-learning approach using deep reinforcement learning

Physicists use differential equations to describe the physical dynamical world, and the solutions of these equations constitute our understanding of the world. During the hundreds of years, scientists developed several ways to solve these equations, i.e., the analytical solutions and the numerical solutions. However, for some complex equations, there may be no analytical solutions, and the numerical solutions may encounter the curse of the extreme computational cost if the accuracy is the first consideration. Solving equations is a high-level human intelligence work and a crucial step towards general artificial intelligence (AI), where deep reinforcement learning (DRL) may contribute. This work makes the first attempt of applying (DRL) to solve nonlinear differential equations both in discretized and continuous format with the governing equations (physical laws) embedded in the DRL network, including ordinary differential equations (ODEs) and partial differential equations (PDEs). The DRL network consists of an actor that outputs solution approximations policy and a critic that outputs the critic of the actor's output solution. Deterministic policy network is employed as the actor, and governing equations are embedded in the critic. The effectiveness of the DRL solver in Schr\"odinger equation, Navier-Stocks, Van der Pol equation, Burgers' equation and the equation of motion are discussed.

[1]  J. Cole On a quasi-linear parabolic equation occurring in aerodynamics , 1951 .

[2]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[3]  M. Feit,et al.  Solution of the Schrödinger equation by a spectral method , 1982 .

[4]  E Weinan,et al.  Overcoming the curse of dimensionality: Solving high-dimensional partial differential equations using deep learning , 2017, ArXiv.

[5]  Schrödinger An Undulatory Theory of the Mechanics of Atoms and Molecules , 1926 .

[6]  Kyle Mills,et al.  Deep learning and the Schrödinger equation , 2017, ArXiv.

[7]  A. A. Soliman,et al.  The modified extended tanh-function method for solving Burgers-type equations , 2006 .

[8]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[9]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[10]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11]  Anil K. Chopra,et al.  Dynamics of Structures: Theory and Applications to Earthquake Engineering , 1995 .

[12]  Hui Li,et al.  Data-driven modeling of vortex-induced vibration of a long-span suspension bridge using decision tree learning and support vector regression , 2018 .

[13]  E Weinan,et al.  Deep Learning-Based Numerical Methods for High-Dimensional Parabolic Partial Differential Equations and Backward Stochastic Differential Equations , 2017, Communications in Mathematics and Statistics.

[14]  Lexing Ying,et al.  Solving parametric PDE problems with artificial neural networks , 2017, European Journal of Applied Mathematics.

[15]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[16]  Paris Perdikaris,et al.  Physics Informed Deep Learning (Part I): Data-driven Solutions of Nonlinear Partial Differential Equations , 2017, ArXiv.

[17]  P. Holmes,et al.  Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields , 1983, Applied Mathematical Sciences.

[18]  Marios Tsatsos,et al.  The Van der Pol Equation , 2008, 0803.1658.

[19]  S. Cox,et al.  Exponential Time Differencing for Stiff Systems , 2002 .

[20]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[21]  Hui Li,et al.  Prediction model of velocity field around circular cylinder over various Reynolds numbers by fusion convolutional neural networks based on pressure on the cylinder , 2018 .

[22]  E. Hopf The partial differential equation ut + uux = μxx , 1950 .