Distributed deep reinforcement learning for simulation control

Several applications in the scientific simulation of physical systems can be formulated as control/optimization problems. The computational models for such systems generally contain hyperparameters, which control solution fidelity and computational expense. The tuning of these parameters is non-trivial and the general approach is to manually `spot-check' for good combinations. This is because optimal hyperparameter configuration search becomes impractical when the parameter space is large and when they may vary dynamically. To address this issue, we present a framework based on deep reinforcement learning (RL) to train a deep neural network agent that controls a model solve by varying parameters dynamically. First, we validate our RL framework for the problem of controlling chaos in chaotic systems by dynamically changing the parameters of the system. Subsequently, we illustrate the capabilities of our framework for accelerating the convergence of a steady-state CFD solver by automatically adjusting the relaxation factors of discretized Navier-Stokes equations during run-time. The results indicate that the run-time control of the relaxation factors by the learned policy leads to a significant reduction in the number of iterations for convergence compared to the random selection of the relaxation factors. Our results point to potential benefits for learning adaptive hyperparameter learning strategies across different geometries and boundary conditions with implications for reduced computational campaign expenses.

[1]  Lucas Lamata Quantum machine learning and quantum biomimetics: A perspective , 2020, ArXiv.

[2]  Changhoon Lee,et al.  Prediction of turbulent heat transfer using convolutional neural networks , 2019, Journal of Fluid Mechanics.

[3]  Maxime Bassenne,et al.  Computational model discovery with reinforcement learning , 2019, ArXiv.

[4]  Jinlong Wu,et al.  Physics-informed machine learning approach for reconstructing Reynolds stress modeling discrepancies based on DNS data , 2016, 1606.07987.

[5]  Hui Tang,et al.  Applying deep reinforcement learning to active flow control in weakly turbulent conditions , 2020, Physics of Fluids.

[6]  Deborah A. Kaminski,et al.  A FUZZY LOGIC ALGORITHM FOR ACCELERATION OF CONVERGENCE IN SOLVING TURBULENT FLOW AND HEAT TRANSFER PROBLEMS , 2004 .

[7]  Karthik Duraisamy,et al.  Turbulence Modeling in the Age of Data , 2018, Annual Review of Fluid Mechanics.

[8]  Jia Wu,et al.  Efficient hyperparameter optimization through model-based reinforcement learning , 2020, Neurocomputing.

[9]  Guido Novati,et al.  Automating Turbulence Modeling by Multi-Agent Reinforcement Learning , 2020, ArXiv.

[10]  J. Templeton,et al.  Reynolds averaged turbulence modelling using deep neural networks with embedded invariance , 2016, Journal of Fluid Mechanics.

[11]  James A. Yorke,et al.  Preturbulence: A regime observed in a fluid flow model of Lorenz , 1979 .

[12]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[13]  Ion Stoica,et al.  A View on Deep Reinforcement Learning in System Optimization , 2019 .

[14]  Petros Koumoutsakos,et al.  Efficient collective swimming by harnessing vortices through deep reinforcement learning , 2018, Proceedings of the National Academy of Sciences.

[15]  Siddhartha Verma,et al.  Restoring Chaos Using Deep Reinforcement Learning , 2020, Chaos.

[16]  Lakshminarayanan Mahadevan,et al.  Controlled gliding and perching through deep-reinforcement-learning , 2019, Physical Review Fluids.

[17]  Liu Yang,et al.  Reinforcement learning for bluff body active flow control in experiments and simulations , 2020, Proceedings of the National Academy of Sciences.

[18]  Michele Milano,et al.  Neural network modeling for near wall turbulent flow , 2002 .

[19]  R. B. Gopaluni,et al.  Deep reinforcement learning approaches for process control , 2017, 2017 6th International Symposium on Advanced Control of Industrial Processes (AdCONIP).

[20]  Jin Keun Seo,et al.  Framelet pooling aided deep learning network: the method to process high dimensional medical data , 2019, Mach. Learn. Sci. Technol..

[21]  Paris Perdikaris,et al.  Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , 2019, J. Comput. Phys..

[22]  Sergio Gomez Colmenarejo,et al.  Acme: A Research Framework for Distributed Reinforcement Learning , 2020, ArXiv.

[23]  Karthik Kashinath,et al.  MESHFREEFLOWNET: A Physics-Constrained Deep Continuous Space-Time Super-Resolution Framework , 2020, SC20: International Conference for High Performance Computing, Networking, Storage and Analysis.

[24]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[25]  Gautam Reddy,et al.  Learning to soar in turbulent environments , 2016, Proceedings of the National Academy of Sciences.

[26]  K. Taira,et al.  Super-resolution reconstruction of turbulent flows with machine learning , 2018, Journal of Fluid Mechanics.

[27]  Deborah A. Kaminski,et al.  Tuning of a fuzzy rule set for controlling convergence of a CFD solver in turbulent flow , 2001 .

[28]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[29]  Bin Dong,et al.  PDE-Net 2.0: Learning PDEs from Data with A Numeric-Symbolic Hybrid Deep Network , 2018, J. Comput. Phys..

[30]  P. Meliga,et al.  Sensitivity of aerodynamic forces in laminar and turbulent flow past a square cylinder , 2014 .

[31]  Sergey Levine,et al.  High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[32]  Gang Wang,et al.  Reinforcement Learning for Learning Rate Control , 2017, ArXiv.

[33]  H. L. Seegmiller,et al.  Features of a reattaching turbulent shear layer in divergent channel flow , 1985 .

[34]  Jean Rabault,et al.  Artificial neural networks trained through deep reinforcement learning discover control strategies for active flow control , 2018, Journal of Fluid Mechanics.

[35]  Prasanna Balaprakash,et al.  Reduced-order modeling of advection-dominated systems with recurrent neural networks and convolutional autoencoders , 2020, Physics of Fluids.

[36]  M. P. Brenner,et al.  Perspective on machine learning for advancing fluid mechanics , 2019, Physical Review Fluids.

[37]  Piotr Stanczyk,et al.  SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference , 2020, ICLR.

[38]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[39]  Christopher L. Rumssey Recent Developments on the Turbulence Modeling Resource Website (Invited) , 2015 .

[40]  F. Menter,et al.  Ten Years of Industrial Experience with the SST Turbulence Model , 2003 .

[41]  J. Yorke,et al.  Crises, sudden changes in chaotic attractors, and transient chaos , 1983 .

[42]  Hossein Azizpour,et al.  Predictions of turbulent shear flows using deep neural networks , 2019, Physical Review Fluids.

[43]  Matthias Rupp,et al.  Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. , 2015, Journal of chemical theory and computation.

[44]  Tristan Bereau,et al.  Interpretable embeddings from molecular simulations using Gaussian mixture variational autoencoders , 2019, Mach. Learn. Sci. Technol..

[45]  R. Goodman,et al.  Application of neural networks to turbulence control for drag reduction , 1997 .

[46]  E. Lorenz Deterministic nonperiodic flow , 1963 .

[47]  John N. Tsitsiklis,et al.  Actor-Critic Algorithms , 1999, NIPS.

[48]  Petros Koumoutsakos,et al.  Machine Learning for Fluid Mechanics , 2019, Annual Review of Fluid Mechanics.

[49]  S. Girimaji,et al.  Turbulence closure modeling with data-driven techniques: physical compatibility and consistency considerations , 2020, 2004.03031.

[50]  Babak Hejazialhosseini,et al.  Reinforcement Learning and Wavelet Adapted Vortex Methods for Simulations of Self-propelled Swimmers , 2014, SIAM J. Sci. Comput..

[51]  Prakash Vedula,et al.  Subgrid modelling for two-dimensional turbulence using neural networks , 2018, Journal of Fluid Mechanics.

[52]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[53]  A. Mohan,et al.  Compressed Convolutional LSTM: An Efficient Deep Learning framework to Model High Fidelity 3D Turbulence , 2019, 1903.00033.

[54]  Antonio Celani,et al.  Flow Navigation by Smart Microswimmers via Reinforcement Learning , 2017, Physical review letters.

[55]  Terrence J. Sejnowski,et al.  Glider soaring via reinforcement learning in the field , 2018, Nature.

[56]  S. Manzhos,et al.  Machine learning for the solution of the Schrödinger equation , 2020, Mach. Learn. Sci. Technol..

[57]  William Gropp,et al.  High-performance parallel implicit CFD , 2001, Parallel Comput..

[58]  H. Ghraieb,et al.  Optimization and passive flow control using single-step deep reinforcement learning , 2020, ArXiv.

[59]  Katja Bachmeier,et al.  Numerical Heat Transfer And Fluid Flow , 2016 .

[60]  Deborah A. Kaminski,et al.  Control of convergence in a computational fluid dynamics simulation using ANFIS , 2005, IEEE Transactions on Fuzzy Systems.

[61]  Ian Dobson,et al.  Towards a theory of voltage collapse in electric power systems , 1989 .

[62]  Kai Fukami,et al.  Nonlinear mode decomposition with convolutional neural networks for fluid dynamics , 2019, Journal of Fluid Mechanics.

[63]  Qiqi Wang,et al.  The drag-adjoint field of a circular cylinder wake at Reynolds numbers 20, 100 and 500 , 2012, Journal of Fluid Mechanics.

[64]  Stefan Schaal,et al.  Reinforcement Learning for Humanoid Robotics , 2003 .

[65]  Aleksandar Jemcov,et al.  OpenFOAM: A C++ Library for Complex Physics Simulations , 2007 .

[66]  Daniel W. Davies,et al.  Machine learning for molecular and materials science , 2018, Nature.

[67]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[68]  Brian L. DeCost,et al.  Scientific AI in materials science: a path to a sustainable and scalable paradigm , 2020, Mach. Learn. Sci. Technol..

[69]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[70]  M. Yousuff Hussaini,et al.  A statistical learning strategy for closed-loop control of fluid flows , 2016, 1604.03392.

[71]  Michael I. Jordan,et al.  RLlib: Abstractions for Distributed Reinforcement Learning , 2017, ICML.

[72]  E. Solano,et al.  Reinforcement learning for semi-autonomous approximate quantum eigensolver , 2019, Mach. Learn. Sci. Technol..

[73]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[74]  Alexander Kuhnle,et al.  A review on Deep Reinforcement Learning for Fluid Mechanics , 2019, Computers & Fluids.

[75]  Dinggang Shen,et al.  Machine Learning in Medical Imaging , 2012, Lecture Notes in Computer Science.

[76]  J. Periaux,et al.  Turbulent separated shear flow control by surface plasma actuator: experimental optimization by genetic algorithm approach , 2016 .

[77]  Daniel Nikovski,et al.  Deep reinforcement learning for partial differential equation control , 2017, 2017 American Control Conference (ACC).

[78]  J. Ottino Mixing, chaotic advection, and turbulence , 1990 .

[79]  Horst D. Simon,et al.  Parallel computational fluid dynamics: implementations and results , 1992 .

[80]  Ott,et al.  Preserving chaos: Control strategies to preserve complex dynamics with potential relevance to biological disorders. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.