Distributed deep reinforcement learning for simulation control

Several applications in the scientific simulation of physical systems can be formulated as control/optimization problems. The computational models for such systems generally contain hyperparameters, which control solution fidelity and computational expense. The tuning of these parameters is non-trivial and the general approach is to manually ‘spot-check’ for good combinations. This is because optimal hyperparameter configuration search becomes intractable when the parameter space is large and when they may vary dynamically. To address this issue, we present a framework based on deep reinforcement learning (RL) to train a deep neural network agent that controls a model solve by varying parameters dynamically. First, we validate our RL framework for the problem of controlling chaos in chaotic systems by dynamically changing the parameters of the system. Subsequently, we illustrate the capabilities of our framework for accelerating the convergence of a steady-state computational fluid dynamics solver by automatically adjusting the relaxation factors of the discretized Navier–Stokes equations during run-time. The results indicate that the run-time control of the relaxation factors by the learned policy leads to a significant reduction in the number of iterations for convergence compared to the random selection of the relaxation factors. Our results point to potential benefits for learning adaptive hyperparameter learning strategies across different geometries and boundary conditions with implications for reduced computational campaign expenses 4 4 Data and codes available at https://github.com/Romit-Maulik/PAR-RL.. Data and codes available at https://github.com/Romit-Maulik/PAR-RL.

[1]  Gautam Reddy,et al.  Learning to soar in turbulent environments , 2016, Proceedings of the National Academy of Sciences.

[2]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[3]  Petros Koumoutsakos,et al.  Machine Learning for Fluid Mechanics , 2019, Annual Review of Fluid Mechanics.

[4]  Babak Hejazialhosseini,et al.  Reinforcement Learning and Wavelet Adapted Vortex Methods for Simulations of Self-propelled Swimmers , 2014, SIAM J. Sci. Comput..

[5]  Lucas Lamata Quantum machine learning and quantum biomimetics: A perspective , 2020, ArXiv.

[6]  H. Ghraieb,et al.  Optimization and passive flow control using single-step deep reinforcement learning , 2020, ArXiv.

[7]  Terrence J. Sejnowski,et al.  Glider soaring via reinforcement learning in the field , 2018, Nature.

[8]  A. Mohan,et al.  Compressed Convolutional LSTM: An Efficient Deep Learning framework to Model High Fidelity 3D Turbulence , 2019, 1903.00033.

[9]  Bin Dong,et al.  PDE-Net 2.0: Learning PDEs from Data with A Numeric-Symbolic Hybrid Deep Network , 2018, J. Comput. Phys..

[10]  R. Maulik,et al.  Reduced-order modeling of advection-dominated systems with recurrent neural networks and convolutional autoencoders , 2020, 2002.00470.

[11]  Dinggang Shen,et al.  Machine Learning in Medical Imaging , 2012, Lecture Notes in Computer Science.

[12]  Alexander Kuhnle,et al.  Accelerating deep reinforcement learning strategies of flow control through a multi-environment approach , 2019, Physics of Fluids.

[13]  Prakash Vedula,et al.  Subgrid modelling for two-dimensional turbulence using neural networks , 2018, Journal of Fluid Mechanics.

[14]  Piotr Stanczyk,et al.  SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference , 2020, ICLR.

[15]  Horst D. Simon,et al.  Parallel computational fluid dynamics: implementations and results , 1992 .

[16]  Christopher L. Rumssey Recent Developments on the Turbulence Modeling Resource Website (Invited) , 2015 .

[17]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[18]  S. Patankar Numerical Heat Transfer and Fluid Flow , 2018, Lecture Notes in Mechanical Engineering.

[19]  P. Meliga,et al.  Sensitivity of aerodynamic forces in laminar and turbulent flow past a square cylinder , 2014 .

[20]  Deborah A. Kaminski,et al.  Tuning of a fuzzy rule set for controlling convergence of a CFD solver in turbulent flow , 2001 .

[21]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[22]  Paris Perdikaris,et al.  Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , 2019, J. Comput. Phys..

[23]  Hui Tang,et al.  Applying deep reinforcement learning to active flow control in turbulent conditions , 2020, 2006.10683.

[24]  R. B. Gopaluni,et al.  Deep reinforcement learning approaches for process control , 2017, 2017 6th International Symposium on Advanced Control of Industrial Processes (AdCONIP).

[25]  Jean Rabault,et al.  Artificial neural networks trained through deep reinforcement learning discover control strategies for active flow control , 2018, Journal of Fluid Mechanics.

[26]  Maxime Bassenne,et al.  Computational model discovery with reinforcement learning , 2019, ArXiv.

[27]  Sergey Levine,et al.  High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[28]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[29]  Lakshminarayanan Mahadevan,et al.  Controlled gliding and perching through deep-reinforcement-learning , 2019, Physical Review Fluids.

[30]  Jinlong Wu,et al.  Physics-informed machine learning approach for reconstructing Reynolds stress modeling discrepancies based on DNS data , 2016, 1606.07987.

[31]  J. Ottino Mixing, chaotic advection, and turbulence , 1990 .

[32]  Guido Novati,et al.  Automating Turbulence Modeling by Multi-Agent Reinforcement Learning , 2020, ArXiv.

[33]  Matthias Rupp,et al.  Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. , 2015, Journal of chemical theory and computation.

[34]  K. Taira,et al.  Super-resolution reconstruction of turbulent flows with machine learning , 2018, Journal of Fluid Mechanics.

[35]  Karthik Kashinath,et al.  MESHFREEFLOWNET: A Physics-Constrained Deep Continuous Space-Time Super-Resolution Framework , 2020, SC20: International Conference for High Performance Computing, Networking, Storage and Analysis.

[36]  E. Lorenz Deterministic nonperiodic flow , 1963 .

[37]  Deborah A. Kaminski,et al.  Control of convergence in a computational fluid dynamics simulation using ANFIS , 2005, IEEE Transactions on Fuzzy Systems.

[38]  J. Periaux,et al.  Turbulent separated shear flow control by surface plasma actuator: experimental optimization by genetic algorithm approach , 2016 .

[39]  Daniel Nikovski,et al.  Deep reinforcement learning for partial differential equation control , 2017, 2017 American Control Conference (ACC).

[40]  Changhoon Lee,et al.  Prediction of turbulent heat transfer using convolutional neural networks , 2019, Journal of Fluid Mechanics.

[41]  M. Yousuff Hussaini,et al.  A statistical learning strategy for closed-loop control of fluid flows , 2016, 1604.03392.

[42]  Jia Wu,et al.  Efficient hyperparameter optimization through model-based reinforcement learning , 2020, Neurocomputing.

[43]  Deborah A. Kaminski,et al.  A FUZZY LOGIC ALGORITHM FOR ACCELERATION OF CONVERGENCE IN SOLVING TURBULENT FLOW AND HEAT TRANSFER PROBLEMS , 2004 .

[44]  Alexander Kuhnle,et al.  A review on Deep Reinforcement Learning for Fluid Mechanics , 2019, Computers & Fluids.

[45]  Hossein Azizpour,et al.  Predictions of turbulent shear flows using deep neural networks , 2019, Physical Review Fluids.

[46]  Siddhartha Verma,et al.  Restoring Chaos Using Deep Reinforcement Learning , 2020, Chaos.

[47]  Qiqi Wang,et al.  The drag-adjoint field of a circular cylinder wake at Reynolds numbers 20, 100 and 500 , 2012, Journal of Fluid Mechanics.

[48]  Ian Dobson,et al.  Towards a theory of voltage collapse in electric power systems , 1989 .

[49]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[50]  Liu Yang,et al.  Reinforcement learning for bluff body active flow control in experiments and simulations , 2020, Proceedings of the National Academy of Sciences.

[51]  Michele Milano,et al.  Neural network modeling for near wall turbulent flow , 2002 .

[52]  S. Manzhos,et al.  Machine learning for the solution of the Schrödinger equation , 2020, Mach. Learn. Sci. Technol..

[53]  William Gropp,et al.  High-performance parallel implicit CFD , 2001, Parallel Comput..

[54]  Ott,et al.  Preserving chaos: Control strategies to preserve complex dynamics with potential relevance to biological disorders. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[55]  Tristan Bereau,et al.  Interpretable embeddings from molecular simulations using Gaussian mixture variational autoencoders , 2019, Mach. Learn. Sci. Technol..

[56]  F. Menter,et al.  Ten Years of Industrial Experience with the SST Turbulence Model , 2003 .

[57]  R. Goodman,et al.  Application of neural networks to turbulence control for drag reduction , 1997 .

[58]  Brian L. DeCost,et al.  Scientific AI in materials science: a path to a sustainable and scalable paradigm , 2020, Mach. Learn. Sci. Technol..

[59]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[60]  J. Yorke,et al.  Crises, sudden changes in chaotic attractors, and transient chaos , 1983 .

[61]  Michael I. Jordan,et al.  RLlib: Abstractions for Distributed Reinforcement Learning , 2017, ICML.

[62]  Karthik Duraisamy,et al.  Turbulence Modeling in the Age of Data , 2018, Annual Review of Fluid Mechanics.

[63]  E. Solano,et al.  Reinforcement learning for semi-autonomous approximate quantum eigensolver , 2019, Mach. Learn. Sci. Technol..

[64]  Petros Koumoutsakos,et al.  Efficient collective swimming by harnessing vortices through deep reinforcement learning , 2018, Proceedings of the National Academy of Sciences.

[65]  M. P. Brenner,et al.  Perspective on machine learning for advancing fluid mechanics , 2019, Physical Review Fluids.

[66]  Kai Fukami,et al.  Nonlinear mode decomposition with convolutional neural networks for fluid dynamics , 2019, Journal of Fluid Mechanics.

[67]  Gang Wang,et al.  Reinforcement Learning for Learning Rate Control , 2017, ArXiv.

[68]  James A. Yorke,et al.  Preturbulence: A regime observed in a fluid flow model of Lorenz , 1979 .

[69]  Sergio Gomez Colmenarejo,et al.  Acme: A Research Framework for Distributed Reinforcement Learning , 2020, ArXiv.

[70]  H. L. Seegmiller,et al.  Features of a reattaching turbulent shear layer in divergent channel flow , 1985 .

[71]  Ion Stoica,et al.  A View on Deep Reinforcement Learning in System Optimization , 2019 .

[72]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[73]  Stefan Schaal,et al.  Reinforcement Learning for Humanoid Robotics , 2003 .

[74]  Daniel W. Davies,et al.  Machine learning for molecular and materials science , 2018, Nature.

[75]  Antonio Celani,et al.  Flow Navigation by Smart Microswimmers via Reinforcement Learning , 2017, Physical review letters.

[76]  Jin Keun Seo,et al.  Framelet pooling aided deep learning network: the method to process high dimensional medical data , 2019, Mach. Learn. Sci. Technol..

[77]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[78]  Max Jaderberg,et al.  Population Based Training of Neural Networks , 2017, ArXiv.

[79]  J. Templeton,et al.  Reynolds averaged turbulence modelling using deep neural networks with embedded invariance , 2016, Journal of Fluid Mechanics.

[80]  S. Girimaji,et al.  Turbulence closure modeling with data-driven techniques: physical compatibility and consistency considerations , 2020, 2004.03031.

[81]  Aleksandar Jemcov,et al.  OpenFOAM: A C++ Library for Complex Physics Simulations , 2007 .