Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

—Current rapid changes in climate increase the ur- gency to change energy production and consumption management, in order to reduce carbon and other greenhouse gas production. In this context, the French electricity network management company RTE (R´eseau de Transport d’´Electricit´e) has recently published the results of an extensive study outlining var- ious scenarios for tomorrow’s French power management [10]. We propose a challenge that will test the viability of such scenarios [1]. The goal is to control electricity transportation in power networks while pursuing multiple objectives: balanc- ing production and consumption, minimizing energetic losses, keeping people and equipment safe, and particularly avoiding catastrophic failures. While the importance of the application provides a goal in itself, this challenge also aims to push the state-of-the-art in a branch of Artificial Intelligence (AI) called Reinforcement Learning (RL), which offers new possibilities to tackle control problems. In particular, various aspects of the combination of Deep Learning and RL called Deep Reinforce- ment Learning remain to be harnessed in this application domain. This challenge belongs to a series started in 2019 under the name ”Learning to run a power network” (L2RPN). In this new edition, we introduce new more realistic scenarios proposed by RTE to reach carbon neutrality by 2050, retiring fossil fuel electricity production, increasing proportions of renewable and nuclear energy and introducing batteries. Furthermore, we provide a baseline using a state-of-the-art reinforcement learning algorithm to stimulate future participants.

[1]  Balthazar Donon Deep statistical solvers & power systems applications. (Solveurs statistiques profonds & applications au réseau électrique) , 2022 .

[2]  Hongsheng Zeng,et al.  Action Set Based Policy Optimization for Safe Power Grid Management , 2021, ECML/PKDD.

[3]  A. Gleave,et al.  Stable-Baselines3: Reliable Reinforcement Learning Implementations , 2021, J. Mach. Learn. Res..

[4]  Deunsol Yoon,et al.  Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic , 2021, ICLR.

[5]  Gabriel Dulac-Arnold,et al.  Learning to run a Power Network Challenge: a Retrospective Analysis , 2021, NeurIPS.

[6]  In-So Kweon,et al.  An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search , 2020, NeurIPS.

[7]  Isabelle Guyon,et al.  Learning to run a power network challenge for training topology controllers , 2019, Electric Power Systems Research.

[8]  Zhiwei Wang,et al.  AI-Based Autonomous Line Flow Control via Topology Adjustment for Maximizing Time-Series ATCs , 2019, 2020 IEEE Power & Energy Society General Meeting (PESGM).

[9]  R. Qiu,et al.  Deep Reinforcement Learning for Power System Applications: An Overview , 2020 .

[10]  Jakub W. Pachocki,et al.  Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.

[11]  Hao Tian,et al.  Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion , 2019, ArXiv.

[12]  Matiur Rahman Minar,et al.  Recent Advances in Deep Learning: An Overview , 2018, ArXiv.

[13]  Demis Hassabis,et al.  Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm , 2017, ArXiv.

[14]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[15]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[16]  Yuxi Li,et al.  Deep Reinforcement Learning: An Overview , 2017, ArXiv.

[17]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[18]  Robin Girard,et al.  A 100% renewable electricity mix? Analyses and optimisations Testing the boundaries of renewable energy-based electricity development in metropolitan France by 2050 , 2016 .

[19]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[20]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[21]  Yuval Tassa,et al.  MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  D. Ernst,et al.  Power systems stability control: reinforcement learning framework , 2004, IEEE Transactions on Power Systems.

[23]  Vijay R. Konda,et al.  Actor-Critic Algorithms , 1999, NIPS.

[24]  R. Bellman A Markovian Decision Process , 1957 .