Combined heat and power system intelligent economic dispatch: A deep reinforcement learning approach

Abstract This paper proposed a Deep Reinforcement learning (DRL) approach for Combined Heat and Power (CHP) system economic dispatch which obtain adaptability for different operating scenarios and significantly decrease the computational complexity without affecting accuracy. In the respect of problem description, a vast of Combined Heat and Power (CHP) economic dispatch problems are modeled as a high-dimensional and non-smooth objective function with a large number of non-linear constraints for which powerful optimization algorithms and considerable time are required to solve it. In order to reduce the solution time, most engineering applications choose to linearize the optimization target and devices model. To avoid complicated linearization process, this paper models CHP economic dispatch problems as Markov Decision Process (MDP) that making the model highly encapsulated to preserve the input and output characteristics of various devices. Furthermore, we improve an advanced deep reinforcement learning algorithm: distributed proximal policy optimization (DPPO), to make it applicable to CHP economic dispatch problem. Based on this algorithm, the agent will be trained to explore optimal dispatch strategies for different operation scenarios and respond to system emergencies efficiently. In the utility phase, the trained agent will generate optimal control strategy in real time based on current system state. Compared with existing optimization methods, advantages of DRL methods are mainly reflected in the following three aspects: 1) Adaptability: under the premise of the same network topology, the trained agent can handle the economic scheduling problem in various operating scenarios without recalculation. 2) High encapsulation: The user only needs to input the operating state to get the control strategy, while the optimization algorithm needs to re-write the constraints and other formulas for different situations. 3) Time scale flexibility: It can be applied to both the day-ahead optimized scheduling and the real-time control. The proposed method is applied to two test system with different characteristics. The results demonstrate that the DRL method could handle with varieties of operating situations while get better optimization performance than most of other algorithms.

[1]  Dexuan Zou,et al.  Solving the combined heat and power economic dispatch problems by an improved genetic algorithm and a new constraint handling strategy , 2019, Applied Energy.

[2]  Behnam Mohammadi-Ivatloo,et al.  Large-scale combined heat and power economic dispatch using a novel multi-player harmony search method , 2019, Applied Thermal Engineering.

[3]  Pierluigi Mancarella,et al.  Matrix modelling of small-scale trigeneration systems and application to operational optimization , 2009 .

[4]  Jun Wang,et al.  An Online Optimal Dispatch Schedule for CCHP Microgrids Based on Model Predictive Control , 2017, IEEE Transactions on Smart Grid.

[5]  Enrico Zio,et al.  A reinforcement learning framework for optimal operation and maintenance of power grids , 2019, Applied Energy.

[6]  Bo Zhao,et al.  Bi-Level Two-Stage Robust Optimal Scheduling for AC/DC Hybrid Multi-Microgrids , 2018, IEEE Transactions on Smart Grid.

[7]  Risto Lahdelma,et al.  Non-convex power plant modelling in energy optimisation , 2006, Eur. J. Oper. Res..

[8]  Mehdi Rahmani-andebili,et al.  Dynamic and adaptive reconfiguration of electrical distribution system including renewables applying stochastic model predictive control , 2017 .

[9]  Wei Gu,et al.  A smart community energy management scheme considering user dominated demand side response and P2P trading , 2020 .

[10]  Risto Lahdelma,et al.  An efficient envelope-based Branch and Bound algorithm for non-convex combined heat and power production planning , 2007, Eur. J. Oper. Res..

[11]  Demis Hassabis,et al.  A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.

[12]  Malabika Basu Combined Heat and Power Economic Dispatch by Using Differential Evolution , 2010 .

[13]  Risto Lahdelma,et al.  An efficient linear programming algorithm for combined heat and power production , 2003, Eur. J. Oper. Res..

[14]  Shuai Lu,et al.  Coordinated dispatch of multi-energy system with district heating network: Modeling and solution strategy , 2018, Energy.

[15]  F. J. Rooijers,et al.  Static economic dispatch for co-generation systems , 1994 .

[16]  H. F. Ravn,et al.  A method to perform probabilistic production simulation involving combined heat and power units , 1996 .

[17]  Pierluigi Mancarella,et al.  Multi-energy systems : An overview of concepts and evaluation models , 2015 .

[18]  Pierluigi Mancarella,et al.  A General Model for Thermal Energy Storage in Combined Heat and Power Dispatch Considering Heat Transfer Constraints , 2018, IEEE Transactions on Sustainable Energy.

[19]  Kit Po Wong,et al.  Evolutionary programming approach for combined heat and power dispatch , 2002 .

[20]  Naser Ghorbani,et al.  Combined heat and power economic dispatch using exchange market algorithm , 2016 .

[21]  Audrius Bagdanavicius,et al.  Combined analysis of electricity and heat networks , 2014 .

[22]  Edward J. Williams,et al.  Cuckoo optimization algorithm with penalty function for combined heat and power economic dispatch problem , 2015 .

[23]  W. Gu,et al.  Residential CCHP microgrid with load aggregator: Operation mode, pricing strategy, and optimal dispatch , 2017 .

[24]  B. Mohammadi-ivatloo,et al.  Combined heat and power economic dispatch problem solution using particle swarm optimization with ti , 2013 .

[25]  E. A. Jasmin,et al.  Reinforcement Learning approaches to Economic Dispatch problem , 2011 .

[26]  Yong Min,et al.  Dispatch Model of Combined Heat and Power Plant Considering Heat Transfer Process , 2017, IEEE Transactions on Sustainable Energy.

[27]  Gevork B. Gharehpetian,et al.  A comprehensive review of heuristic optimization algorithms for optimal combined heat and power dispatch from economic and environmental perspectives , 2018 .

[28]  Shahab Bahrami,et al.  From Demand Response in Smart Grid Toward Integrated Demand Response in Smart Energy Hub , 2016, IEEE Transactions on Smart Grid.