A fuzzy reinforcement learning approach to thermal unit commitment problem

Unit commitment problem (UCP) aims at optimizing generation cost for meeting a given load demand under several operational constraints. We propose to use fuzzy reinforcement learning (RL) approach for efficient and reliable solution to the unit commitment problem. In particular, we cast UCP as a multiagent fuzzy reinforcement learning task wherein individual generators act as players for optimizing the cost to meet a given load over a twenty-four-hour period. Unit commitment task has been fuzzified, and the most optimal unit commitment solution is generated by employing RL on this fuzzy multigenerator setup. Our proposed multiagent RL framework does not assume any a priori task or system knowledge, and the generators gradually learn to produce most optimal output solely based on their collective generation. We look at the UCP as a sequential decision-making task with reward/penalty to reduce the collective generation cost of generators. To the best of our knowledge, ours is a first attempt at solving UCP by employing fuzzy reinforcement learning. We test our approach on a ten-generating-unit system with several equality and inequality constraints. Simulation results and comparisons against several recent UCP solution methods prove superiority and viability of our proposed multiagent fuzzy reinforcement learning technique.

[1]  Dilip Datta,et al.  A binary-real-coded differential evolution for unit commitment problem , 2012 .

[2]  Oveis Abedinia,et al.  A new stochastic search algorithm bundled honeybee mating for solving optimization problems , 2014, Neural Computing and Applications.

[3]  Po-Hung Chen,et al.  Two-Level Hierarchical Approach to Unit Commitment Using Expert System and Elite PSO , 2012, IEEE Transactions on Power Systems.

[4]  Eiichi Tanaka,et al.  An Evolutionary Programming Solution to the Unit Commitment Problem , 1997 .

[5]  Taher Niknam,et al.  New self-adaptive bat-inspired algorithm for unit commitment problem , 2014 .

[6]  Habib Rajabi Mashhadi,et al.  An Adaptive $Q$-Learning Algorithm Developed for Agent-Based Computational Modeling of Electricity Market , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  A. Bakirtzis,et al.  A solution to the unit-commitment problem using integer-coded genetic algorithm , 2004, IEEE Transactions on Power Systems.

[8]  Malihe M. Farsangi,et al.  Solving unit commitment problem by a binary shuffled frog leaping algorithm , 2014 .

[9]  T. Funabashi,et al.  Fuzzy unit commitment using absolutely stochastic simulated annealing , 2005, 2005 IEEE International Conference on Industrial Technology.

[10]  B. Venkatesh,et al.  Fuzzy MILP Unit Commitment Incorporating Wind Generators , 2008, IEEE Transactions on Power Systems.

[11]  Lionel Jouffe,et al.  Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[12]  Pierre-Yves Glorennec,et al.  Tuning fuzzy PD and PI controllers using reinforcement learning. , 2010, ISA transactions.

[13]  F. Aminifar,et al.  A Novel Straightforward Unit Commitment Method for Large-Scale Power Systems , 2007, IEEE Transactions on Power Systems.

[14]  Gary G. Yen,et al.  Reinforcement learning algorithms for robotic navigation in dynamic environments. , 2004, ISA transactions.

[15]  Xiaohui Yuan,et al.  Application of enhanced discrete differential evolution approach to unit commitment problem , 2009 .

[16]  Narayana Prasad Padhy,et al.  Binary real coded firefly algorithm for solving unit commitment problem , 2013, Inf. Sci..

[17]  Behrooz Vahidi,et al.  A Novel Optimized Fuzzy Approach Based on Monte Carlo Method for System Load, Wind Turbine and Photovoltaic Unit Uncertainty Modeling in Unit Commitment , 2016 .

[18]  Mohammed Azmi Al-Betar,et al.  Economic load dispatch problems with valve-point loading using natural updated harmony search , 2018, Neural Computing and Applications.

[19]  Chidentree Treesatayapun,et al.  Fuzzy-rule emulated networks, based on reinforcement learning for nonlinear discrete-time controllers. , 2008, ISA transactions.

[20]  Hao Tian,et al.  A new approach for unit commitment problem via binary gravitational search algorithm , 2014, Appl. Soft Comput..

[21]  Bart De Schutter,et al.  Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .

[22]  D. Srinivasan,et al.  A priority list-based evolutionary algorithm to solve large scale unit commitment problem , 2004, 2004 International Conference on Power System Technology, 2004. PowerCon 2004..

[23]  杨林峰,et al.  An improved priority list and neighborhood search method for unit commitment , 2015 .

[24]  Provas Kumar Roy,et al.  Solution of unit commitment problem using quasi-oppositional teaching learning based algorithm , 2014 .

[25]  Habib Rajabi Mashhadi,et al.  Measurement of Power Supplier's Market Power Using a Proposed Fuzzy Estimator , 2011, IEEE Transactions on Power Systems.

[26]  T. Lau,et al.  Quantum-Inspired Evolutionary Algorithm Approach for Unit Commitment , 2009, IEEE Transactions on Power Systems.

[27]  K. Chandram,et al.  Unit Commitment by improved pre-prepared power demand table and Muller method , 2011 .

[28]  Ranjit Roy,et al.  Economic analysis of unit commitment with distributed energy resources , 2015 .

[29]  H. Chiang,et al.  A Novel Solution Methodology for Solving Large-scale Thermal Unit Commitment Problems , 2010 .

[30]  S. Ganesan,et al.  Cost, emission and reserve pondered pre-dispatch of thermal power generating units coordinated with real coded grey wolf optimisation , 2016 .

[31]  J. Jian,et al.  Outer Approximation and Outer-Inner Approximation Approaches for Unit Commitment Problem , 2014, IEEE Transactions on Power Systems.

[32]  B. Vahidi,et al.  Bacterial Foraging-Based Solution to the Unit-Commitment Problem , 2009, IEEE Transactions on Power Systems.

[33]  Vikram Kumar Kamboj A novel hybrid PSO–GWO approach for unit commitment problem , 2015, Neural Computing and Applications.

[34]  Marco Wiering,et al.  Reinforcement Learning , 2014, Adaptation, Learning, and Optimization.

[35]  Allen. J. Wood and Bruce F. Wollenberg ‘Power Generation, Operation and Control’, John Wiley & Sons, Inc., 2003. , 2015 .

[36]  Seyed Hossein Hosseinian,et al.  Imperialistic Competitive Algorithm Based Unit Commitment Considering Risk of Cascading Blackout , 2015 .

[37]  Chuan-Ping Cheng,et al.  Unit commitment by Lagrangian relaxation and genetic algorithms , 2000 .

[38]  Vikram Kumar Kamboj,et al.  Implementation of hybrid harmony search/random search algorithm for single area unit commitment problem , 2016 .

[39]  W. Ongsakul,et al.  Unit commitment by enhanced adaptive Lagrangian relaxation , 2004, IEEE Transactions on Power Systems.

[40]  N. Sisworahardjo,et al.  Unit commitment using the ant colony search algorithm , 2002, LESCOPE'02. 2002 Large Engineering Systems Conference on Power Engineering. Conference Proceedings.

[41]  Francisco D. Galiana,et al.  Unit commitment by simulated annealing , 1990 .

[42]  Provas Kumar Roy,et al.  Solution of unit commitment problem using gravitational search algorithm , 2013 .

[43]  S. Virmani,et al.  Implementation of a Lagrangian Relaxation Based Unit Commitment Problem , 1989, IEEE Power Engineering Review.

[44]  Kankar Bhattacharya,et al.  Stochastic Unit Commitment with Wind Generation Penetration , 2012 .

[45]  T. Senjyu,et al.  Fuzzy unit commitment scheduling using absolutely stochastic simulated annealing , 2006, IEEE Transactions on Power Systems.

[46]  Abbas Khosravi,et al.  Incorporating Wind Power Forecast Uncertainties Into Stochastic Unit Commitment Using Neural Network-Based Prediction Intervals , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[47]  T. Logenthiran,et al.  Particle Swarm Optimization for unit commitment problem , 2010, 2010 IEEE 11th International Conference on Probabilistic Methods Applied to Power Systems.

[48]  T. Chow,et al.  Binary neighbourhood field optimisation for unit commitment problems , 2013 .

[49]  Gary G Yen,et al.  Reinforcement learning algorithms for robotic navigation in dynamic environments , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[50]  Allen J. Wood,et al.  Power Generation, Operation, and Control , 1984 .

[51]  Jong-Bae Park,et al.  A New Quantum-Inspired Binary PSO: Application to Unit Commitment Problems for Power Systems , 2010, IEEE Transactions on Power Systems.

[52]  Swapan Kumar Goswami,et al.  Differential Evolution Algorithm for Solving Unit Commitment with Ramp Constraints , 2008 .

[53]  G B Gharehpetian,et al.  Unit Commitment Problem Solution Using Shuffled Frog Leaping Algorithm , 2011, IEEE Transactions on Power Systems.

[54]  N.P. Padhy,et al.  Unit commitment-a bibliographical survey , 2004, IEEE Transactions on Power Systems.

[55]  R.-H. Liang,et al.  Thermal generating unit commitment using an extended mean field annealing neural network , 2000 .

[56]  Wei Xiong,et al.  An Improved Particle Swarm Optimization Algorithm for Unit Commitment , 2008, 2008 International Conference on Intelligent Computation Technology and Automation (ICICTA).

[57]  A. H. Mantawy,et al.  Unit commitment by tabu search , 1998 .

[58]  Yong He,et al.  Optimal control in microgrid using multi-agent reinforcement learning. , 2012, ISA transactions.