Reinforcement Learning Solution for Unit Commitment Problem through Pursuit Method

Unit commitment is an optimization task in electric power generation control sector. It involves scheduling the ON/OFF status of the generating units to meet the load demand with minimum generation cost satisfying the different constraints existing in the system. Numerical solutions developed are limited for small systems and heuristic methodologies find difficulty in handling stochastic cost functions associated with practical systems. This paper models Unit Commitment as a multi stage decision task and Reinforcement Learning solution is formulated through one efficient exploration strategy: Pursuit method. The correctness and efficiency of the developed solutions are verified for standard test systems.

[1]  Gerald B. Sheblé,et al.  A profit-based unit commitment GA for the competitive environment , 2000 .

[2]  K. Shanti Swarup,et al.  Neural computation using discrete and continuous Hopfield networks for power system economic dispatch and unit commitment , 2006, Neurocomputing.

[3]  D. Ernst,et al.  Power systems stability control: reinforcement learning framework , 2004, IEEE Transactions on Power Systems.

[4]  P. S. Nagendra Rao,et al.  A reinforcement learning approach to automatic generation control , 2002 .

[5]  D. Ernst,et al.  Approximate Value Iteration in the Reinforcement Learning Context. Application to Electrical Power System Control. , 2005 .

[6]  D. Bertsekas,et al.  Solution of Large-Scale Optimal Unit Commitment Problems , 1982, IEEE Transactions on Power Apparatus and Systems.

[7]  Gerald Tesauro,et al.  Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..

[8]  E.A. Jasmin,et al.  A Reinforcement Learning algorithm to economic dispatch considering transmission losses , 2008, TENCON 2008 - 2008 IEEE Region 10 Conference.

[9]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[10]  Walter L. Snyder,et al.  Dynamic Programming Approach to Unit Commitment , 1987, IEEE Transactions on Power Systems.

[11]  S. Shankar Sastry,et al.  Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.

[12]  Tadashi Horiuchi,et al.  Adaptive state construction for reinforcement learning and its application to robot navigation problems , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.