Direct heuristic dynamic programming method for power system stability enhancement

In this paper a neural network-based approximate dynamic programming method, namely direct heuristic dynamic programming (direct HDP), is applied to power system stability control. Direct HDP is a learning and approximation based approach to address nonlinear system control under uncertainty. In the present paper, real-time system responses provided by wide area measurement system (WAMS) are used to construct such controllers which are uniquely tailored for the problems under consideration. In addition, the controller learning objective is formulated as a reward function that reflects global characteristics of the power system low frequency oscillation under the consideration of coupling effect among system components. The contribution of the paper includes a convergence proof of the direct HDP algorithm using an LQR framework, as well as case study to illustrate the proposed learning control algorithm. The case study aims at providing a new solution to a difficult large scale system coordination problem where the China Southern Power Grid is used for.

[1]  Paul J. Werbos,et al.  Stable adaptive control using new critic designs , 1998, Other Conferences.

[2]  B. E. Eckbo,et al.  Appendix , 1826, Epilepsy Research.

[3]  Xiaorong Xie,et al.  SVC supplementary damping control using direct neural dynamic programming , 2004, Proceedings of the 2004 IEEE International Symposium on Intelligent Control, 2004..

[4]  J. Humberto Pérez-Cruz,et al.  Evolving intelligent system for the modelling of nonlinear systems with dead-zone input , 2014, Appl. Soft Comput..

[5]  F. F. Wu,et al.  Nonlinear Decentralized Disturbance Attenuation Excitation Control via New Recursive Design for Multimachine Power Systems , 2001, IEEE Power Engineering Review.

[6]  Enrico Gobbetti,et al.  Encyclopedia of Electrical and Electronics Engineering , 1999 .

[7]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[8]  James Robert Brown,et al.  What is a Definition? , 1998 .

[9]  Victor M. Becerra,et al.  Optimal control , 2008, Scholarpedia.

[10]  S. Elangovan,et al.  Efficient pole-assignment method for designing stabilisers in multimachine power systems , 1987 .

[11]  Jennie Si,et al.  Helicopter trimming and tracking control using direct neural dynamic programming , 2003, IEEE Trans. Neural Networks.

[12]  A. Barto,et al.  LEARNING AND APPROXIMATE DYNAMIC PROGRAMMING Scaling Up to the Real World , 2003 .

[13]  Vladislav Tadic,et al.  On the Convergence of Temporal-Difference Learning with Linear Function Approximation , 2001, Machine Learning.

[14]  José de Jesús Rubio,et al.  Evolving intelligent algorithms for the modelling of brain and eye signals , 2014, Appl. Soft Comput..

[15]  Michael Basin,et al.  Robust mini–max regulator for uncertain non-linear polynomial systems , 2012 .

[16]  Yao-nan Yu,et al.  Application of an Optimal Control Theory to a Power System , 1970 .

[17]  B. Chaudhuri,et al.  Wide-area measurement-based stabilizing control of power system considering signal transmission delay , 2004, IEEE Transactions on Power Systems.

[18]  Vladislav Tadic,et al.  Convergence analysis of temporal-difference learning algorithms with linear function approximation , 1999, COLT '99.

[19]  César Torres,et al.  Stable optimal control applied to a cylindrical robotic arm , 2014, Neural Computing and Applications.

[20]  Pouyan Pourbeik,et al.  Simultaneous coordination of power system stabilizers and FACTS device stabilizers in a multimachine power system for enhancing dynamic performance , 1998 .

[21]  Shengwei Mei,et al.  Nonlinear Control Systems and Power System Dynamics , 2001, The Springer International Series on Asian Studies in Computer and Information Science.

[22]  Leiba Rodman,et al.  Algebraic Riccati equations , 1995 .

[23]  Andrew G. Barto,et al.  Adaptive linear quadratic control using policy iteration , 1994, Proceedings of 1994 American Control Conference - ACC '94.

[24]  A. R. Vahidi,et al.  An improved method for determining the solution of Riccati equations , 2012, Neural Computing and Applications.

[25]  Babu Narayanan,et al.  POWER SYSTEM STABILITY AND CONTROL , 2015 .

[26]  Terrence J. Sejnowski,et al.  TD(λ) Converges with Probability 1 , 1994, Machine Learning.

[27]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[28]  Anne Kuefer,et al.  Voltage Stability Of Electric Power Systems , 2016 .

[29]  Jose de Jesus Rubio,et al.  Modified optimal control with a backpropagation network for robotic arms , 2012 .

[30]  Jennie Si,et al.  Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.