On the Accuracy Versus Transparency Trade-Off of Data-Mining Models for Fast-Response PMU-Based Catastrophe Predictors

In all areas of engineering, modelers are constantly pushing for more accurate models and their goal is generally achieved with increasingly complex, data-mining-based black-box models. On the other hand, model users which include policy makers and systems operators tend to favor transparent, interpretable models not only for predictive decision-making but also for after-the-fact auditing and forensic purposes. In this paper, we investigate this trade-off between the accuracy and the transparency of data-mining-based models in the context of catastrophe predictors for power grid response-based remedial action schemes, at both the protective and operator levels. Wide area severity indices (WASI) are derived from PMU measurements and fed to the corresponding predictors based on data-mining models such as decision trees (DT), random forests (RF), neural networks (NNET), support vector machines (SVM), and fuzzy rule based models (Fuzzy_DT and Fuzzy_ID3). It is observed that while switching from black-box solutions such as NNET, SVM, and RF to transparent fuzzy rule-based predictors, the accuracy deteriorates sharply while transparency and interpretability are improved. Although transparent automation schemes are historically preferred in power system control and operations, we show that, with existing modeling tools, this philosophy fails to achieve the “3-nines” accuracy figures expected from a modern power grid. The transparency and accuracy trade-offs between the developed catastrophe predictors is demonstrated thoroughly on a data base with more than 60 000 instances from a test (10%) and an actual (90%) system combined.

[1]  I. Kamwa,et al.  Automatic Segmentation of Large Power Systems Into Fuzzy Coherent Areas for Dynamic Vulnerability Assessment , 2007, IEEE Transactions on Power Systems.

[2]  U. D. Annakkage,et al.  Support Vector Machine-Based Algorithm for Post-Fault Transient Stability Status Prediction Using Synchronized Measurements , 2011, IEEE Transactions on Power Systems.

[3]  Louis Wehenkel,et al.  OPERATION RULES DETERMINED BY RISK ANALYSIS FOR SPECIAL PROTECTION SYSTEMS AT HYDRO-QUÉBEC , 2004 .

[4]  M.A. El-Sharkawi,et al.  Support vector machines for transient stability analysis of large-scale power systems , 2004, IEEE Transactions on Power Systems.

[5]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[6]  V. Brandwajn,et al.  Severity indices for contingency screening in dynamic security assessment , 1997 .

[7]  Zhiyong Li,et al.  Phasor Measurements-Aided Decision Trees for Power System Security Assessment , 2009, 2009 Second International Conference on Information and Computing Science.

[8]  Mark Lauby,et al.  Reliability considerations from the integration of smart grid , 2012, 2012 IEEE PES Innovative Smart Grid Technologies (ISGT).

[9]  R. Grondin,et al.  Time-varying contingency screening for dynamic security assessment using intelligent-systems techniques , 2001, PICA 2001. Innovative Computing for Power - Electric Energy Meets the Market. 22nd IEEE Power Engineering Society. International Conference on Power Industry Computer Applications (Cat. No.01CH37195).

[10]  Kenneth E. Martin,et al.  WACS-Wide-Area Stability and Voltage Control System: R&D and Online Demonstration , 2005, Proceedings of the IEEE.

[11]  S.M. Rovnyak,et al.  Response-based decision trees to trigger one-shot stabilizing control , 2004, IEEE Transactions on Power Systems.

[12]  Lars Niklasson,et al.  Accuracy vs. comprehensibility in data mining models , 2004 .

[13]  C. W. Taylor WACS-wide-area stability and voltage control system: R&D and on-line demonstration , 2004 .

[14]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[15]  I Kamwa,et al.  Development of rule-based classifiers for rapid stability assessment of wide-area post disturbance records , 2009, IEEE PES General Meeting.

[16]  N. Amjady,et al.  Transient Stability Prediction by a Hybrid Intelligent System , 2007, IEEE Transactions on Power Systems.

[17]  Innocent Kamwa,et al.  Fast topology detection based on open line detection , 2004 .

[18]  Waldemar Rebizant,et al.  Fuzzy logic application to out-of-step protection of generators , 2001, 2001 Power Engineering Society Summer Meeting. Conference Proceedings (Cat. No.01CH37262).

[19]  Hidetomo Ichihashi,et al.  Neuro-fuzzy ID3: a method of inducing fuzzy decision trees with linear programming for maximizing entropy and an algebraic method for incremental learning , 1996, Fuzzy Sets Syst..

[20]  Balazs Feil,et al.  Cluster Analysis for Data Mining and System Identification , 2007 .

[21]  Geza Joos,et al.  Catastrophe Predictors From Ensemble Decision-Tree Learning of Wide-Area Severity Indices , 2010, IEEE Transactions on Smart Grid.

[22]  Zuhair Bandar,et al.  On constructing a fuzzy inference framework using crisp decision trees , 2006, Fuzzy Sets Syst..

[23]  Kai Sun,et al.  An Online Dynamic Security Assessment Scheme Using Phasor Measurements and Decision Trees , 2007, IEEE Transactions on Power Systems.

[24]  Serge Guillaume,et al.  Designing fuzzy inference systems from data: An interpretability-oriented review , 2001, IEEE Trans. Fuzzy Syst..

[25]  I. Kamwa,et al.  Fuzzy Partitioning of a Real Power System for Dynamic Vulnerability Assessment , 2009, IEEE Transactions on Power Systems.

[26]  Chul-Won Park,et al.  Fuzzy logic-based relaying for large power transformer protection , 2003 .

[27]  Ferenc Szeifert,et al.  Data-driven generation of compact, accurate, and linguistically sound fuzzy classifiers based on a decision-tree initialization , 2003, Int. J. Approx. Reason..