Power distribution system fault cause analysis by using association rule mining

Abstract In recent years, with the increasing requirements on power distribution utilities to ensure system reliability and to improve customers and regulators satisfaction, utilities seek to find practical solutions that enable them to restrict specific faults or to better manage their responses to unavoidable power outages. For achieving either, it is crucial to acquire a profound understanding of different faults by exploring their underlying causes and identifying key variables related to those causes. Currently, statistical models as well as advanced data analytics techniques are common tools to gain such understanding. Although basic statistical analysis provides a general knowledge of the primary causes of faults; nevertheless, it falls short of describing nuanced conditions that lead to a fault. On the other hand, applying sophisticated algorithms can produce deeper insight into the main causes; however, it would be computationally burdensome and might require a tremendous amount of running time. In order to overcome these problems, this paper proposes a novel approach for fault cause analysis by using association rule mining. The primary goals are to characterize faults according to their underlying causes and to identify important variables that strongly impact fault frequency. This paper proposes a step-by-step procedure, which deals with data preparation, practical issues associated with fault data sets, and implementation of association rule mining. The procedure is followed by a comprehensive case study to demonstrate how the proposed approach can be used to mine for causal structures and identify frequent patterns for vegetation, animal, equipment failure, public accident, and lightning-related faults.

[1]  S. S. Venkata,et al.  Predicting vegetation-related failure rates for overhead distribution feeders , 2002 .

[2]  M.-Y. Chow,et al.  Data Mining and Analysis of Tree-Caused Faults in Power Distribution Systems , 2006, 2006 IEEE PES Power Systems Conference and Exposition.

[3]  Vitor Hugo Ferreira,et al.  A survey on intelligent system application to fault diagnosis in electric power system transmission lines , 2016 .

[4]  Badrul H. Chowdhury,et al.  Power distribution system equipment failure identification using machine learning algorithms , 2017, 2017 IEEE Power & Energy Society General Meeting.

[5]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[6]  Mo-Yuen Chow,et al.  Analysis and prevention of animal-caused faults in power distribution systems , 1995 .

[7]  F. Karimipour,et al.  Extracting Human Behavioral Patterns by Mining Geo-Social Networks , 2014 .

[8]  Mo-Yuen Chow,et al.  A classification approach for power distribution systems fault cause identification , 2006, IEEE Transactions on Power Systems.

[9]  Mo-Yuen Chow,et al.  Power Distribution Fault Cause Identification With Imbalanced Data Using the Data Mining-Based Fuzzy Classification $E$-Algorithm , 2007, IEEE Transactions on Power Systems.

[10]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[11]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[12]  Jonas Johansson,et al.  A Method for Assessing Margin and Sensitivity of Electricity Networks With Respect to Repair System Resources , 2016, IEEE Transactions on Smart Grid.

[13]  Matti Lehtonen,et al.  Impacts of Fault Diagnosis Schemes on Distribution System Reliability , 2012, IEEE Transactions on Smart Grid.

[14]  Nicola Torelli,et al.  ROSE: a Package for Binary Imbalanced Learning , 2014, R J..

[15]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[16]  Lexin Li,et al.  Statistical Feature Selection From Massive Data in Distribution Fault Diagnosis , 2010, IEEE Transactions on Power Systems.