Evolutionary Optimization of a One-Class Classification System for Faults Recognition in Smart Grids

The Computational Intelligence paradigm has proven useful for problems related to Smart Grids (SG). Modern SG systems are equipped with Smart Sensors scattered along real-world power distribution lines; these sensors capture a fine-grained picture of the actual power grid state by gathering a huge amount of heterogeneous data. Modeling and predicting generic fault instances by processing structured patterns of fault data coming from Smart Sensors is a very challenging task. This paper deals with the problem of fault modeling and recognition on medium-voltage (MV) feeders in the real-world Smart Grid that feeds the city of Rome, Italy. The fault recognition problem is addressed by means of a One-Class classifier based on a modified k-means algorithm trained through an evolutionary approach. Due to the nature of the specific data-driven problem at hand, a custom weighted dissimilarity measure is adopted, designed to cope with mixed data types: numerical data, Time Series, and categorical data. For the latter, a Semantic Distance (SD) is proposed, capable of capturing semantic information from clustered data. A genetic algorithm is in charge of optimizing the system's performance. Tests were performed on data gathered over three years by ACEA Distribuzione S.p.A., the company that manages the power grid of Rome.
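The idea of a weighted dissimilarity over mixed data types, used by a cluster-based One-Class classifier, can be illustrated with a minimal sketch. This is not the paper's implementation: the field names (`num`, `ts`, `cat`), the use of dynamic time warping for the Time Series component, the simple mismatch used in place of the Semantic Distance, and the fixed `threshold` are all assumptions made for illustration. The weights and the threshold stand in for the kind of parameters that an evolutionary search could tune.

```python
from math import inf


def dtw(a, b):
    """Classic dynamic time warping distance between two numeric sequences."""
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m]


def dissimilarity(x, y, weights):
    """Weighted sum of per-type dissimilarities (hypothetical pattern layout)."""
    w_num, w_ts, w_cat = weights
    d_num = abs(x["num"] - y["num"])            # numerical feature
    d_ts = dtw(x["ts"], y["ts"])                # Time Series feature (assumed DTW)
    d_cat = 0.0 if x["cat"] == y["cat"] else 1.0  # stand-in for the Semantic Distance
    return w_num * d_num + w_ts * d_ts + w_cat * d_cat


def is_fault(pattern, representatives, weights, threshold):
    """One-class decision: accept if close enough to any cluster representative."""
    nearest = min(dissimilarity(pattern, r, weights) for r in representatives)
    return nearest <= threshold


# Usage with made-up patterns
rep = {"num": 1.0, "ts": [1, 2, 3], "cat": "A"}
obs = {"num": 1.5, "ts": [1, 2, 2, 3], "cat": "B"}
print(is_fault(obs, [rep], weights=(1.0, 1.0, 1.0), threshold=2.0))
```

In this sketch a pattern is recognized as belonging to the target (fault) class when its weighted dissimilarity from the nearest cluster representative does not exceed the threshold; changing the weights changes how much each data type contributes to that decision.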
