Kohonen Maps Combined to Fuzzy C-means, a Two Level Clustering Approach. Application to Electricity Load Data

In a deregulated electricity market, load forecasting is nowadays of paramount importance to estimate next day load resulting in energy save and environment protection. Electricity demand is influenced (among other things) by the day of the week, the time of year and special periods and/or days such as religious and national events, all of which must be identified prior to modelling. This identification, known as day type identification, must be included in the design stages either by segmenting the data and modelling each day type separately or by including the day type as an input, which implies data classification and cluster creation. Data classification consists in regrouping objects of a similar data set into homogenous classes. Two main types of classifications exist: supervised and unsupervised classification. Supervised classification is based on a set of objects L of known classes, called training set, with the main goal being to identify candidate objects into their belonging classes. Where, unsupervised classification consists in partitioning a set of data D into sub-sets of similar attributes called classes or clusters (Halgamuge, 2005). Unsupervised classification is termed clustering, and will be so in the remaining of the chapter. For clustering means, conventional research usually employs multivariate analysis procedures. However, it was found that clustering the data directly, becomes computationally heavy using statistical method as the size of the data set increases (Jain & Dubes, 1988; Xu & Wunsch, 2005). Despite this fact, many linear approaches such as Principal Component Analysis (PCA) (Jolliffe, 2002) and K-means were and remain, extensively used for classification and clustering purposes. Nonlinear classification and clustering approaches stand as a strong alternative in order to treat the complexity and visualisation problems issued from large multidimensional data sets. In recent years, due to their high performance in engineering, Artificial Neural Networks (ANN), more specifically Self Organising Maps (SOM), and fuzzy logic are now being used as alternate statistical tools. Combining both paradigms in a two-level approach may be profitable to reduce significantly the computational cost as shown in (Khadir et. al., 2010) where SOM and K-means were combined for time series clustering.

[1]  Saman K. Halgamuge,et al.  Classification and clustering for knowledge discovery , 2005 .

[2]  Damien Fay A strategy for short-term load forecasting in Ireland , 2004 .

[3]  James C. Bezdek,et al.  Efficient Implementation of the Fuzzy c-Means Clustering Algorithms , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[5]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[6]  Khadir M. Tarek,et al.  Kohonen Maps Combined to K-means in a Two Level Strategy for Time Series ClusteringApplication to Meteorological and Electricity Load data , 2010 .

[7]  Dipti Srinivasan,et al.  Parallel neural network-fuzzy expert system strategy for short-term load forecasting: system implementation and performance evaluation , 1999 .

[8]  James C. Bezdek,et al.  Validity-guided (re)clustering with applications to image segmentation , 1996, IEEE Trans. Fuzzy Syst..

[9]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[10]  Y.-Y. Hsu,et al.  Design of artificial neural networks for short - term load forecasting , 1991 .

[11]  Johan Himberg,et al.  A SOM based cluster visualization and its application for false coloring , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[12]  Josphat Igadwa Mwasiagi,et al.  Self Organizing Maps - Applications and Novel Algorithm Design , 2011 .

[13]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[14]  Patrick Rousset Applications des algorithmes d'auto-organisation à la classification et à la prévision , 1999 .

[15]  J. Wernstedt,et al.  Forecast strategy using an adaptive fuzzy classification algorithm for load , 1999, 1999 European Control Conference (ECC).

[16]  李幼升,et al.  Ph , 1989 .

[17]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[18]  Lawrence O. Hall,et al.  Fast Accurate Fuzzy Clustering through Data Reduction , 2003 .

[19]  I. Jolliffe Principal Component Analysis , 2002 .

[20]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[21]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  David C. Yu,et al.  Weather sensitive short-term load forecasting using nonfully connected artificial neural network , 1992 .

[23]  G. Dreyfus,et al.  Réseaux de neurones - Méthodologie et applications , 2002 .

[24]  Chuen-Sheng Cheng,et al.  Identification of seasonal short-term load forecasting models using statistical decision functions , 1990 .

[25]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[26]  Damien Fay,et al.  24-h Electrical Load Data - a Sequential or Partitioned Time Series? , 2003, Neurocomputing.

[27]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[28]  Ben J Hicks,et al.  SPIE - The International Society for Optical Engineering , 2001 .

[29]  Damien Fay,et al.  Kohonen Map Combined to the K-Means Algorithm for the Identification of Day Types of Algerian Electricity Load , 2008, 2008 7th Computer Information Systems and Industrial Management Applications.

[30]  Saman K. Halgamuge,et al.  Classification and Clustering for Knowledge Discovery (Studies in Computational Intelligence) , 2005 .

[31]  V. J. Rayward-Smith,et al.  Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition , 1999 .

[32]  Sándor M. Szilágyi,et al.  Intensity inhomogeneity correction and segmentation of magnetic resonance images using a multi-stage fuzzy clustering approach , 2009 .

[33]  Roger Xu,et al.  New data clustering technique and its applications , 2001, SPIE Defense + Commercial Sensing.

[34]  Hannu Koivisto,et al.  Classification and Clustering for Knowledge Discovery , 2005, Studies in Computational Intelligence.

[35]  Christine W. Chan,et al.  An architectural framework for the construction of hybrid intelligent forecasting systems: application for electricity demand prediction , 1998 .

[36]  Jian Yu,et al.  Analysis of the weighting exponent in the FCM , 2004, IEEE Trans. Syst. Man Cybern. Part B.