Fuzzy Clustering Stability Evaluation of Time Series

The discovery of knowledge by analyzing time series is an important field of research. In this paper we investigate multiple multivariate time series, because we assume a higher information value than regarding only one time series at a time. There are several approaches which make use of the granger causality or the cross correlation in order to analyze the influence of time series on each other. In this paper we extend the idea of mutual influence and present FCSETS (Fuzzy Clustering Stability Evaluation of Time Series), a new approach which makes use of the membership degree produced by the fuzzy c-means (FCM) algorithm. We first cluster time series per timestamp and then compare the relative assignment agreement (introduced by Eyke Hüllermeier and Maria Rifqi) of all subsequences. This leads us to a stability score for every time series which itself can be used to evaluate single time series in the data set. It is then used to rate the stability of the entire clustering. The stability score of a time series is higher the more the time series sticks to its peers over time. This not only reveals a new idea of mutual time series impact but also enables the identification of an optimal amount of clusters per timestamp. We applied our model on different data, such as financial, country related economy and generated data, and present the results.

[1]  Frank Klawonn,et al.  Fuzzy Clustering of Short Time-Series and Unevenly Distributed Sampling Points , 2003, IDA.

[2]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[3]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett..

[4]  Duong Tuan Anh,et al.  A novel clustering-based method for time series motif discovery under time warping measure , 2017, International Journal of Data Science and Analytics.

[5]  Young-Il Kim,et al.  A cluster validation index for GK cluster analysis based on relative degree of sharing , 2004, Inf. Sci..

[6]  Y. Fukuyama,et al.  A new method of choosing the number of clusters for the fuzzy c-mean method , 1989 .

[7]  Santo Fortunato,et al.  A benchmark model to assess community structure in evolving networks , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  James C. Bezdek,et al.  Fuzzy c-means clustering of incomplete data , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[9]  Thomas A. Runkler Comparing Partitions by Subset Similarities , 2010, IPMU.

[10]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[11]  Ziv Bar-Joseph,et al.  Clustering short time series gene expression data , 2005, ISMB.

[12]  Sudipto Guha,et al.  Streaming-data algorithms for high-quality clustering , 2002, Proceedings 18th International Conference on Data Engineering.

[13]  Stefan Conrad,et al.  Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters , 2016, IPMU.

[14]  David A. Bader,et al.  A dynamic algorithm for local community detection in graphs , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[15]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[16]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[17]  Eyke Hüllermeier,et al.  A Fuzzy Variant of the Rand Index for Comparing Clustering Structures , 2009, IFSA/EUSFLAT Conf..

[18]  Joachim M. Buhmann,et al.  A Resampling Approach to Cluster Validation , 2002, COMPSTAT.

[19]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[20]  Eyke Hüllermeier,et al.  Adaptive Optimization of the Number of Clusters in Fuzzy Clustering , 2007, 2007 IEEE International Fuzzy Systems Conference.

[21]  Martha Tatusch,et al.  Show Me Your Friends and I'll Tell You Who You Are. Finding Anomalous Time Series by Conspicuous Cluster Transitions , 2019, AusDM.

[22]  Hoel Le Capitaine,et al.  A Cluster-Validity Index Combining an Overlap Measure and a Separation Measure Based on Fuzzy-Aggregation Operators , 2011, IEEE Transactions on Fuzzy Systems.

[23]  K. Swan,et al.  The Global Economy , 2021, Global Marketing.

[24]  Nick Filer,et al.  The rise and fall of spatio-temporal clusters in mobile ad hoc networks , 2013, Ad Hoc Networks.

[25]  Hichem Frigui,et al.  Clustering and aggregation of relational data with applications to image database categorization , 2007, Pattern Recognit..

[26]  Heiko Timm,et al.  Different approaches to fuzzy clustering of incomplete datasets , 2004, Int. J. Approx. Reason..

[27]  Arindam Banerjee,et al.  Clickstream clustering using weighted longest common subsequences , 2001 .

[28]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[29]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Shengrui Wang,et al.  An objective approach to cluster validation , 2006, Pattern Recognit. Lett..

[31]  Liang Zhao,et al.  Time series clustering via community detection in networks , 2015, Inf. Sci..

[32]  Luis Gravano,et al.  k-Shape: Efficient and Accurate Clustering of Time Series , 2016, SGMD.

[33]  Witold Pedrycz,et al.  Fuzzy clustering of time series data using dynamic time warping distance , 2015, Eng. Appl. Artif. Intell..

[34]  Raymond Y. K. Lau,et al.  Time series k-means: A new k-means type smooth subspace clustering for time series data , 2016, Inf. Sci..

[35]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[36]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[37]  Derek Greene,et al.  Tracking the Evolution of Communities in Dynamic Social Networks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.