A Time Series Clustering Technique based on Community Detection in Networks

Abstract Time series clustering is a research topic of practical importance in temporal data mining. The goal is to identify groups of similar time series in a data base. In this paper, we propose a technique for time series clustering via community detection in complex networks. First, we construct a network where every vertex represents a time series connected its most similar ones,. Similarity was calculated using different time series distance functions. Then, we applied a community detection algorithm to identify groups of strongly connected vertices in order to produce time series clusters. We verified which distance function works better with every clustering algorithm and compared them to our approach. The experimental results show that our approach statistically outperformed many traditional clustering algorithms. We find that the community detection approach can detect groups that other techniques fail to identify.

[1]  Yang Zhang,et al.  Unsupervised Feature Extraction for Time Series Clustering Using Orthogonal Wavelet Transform , 2006, Informatica.

[2]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[3]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  P. Boesiger,et al.  A new correlation‐based fuzzy logic clustering algorithm for FMRI , 1998, Magnetic resonance in medicine.

[5]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[6]  Yannis Theodoridis,et al.  Index-based Most Similar Trajectory Search , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[7]  Carlos Agón,et al.  Time-series data mining , 2012, CSUR.

[8]  Dong Zhou,et al.  Translation techniques in cross-language information retrieval , 2012, CSUR.

[9]  Jiaqi Liu,et al.  A novel clustering method on time series data , 2011, Expert Syst. Appl..

[10]  Dit-Yan Yeung,et al.  Time series clustering with ARMA mixtures , 2004, Pattern Recognit..

[11]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[14]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[15]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[16]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[17]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[18]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[19]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[20]  Eamonn J. Keogh,et al.  CID: an efficient complexity-invariant distance for time series , 2013, Data Mining and Knowledge Discovery.