Clustering Time Series Data Stream - A Literature Survey

Mining Time Series data has a tremendous growth of interest in today's world. To provide an indication various implementations are studied and summarized to identify the different problems in existing applications. Clustering time series is a trouble that has applications in an extensive assortment of fields and has recently attracted a large amount of research. Time series data are frequently large and may contain outliers. In addition, time series are a special type of data set where elements have a temporal ordering. Therefore clustering of such data stream is an important issue in the data mining process. Numerous techniques and clustering algorithms have been proposed earlier to assist clustering of time series data streams. The clustering algorithms and its effectiveness on various applications are compared to develop a new method to solve the existing problem. This paper presents a survey on various clustering algorithms available for time series datasets. Moreover, the distinctiveness and restriction of previous research are discussed and several achievable topics for future study are recognized. Furthermore the areas that utilize time series clustering are also summarized.

[1]  Yang Zhang,et al.  Unsupervised Feature Extraction for Time Series Clustering Using Orthogonal Wavelet Transform , 2006, Informatica.

[2]  João Gama,et al.  Hierarchical Clustering of Time-Series Data Streams , 2008, IEEE Transactions on Knowledge and Data Engineering.

[3]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[4]  R. F. Li,et al.  Combining Conceptual Clustering and Principal Component Analysis for State Space Based Process Monitoring , 1999 .

[5]  Gareth J. Janacek,et al.  Clustering Time Series with Clipped Data , 2005, Machine Learning.

[6]  Eyke Hüllermeier,et al.  Online clustering of parallel data streams , 2006, Data Knowl. Eng..

[7]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[8]  Jian Yin,et al.  A Clustering Algorithm for Time Series Data , 2006, 2006 Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06).

[9]  Xiaozhe Wang,et al.  Characteristic-Based Clustering for Time Series Data , 2006, Data Mining and Knowledge Discovery.

[10]  Shusaku Tsumoto,et al.  Cluster Analysis of Time-Series Medical Data Based on the Trajectory Representation and Multiscale Comparison Techniques , 2006, Sixth International Conference on Data Mining (ICDM'06).

[11]  Xiang Lian,et al.  Efficient Similarity Search over Future Stream Time Series , 2008, IEEE Transactions on Knowledge and Data Engineering.

[12]  Pasi Fränti,et al.  Time-series clustering by approximate prototypes , 2008, ICPR.

[13]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[14]  D. Seborg,et al.  Clustering multivariate time‐series data , 2005 .

[15]  Lisa M. Talbot,et al.  Application of Fuzzy Grade-of-Membership Clustering to Analysis of Remote Sensing Data , 1999 .

[16]  Weimin Li,et al.  Clustering Streaming Time Series Using CBC , 2007, International Conference on Computational Science.

[17]  Philippe Besse,et al.  Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives , 2007, EURASIP J. Bioinform. Syst. Biol..

[18]  Gareth J. Janacek,et al.  Clustering time series from ARMA models with clipped data , 2004, KDD.

[19]  Ziv Bar-Joseph,et al.  Clustering short time series gene expression data , 2005, ISMB.