A Novel Mining Algorithm for Periodic Clustering Sequential Patterns

In knowledge discovery, mining time series data has many important applications. In particular, sequential patterns and periodic patterns, both of which evolved from association rules, have found many practical uses. This paper presents another useful concept, the periodic clustering sequential (PCS) pattern, which uses clustering to mine valuable information from temporal or serially ordered data over a period of time. For example, one can cluster patients according to the symptoms of the illness under study, but on its own this yields only a set of symptom-specific clusters for analyzing how patients are distributed. Adding time series analysis to this investigation makes it possible to examine the distribution of patients across the same or different seasons. For policymakers, the PCS pattern is more useful than a traditional clustering result and provides more effective support for decision-making.
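The patient example can be made concrete with a small sketch. The code below is only an illustration of the general idea (cluster first, then look at how cluster membership is distributed per period); it is not the PCS mining algorithm of the paper, whose details are not given here. The plain k-means routine, the function names `kmeans` and `cluster_distribution_by_period`, and the toy symptom scores are all assumptions introduced for illustration.

```python
from collections import Counter, defaultdict


def kmeans(points, k, iterations=20):
    """Plain Lloyd-style k-means; centroids start at the first k points (an assumption)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(
                range(k),
                key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])),
            )
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_distribution_by_period(records, k):
    """records: list of (period, feature_vector) pairs.

    Returns {period: Counter(cluster_label -> number of records)}.
    """
    labels = kmeans([features for _, features in records], k)
    distribution = defaultdict(Counter)
    for (period, _), label in zip(records, labels):
        distribution[period][label] += 1
    return dict(distribution)


# Hypothetical toy data: (season, (fever score, cough score)).
records = [
    ("winter", (9.0, 8.0)),
    ("summer", (1.0, 2.0)),
    ("winter", (8.5, 9.0)),
    ("winter", (9.5, 8.5)),
    ("summer", (1.5, 1.0)),
    ("summer", (2.0, 1.5)),
]
result = cluster_distribution_by_period(records, k=2)
for season, counts in result.items():
    print(season, dict(counts))
```

In this toy data the severe-symptom cluster falls entirely in winter and the mild-symptom cluster entirely in summer, which is the kind of seasonal view of cluster membership that clustering alone would not show.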
