Mining time-interval univariate uncertain sequential patterns

In this study, we propose two algorithms to discover time-interval univariate uncertain (U2) -sequential patterns from a set of univariate uncertain (U2)-sequences. A U2-sequence is a sequence that contains transactions of univariate uncertain data, where each attribute in a transaction is associated with a quantitative interval and a probability density function indicating the possibility that each value exists in the interval. Many sources record U2-sequences, such as atmospheric pollution sensors and network monitoring systems. Mining sequential patterns from these U2-sequences is important for understanding the intrinsic characteristics of the U2-sequences. The proposed two algorithms are based on the candidate generate-and-test methodology and pattern growth methodology, respectively. We performed a series of experiments to evaluate them in terms of runtime and memory consumption. The experimental results show that different algorithms excel when applied to different conditions. In general, the algorithm based on the pattern growth methodology is the better choice.

[1]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[2]  Jiawei Han,et al.  IncSpan: incremental mining of sequential patterns in large database , 2004, KDD.

[3]  Keun Ho Ryu,et al.  Mining temporal interval relational rules from temporal data , 2009, J. Syst. Softw..

[4]  Chih-Jung Chen,et al.  Generating touring path suggestions using time-interval sequential pattern mining , 2012, Expert Syst. Appl..

[5]  Jiawei Han,et al.  BIDE: efficient mining of frequent closed sequences , 2004, Proceedings. 20th International Conference on Data Engineering.

[6]  Joong Hyuk Chang,et al.  Mining weighted sequential patterns in a sequence database with a time-interval weight , 2011, Knowl. Based Syst..

[7]  Yen-Liang Chen,et al.  Discovering fuzzy time-interval sequential patterns in sequence databases , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  Masaru Kitsuregawa,et al.  Mining Algorithms for Sequential Patterns in Parallel: Hash Based Approach , 1998, PAKDD.

[9]  Ben Kao,et al.  A Decremental Approach for Mining Frequent Itemsets from Uncertain Data , 2008, PAKDD.

[10]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[11]  Maguelonne Teisseire,et al.  Incremental mining of sequential patterns in large databases , 2003, Data Knowl. Eng..

[12]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[13]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[14]  Shih-Sheng Chen,et al.  New and efficient knowledge discovery of partial periodic patterns with multiple minimum supports , 2011, J. Syst. Softw..

[15]  Engelbert Mephu Nguifo,et al.  CMRules: Mining sequential rules common to several sequences , 2012, Knowl. Based Syst..

[16]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[17]  Muhammad Muzammal,et al.  Mining Sequential Patterns from Probabilistic Databases by Pattern-Growth , 2011, BNCOD.

[18]  Kyuseok Shim,et al.  SPIRIT: Sequential Pattern Mining with Regular Expression Constraints , 1999, VLDB.

[19]  Edward Hung,et al.  Mining Frequent Itemsets from Uncertain Data , 2007, PAKDD.

[20]  Yasuo Kudo,et al.  A sequential pattern mining algorithm using rough set theory , 2011, Int. J. Approx. Reason..

[21]  Chun-sheng Wang,et al.  Constrained frequent pattern mining on univariate uncertain data , 2013, J. Syst. Softw..

[22]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[23]  Kyuseok Shim,et al.  SQUIRE: Sequential pattern mining with quantities , 2007, J. Syst. Softw..

[24]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[25]  Rajeev Raman,et al.  Mining sequential patterns from probabilistic databases , 2011, Knowledge and Information Systems.

[26]  Unil Yun,et al.  A new framework for detecting weighted sequential patterns in large sequence databases , 2008, Knowl. Based Syst..

[27]  Ming-Tat Ko,et al.  Discovering time-interval sequential patterns in sequence databases , 2003, Expert Syst. Appl..

[28]  Mohamed E. El-Sharkawi,et al.  Vertical Mining of Frequent Patterns from Uncertain Data , 2010, Comput. Inf. Sci..

[29]  Chedy Raïssi,et al.  Towards bounding sequential patterns , 2011, KDD.

[30]  Rajeev Raman,et al.  Uncertainty in Sequential Pattern Mining , 2010, BNCOD.

[31]  Ying-Ho Liu,et al.  Mining frequent patterns from univariate uncertain data , 2012, Data Knowl. Eng..

[32]  Charu C. Aggarwal,et al.  Frequent pattern mining with uncertain data , 2009, KDD.

[33]  João Gama,et al.  Constrained Sequential Pattern Knowledge in Multi-relational Learning , 2011, EPIA.

[34]  Carson Kai-Sang Leung,et al.  A Tree-Based Approach for Frequent Pattern Mining from Uncertain Data , 2008, PAKDD.

[35]  James Bailey,et al.  Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints , 2005, ICDM.

[36]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[37]  Ying-Ho Liu Stream mining on univariate uncertain data , 2012, Applied Intelligence.

[38]  Reynold Cheng,et al.  Mining uncertain data with probabilistic guarantees , 2010, KDD.

[39]  Rajeev Raman,et al.  On Probabilistic Models for Uncertain Sequential Pattern Mining , 2010, ADMA.

[40]  Carson Kai-Sang Leung,et al.  Efficient Mining of Frequent Patterns from Uncertain Data , 2007 .