A new approach for discovering fuzzy quantitative sequential patterns in sequence databases

Given a sequence database and minimum support threshold, the goal of mining quantitative sequential patterns is to discover the complete set of sequential patterns with purchased quantities in databases. Although this type of pattern can provide more information than the traditional sequential pattern, it also causes a sharp boundary problem. This means that when an item's quantity is close to the boundary of two adjacent quantity intervals, it is either ignored or overemphasized. In view of this weakness, a recent paper from Hong, Kuo, and Chi proposed a new kind of extended patterns, called fuzzy quantitative sequential patterns (FQSP), where an item's quantity in the pattern is represented by a fuzzy term rather than a quantity interval. In their work an Apriori-like algorithm was developed to mine all FQSP. In this paper, we propose a new and novel algorithm to mine FQSP based on the divide-and-conquer strategy. Since the proposed algorithm greatly reduces the candidate subsequence generation efforts, the performance is improved significantly. Experiments show that the proposed algorithm is much more efficient and scalable than the previous algorithm.

[1]  Swarup Medasani,et al.  An overview of membership function generation techniques for pattern recognition , 1998, Int. J. Approx. Reason..

[2]  Wai-Ho Au,et al.  FARM: a data mining system for discovering fuzzy association rules , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[3]  Keith C. C. Chan,et al.  Mining fuzzy association rules in a bank-account database , 2003, IEEE Trans. Fuzzy Syst..

[4]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[5]  Jian Pei,et al.  Mining Access Patterns Efficiently from Web Logs , 2000, PAKDD.

[6]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[7]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[8]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[9]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[10]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[11]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[12]  Ping-Yu Hsu,et al.  Mining hybrid sequential patterns and sequential rules , 2002, Inf. Syst..

[13]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[14]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[15]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[16]  Guoqing Chen,et al.  Fuzzy association rules and the extended mining algorithms , 2002, Inf. Sci..

[17]  Michalis Vazirgiannis A Classification and Relationship Extraction Scheme for Raltional Databases Based on Fuzzy Logic , 1998, PAKDD.

[18]  Kyuseok Shim,et al.  SQUIRE: sequential pattern mining with quantities , 2004, Proceedings. 20th International Conference on Data Engineering.

[19]  J.W.T. Lee,et al.  An ordinal framework for data mining of fuzzy rules , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[20]  Bijan Fazlollahi,et al.  A Method for Generation of Alternatives by Decision Support Systems , 2001, J. Manag. Inf. Syst..

[21]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[22]  Keith C. C. Chan,et al.  An effective algorithm for discovering fuzzy rules in relational databases , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[23]  Ming-Tat Ko,et al.  Discovering time-interval sequential patterns in sequence databases , 2003, Expert Syst. Appl..

[24]  Yen-Liang Chen,et al.  Mining sequential patterns from multidimensional sequence data , 2005, IEEE Transactions on Knowledge and Data Engineering.

[25]  Tzung-Pei Hong,et al.  Mining association rules from quantitative data , 1999, Intell. Data Anal..

[26]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[27]  Keith C. C. Chan,et al.  Mining fuzzy association rules , 1997, CIKM '97.

[28]  Tzung-Pei Hong,et al.  Mining fuzzy sequential patterns from quantitative data , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[29]  Weining Zhang,et al.  Mining fuzzy quantitative association rules , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[30]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[31]  Daming Shi,et al.  Mining fuzzy association rules with weighted items , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.