A Fuzzy Data Mining Algorithm for Incremental Mining of Quantitative Sequential Patterns

In real world applications, the databases are constantly added with a large number of transactions and hence maintaining latest sequential patterns valid on the updated database is crucial. Existing data mining algorithms can incrementally mine the sequential patterns from databases with binary values. Temporal transactions with quantitative values are commonly seen in real world applications. In addition, several methods have been proposed for representing uncertain data in a database. In this paper, a fuzzy data mining algorithm for incremental mining of sequential patterns from quantitative databases is proposed. Proposed algorithm called IQSP algorithm uses the fuzzy grid notion to generate fuzzy sequential patterns validated on the updated database containing the transactions in the original database and in the incremental database. It uses the information about sequential patterns that are already mined from original database and avoids start-from-scratch process. Also, it minimizes the number of candidates to check as well as number of scans to original database by identifying the potential sequences in incremental database.

[1]  Zvi M. Kedem,et al.  Pincer-Search: An Efficient Algorithm for Discovering the Maximum Frequent Set , 2002, IEEE Trans. Knowl. Data Eng..

[2]  Elke A. Rundensteiner,et al.  On nearness measures in fuzzy relational data models , 1989, Int. J. Approx. Reason..

[3]  David D. Jensen,et al.  A Family of Algorithms for Finding Temporal Structure in Data , 1997 .

[4]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[5]  Jiawei Han,et al.  IncSpan: incremental mining of sequential patterns in large database , 2004, KDD.

[6]  Vikram Pudi,et al.  Quantifying the Utility of the Past in Mining Large Databases , 2000, Inf. Syst..

[7]  Heikki Mannila,et al.  Knowledge discovery from telecommunication network alarm databases , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[8]  H. Zimmermann Fuzzy sets, decision making, and expert systems , 1987 .

[9]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[10]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[11]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[12]  Ke Wang,et al.  Incremental Discovery of Sequential Patterns , 1996 .

[13]  Mohammed J. Zaki Efficient enumeration of frequent sequences , 1998, CIKM '98.

[14]  Srinivasan Parthasarathy,et al.  Incremental and interactive sequence mining , 1999, CIKM '99.

[15]  D. Cheung,et al.  Maintenance of Discovered Association Rules , 2002 .

[16]  Attila Gyenesei,et al.  A Fuzzy Approach for Mining Quantitative Association Rules , 2000, Acta Cybern..

[17]  Kyuseok Shim,et al.  SPIRIT: Sequential Pattern Mining with Regular Expression Constraints , 1999, VLDB.

[18]  P. A. Paraskevas,et al.  An advanced integrated expert system for wastewater treatment plants control: an addendum , 2003, Knowl. Based Syst..

[19]  Gwo-Hshiung Tzeng,et al.  A Fuzzy Data Mining Algorithm for Finding Sequential Patterns , 2003, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[20]  Suh-Yin Lee,et al.  Incremental update on sequential patterns in large databases , 1998, Proceedings Tenth IEEE International Conference on Tools with Artificial Intelligence (Cat. No.98CH36294).

[21]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[22]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[23]  Thomas H. Davenport,et al.  Book review:Working knowledge: How organizations manage what they know. Thomas H. Davenport and Laurence Prusak. Harvard Business School Press, 1998. $29.95US. ISBN 0‐87584‐655‐6 , 1998 .

[24]  T. Hong,et al.  Mining fuzzy sequential patterns from multiple-item transactions , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[25]  Siegfried Gottwald,et al.  Fuzzy Sets and Fuzzy Logic , 1993 .

[26]  Maguelonne Teisseire,et al.  Incremental mining of sequential patterns in large databases , 2003, Data Knowl. Eng..

[27]  Mohammed J. Zaki,et al.  PlanMine: Sequence Mining for Plan Failures , 1998, KDD.

[28]  Yi-Chung Hu,et al.  Discovering fuzzy association rules using fuzzy partition methods , 2003, Knowl. Based Syst..

[29]  Guizhen Yang,et al.  The complexity of mining maximal frequent itemsets and maximal frequent patterns , 2004, KDD.