Post-mining: maintenance of association rules by weighting

This paper proposes a new strategy for maintaining association rules in dynamic databases. This method uses weighting technique to highlight new data. Our approach is novel in that recently added transactions are given higher weights. In particular, we look at how frequent itemsets can be maintained incrementally. We propose a competitive model to 'promote' infrequent itemsets to frequent itemsets, and to 'degrade' frequent itemsets to infrequent itemsets incrementally. This competitive strategy can avoid retracing the whole data set. We have evalualed the proposed method. The experiments have shown that our approach is efficient and promising.

[1]  Victor J. Rayward-Smith,et al.  Discovering knowledge from low-quality meterological databases , 1999, KDD 1999.

[2]  Huan Liu,et al.  Instance Selection and Construction for Data Mining , 2001 .

[3]  Jiawei Han,et al.  Maintenance of discovered association rules in large databases: an incremental updating technique , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[4]  Rokia Missaoui,et al.  An Incremental Concept Formation Approach for Learning from Databases , 1994, Theor. Comput. Sci..

[5]  Chengqi Zhang,et al.  Anytime mining for multiuser applications , 2002, IEEE Trans. Syst. Man Cybern. Part A.

[6]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[7]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[8]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[9]  Rajeev Motwani,et al.  Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[10]  Chengqi Zhang,et al.  Estimating itemsets of interest by sampling , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[11]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[12]  C No Anytime Mining for Multi-User Applications , 2002 .

[13]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[14]  David Wai-Lok Cheung,et al.  A General Incremental Technique for Maintaining Discovered Association Rules , 1997, DASFAA.

[15]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[16]  Paul E. Utgoff,et al.  Incremental Induction of Decision Trees , 1989, Machine Learning.

[17]  Julius T. Tou,et al.  Information Systems , 1973, GI Jahrestagung.