Proposal of New Objective Measures for Mining Association Rules: Cannibalization and Unexpectedness

In view of the problems that a few of the existing measures for association rules does not directly meet user’s requirements, and association mining algorithms produce huge number of trivial rules, this paper proposes two new objective measures for mining association rules to solve the problems. The first measure is the degree of cannibalization between itemsets, which is bounded up with marketing strategy, and the second is the objective measure that intends to discover unexpected rules in the database. Experimental studies with application to public dataset and comparison of running time using synthetic datasets demonstrate the validity and effectiveness of the proposed measures.

[1]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[2]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[3]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[4]  Edward Omiecinski,et al.  Alternative Interest Measures for Mining Associations in Databases , 2003, IEEE Trans. Knowl. Data Eng..

[5]  Wynne Hsu,et al.  Analyzing the Subjective Interestingness of Association Rules , 2000, IEEE Intell. Syst..

[6]  Marnik G. Dekimpe,et al.  Estimating Cannibalization Rates for Pioneering Innovations , 2010, Mark. Sci..

[7]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[8]  Claudio Carpineto,et al.  Concept data analysis - theory and applications , 2004 .

[9]  Patrick Meyer,et al.  On selecting interestingness measures for association rules: User oriented description and multiple criteria decision aid , 2008, Eur. J. Oper. Res..

[10]  J. T. Curtis,et al.  An Ordination of the Upland Forest Communities of Southern Wisconsin , 1957 .

[11]  E. Nijssen,et al.  Exploring product and service innovation similarities and differences , 2006 .

[12]  Abraham Silberschatz,et al.  What Makes Patterns Interesting in Knowledge Discovery Systems , 1996, IEEE Trans. Knowl. Data Eng..

[13]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[14]  M. Gupta,et al.  Theory of T -norms and fuzzy inference methods , 1991 .

[15]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[16]  J. Kenney,et al.  Mathematics of statistics , 1940 .

[17]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[18]  William Copulsky,et al.  Cannibalism in the Marketplace , 1976 .

[19]  Andreas Kuckertz,et al.  Cannibalization or Synergy? Consumers’ Channel Selection in Online-Offline Multichannel Systems , 2011 .

[20]  Jaideep Srivastava,et al.  Selecting the right objective measure for association analysis , 2004, Inf. Syst..

[21]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.