论文信息 - Introduction to arules – A computational environment for mining association rules and frequent item sets

Introduction to arules – A computational environment for mining association rules and frequent item sets

Mining frequent itemsets and association rules is a popular and well researched approach for discovering interesting relationships between variables in large databases. The R package arules presented in this paper provides a basic infrastructure for creating and manipulating input data sets and for analyzing the resulting itemsets and rules. The package also includes interfaces to two fast mining algorithms, the popular C implementations of Apriori and Eclat by Christian Borgelt. These algorithms can be used to mine frequent itemsets, maximal frequent itemsets, closed frequent itemsets and association rules.

[1] Donald E. Knuth,et al. The art of computer programming: sorting and searching (volume 3) , 1973 .

[2] Hannu Toivonen,et al. Sampling Large Databases for Association Rules , 1996, VLDB.

[3] Nicolas Pasquier,et al. Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[4] Mohammed J. Zaki. Mining Non-Redundant Association Rules , 2004, Data Min. Knowl. Discov..

[5] Jaideep Srivastava,et al. Selecting the right objective measure for association analysis , 2004, Inf. Syst..

[6] Kurt Hornik,et al. Building on the Arules Infrastructure for Analyzing Transaction Data with R , 2006, GfKl.

[7] M. Schwarz,et al. Otto-von-Guericke-University of Magdeburg , 2007 .

[8] Joydeep Ghosh,et al. Distance based clustering of association rules , 1999 .

[9] Nie Yong. Mining quantitative association rules , 2000 .

[10] Mohammed J. Zaki. Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[11] Philip S. Yu,et al. Finding Localized Associations in Market Basket Data , 2002, IEEE Trans. Knowl. Data Eng..

[12] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .

[13] Michael J. A. Berry,et al. Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[14] Christian Borgelt,et al. Induction of Association Rules: Apriori Implementation , 2002, COMPSTAT.

[15] Sushil Jajodia,et al. Proceedings of the 1993 ACM SIGMOD international conference on Management of data , 1993, SIGMOD 1993.

[16] William Frawley,et al. Knowledge Discovery in Databases , 1991 .

[17] Rajeev Motwani,et al. Beyond Market Baskets: Generalizing Association Rules to Dependence Rules , 1998, Data Mining and Knowledge Discovery.

[18] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .

[19] Srinivasan Parthasarathy,et al. New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[20] Dimitrios Gunopulos,et al. Constraint-Based Rule Mining in Large, Dense Databases , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[21] Srinivasan Parthasarathy,et al. Efficient progressive sampling for association rules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[22] Ramakrishnan Srikant,et al. Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[23] Tomasz Imielinski,et al. Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[24] Kurt Hornik,et al. New probabilistic interest measures for association rules , 2007, Intell. Data Anal..

[25] Rajeev Motwani,et al. Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[26] Hui Xiong,et al. Mining strong affinity association patterns in data sets with skewed support distribution , 2003, Third IEEE International Conference on Data Mining.

[27] Kurt Hornik,et al. Implications of probabilistic data modeling for rule mining , 2005 .

[28] Ulrich Güntzer,et al. Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[29] D. Ruppert. The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[30] Edward Omiecinski,et al. Alternative Interest Measures for Mining Associations in Databases , 2003, IEEE Trans. Knowl. Data Eng..

[31] Rajeev Motwani,et al. Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[32] Christopher J. Merz,et al. UCI Repository of Machine Learning Databases , 1996 .

[33] Christian Borgelt,et al. EFFICIENT IMPLEMENTATIONS OF APRIORI AND ECLAT , 2003 .

[34] Heikki Mannila,et al. Efficient Algorithms for Discovering Association Rules , 1994, KDD Workshop.

[35] Bart Goethals,et al. Advances in frequent itemset mining implementations: report on FIMI'03 , 2004, SKDD.

[36] Christian Borgelt. Apriori-Finding Association Rules/Hyperedges with the Apriori Algorithm , 2004 .

[37] William DuMouchel,et al. Empirical bayes screening for multi-item associations , 2001, KDD '01.

[38] Gregory Piatetsky-Shapiro,et al. Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[39] Heike Hofmann,et al. Visual Comparison of Association Rules , 2001, Comput. Stat..

[40] Joydeep Ghosh,et al. Relationship-Based Clustering and Visualization for High-Dimensional Data Mining , 2003, INFORMS J. Comput..

[41] Gerald W. Kimble,et al. Information and Computer Science , 1975 .

[42] Donald E. Knuth,et al. The Art of Computer Programming: Volume 3: Sorting and Searching , 1998 .

[43] Srinivasan Parthasarathy,et al. Evaluation of sampling for data mining of association rules , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[44] Bart Goethals,et al. Advances in Frequent Itemset Mining Implementations: Introduction to FIMI03 , 2003, FIMI.

[45] Wynne Hsu,et al. Pruning and summarizing the discovered associations , 1999, KDD '99.

[46] Ramakrishnan Srikant,et al. Fast algorithms for mining association rules , 1998, VLDB 1998.