CCAR: An efficient method for mining class association rules with itemset constraints

Abstract Class association rules (CARs) are basically used to build a classification model for prediction; they can also be used to describe correlations between itemsets and class labels. The latter is very popular in mining medical data. For example, epidemiologists often consider rules which indicate the relations between risk factors (itemsets) and HIV test results (class labels). However, in the real world, end users are often interested in a subset of class association rules. Particularly, they may consider only rules which contain at least one itemset from a user-defined set of itemsets in the rule antecedent. For example, when classifying which populations are at high risk for HIV infection, epidemiologists often concentrate on rules that include demographic information such as sex, age, and marital status in rule antecedents. Two naive strategies are to solve this problem by applying the itemset constraints into the pre-processing or post-processing step. However, such approaches are time-intensive. This paper thus proposes an efficient method for integrating the constraints into the class association rule mining process. The experimental results show that the proposed algorithm outperforms two basic approaches in the mining time and the memory consumption. The practical benefits of our method are demonstrated by a real-life application in the HIV/AIDS domain.

[1]  Ujjwal Maulik,et al.  A Novel Biclustering Approach to Association Rule Mining for Predicting HIV-1–Human Protein Interactions , 2012, PloS one.

[2]  Bay Vo,et al.  An efficient method for mining frequent itemsets with double constraints , 2014, Eng. Appl. Artif. Intell..

[3]  Engelbert Mephu Nguifo,et al.  CMRules: Mining sequential rules common to several sequences , 2012, Knowl. Based Syst..

[4]  Jagmeet kaur,et al.  Review of Data Mining Applications , 2014 .

[5]  Mihui Kim,et al.  A Combined Data Mining Approach for DDoS Attack Detection , 2004, ICOIN.

[6]  Tzung-Pei Hong,et al.  CAR-Miner: An efficient algorithm for mining class-association rules , 2013, Expert Syst. Appl..

[7]  Man Zhao,et al.  An Algorithm of Mining Class Association Rules , 2009, ISICA.

[8]  Amar K. Das,et al.  A combined data mining approach for infrequent events: analyzing HIV mutation changes based on treatment history. , 2006, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[9]  Ahmad Mirabadi,et al.  Application of Association Rules in Iranian Railways (RAI) Accident Data Analysis , 2010 .

[10]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[11]  Bernard Kamsu-Foguem,et al.  Mining association rules for the quality improvement of the production process , 2013, Expert Syst. Appl..

[12]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[13]  Wen-Yang Lin,et al.  MCFPTree: An FP-tree-based algorithm for multi-constraint patterns discovery , 2010, Int. J. Bus. Intell. Data Min..

[14]  K. Rameshkuma Extracting Association Rules from Hiv Infected Patients’ Treatment Dataset , 2011 .

[15]  Bay Vo,et al.  Mining Class-Association Rules with Constraints , 2013, KSE.

[16]  Hussein H. Aly,et al.  Mining association rules , 2001, CATA.

[17]  Marek Wojciechowski,et al.  Dataset Filtering Techniques in Constraint-Based Frequent Pattern Mining , 2002, Pattern Detection and Discovery.

[18]  Vassilis S. Kodogiannis,et al.  Data mining techniques for HIV/AIDS data management in Thailand , 2008, J. Enterp. Inf. Manag..

[19]  Bay Vo,et al.  A Novel Classification Algorithm Based on Association Rules Mining , 2009, PKAW.

[20]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[21]  Laks V. S. Lakshmanan,et al.  Exploratory mining and pruning optimizations of constrained associations rules , 1998, SIGMOD '98.

[22]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[23]  Amir Abbas Shojaie,et al.  Clustering and association rules in analyzing the efficiency of maintenance system of an urban bus network , 2012, Int. J. Syst. Assur. Eng. Manag..

[24]  Inci Batmaz,et al.  A review of data mining applications for quality improvement in manufacturing industry , 2011, Expert Syst. Appl..

[25]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[26]  Philippe Fournier-Viger,et al.  A computational model for causal learning in cognitive agents , 2012, Knowl. Based Syst..

[27]  Bhasker Pant,et al.  Association Rule Mining to Deduce the Most Frequently Occurring Amino Acid Patterns in HIV , 2012 .

[28]  Bernard Grabot,et al.  Generating knowledge in maintenance from Experience Feedback , 2014, Knowl. Based Syst..

[29]  Bay Vo,et al.  Efficient strategies for parallel mining class association rules , 2014, Expert Syst. Appl..

[30]  Tzung-Pei Hong,et al.  Classification based on association rules: A lattice-based approach , 2012, Expert Syst. Appl..

[31]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[32]  C. Campbell,et al.  The role of HIV counseling and testing in the developing world. , 1997, AIDS education and prevention : official publication of the International Society for AIDS Education.

[33]  F Rhodes,et al.  Efficacy of risk-reduction counseling to prevent human immunodeficiency virus and sexually transmitted diseases: a randomized controlled trial. Project RESPECT Study Group. , 1998, JAMA.

[34]  Engelbert Mephu Nguifo,et al.  Learning task models in ill-defined domain using an hybrid knowledge discovery framework , 2011, Knowl. Based Syst..