EFFICIENT TRANSACTION REDUCTION IN ACTIONABLE PATTERN MINING FOR HIGH VOLUMINOUS DATASETS BASED ON BITMAP AND CLASS LABELS

Frequent pattern mining in databases plays an indispensable role in many data mining tasks namely, classification, clustering, and association rules analysis. When a large number of item sets are processed by the database, it needs to be scanned multiple times. Consecutively, multiple scanning of the database increases the number of rules generation, which then consume more system resources. Existing CCARM (Combined and Composite Association Rule Mining) algorithm used minimum support in order to generate combined actionable association rules, which in turn suffer from the large number of generating rules. Explosion of a large number of rules is the major problem in frequent pattern mining that adds difficult to find the interesting frequent patterns. This paper presents an efficient transaction reduction technique named TR-BC to mine the frequent pattern based on bitmap and class labels. The proposed approach reduces the rule generation by counting the item support and class support instead of only item support. Moreover, the database storage is compressed by using bitmap that significantly reduces the number of database scan. The rules are reduced by horizontal and vertical transaction and then finally combined rules are generated by eliminating the redundancy. Experimental results validate the performance of the proposed approach and expose that proposed method is more effective and efficient than previously proposed algorithm. Index terms - bitmap, item support, database scan, class support, class labels, database transaction, and combined rules.

[1]  Zhenglu Yang,et al.  LAPIN-SPAM: An Improved Algorithm for Mining Sequential Pattern , 2005, 21st International Conference on Data Engineering Workshops (ICDEW'05).

[2]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[3]  Mohammed J. Zaki,et al.  GenMax: An Efficient Algorithm for Mining Maximal Frequent Itemsets , 2005, Data Mining and Knowledge Discovery.

[4]  Qian Wan,et al.  Compact transaction database for efficient frequent pattern mining , 2005, 2005 IEEE International Conference on Granular Computing.

[5]  Jinlin Chen,et al.  BISC: A bitmap itemset support counting approach for efficient frequent itemset mining , 2010, TKDD.

[6]  Young-Koo Lee,et al.  Efficient single-pass frequent pattern mining using a prefix-tree , 2009, Inf. Sci..

[7]  Masaru Kitsuregawa,et al.  Towards Efficient Discovery of Frequent Patterns with Relative Support , 2012, COMAD.

[8]  Chengqi Zhang,et al.  Combined Mining: Discovering Informative Knowledge in Complex Data , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  P. Krishna Reddy,et al.  Selecting a Right Interestingness Measure for Rare Association Rules , 2010, COMAD.

[10]  Suvarna R. Bhagwat Combined Mining And Actionable Pattern Discovery Using DDID-PD Framework: A Review , 2013 .

[11]  Kuen-Fang Jea,et al.  Discovering frequent itemsets by support approximation and itemset clustering , 2008, Data Knowl. Eng..

[12]  Chengqi Zhang,et al.  Combined Pattern Mining: From Learned Rules to Actionable Knowledge , 2008, Australasian Conference on Artificial Intelligence.

[13]  Jie Dong,et al.  BitTableFI: An efficient mining frequent itemsets algorithm , 2007, Knowl. Based Syst..

[14]  P. Krishna Reddy,et al.  Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms , 2011, EDBT/ICDT '11.

[15]  Jiawei Han,et al.  Re-examination of interestingness measures in pattern mining: a unified framework , 2010, Data Mining and Knowledge Discovery.