Efficient Infrequent Pattern Mining Using Negative Itemset Tree

In this work, we focus on a simple and fundamental question: How to find infrequent patterns, i.e. patterns with small support value, in a transactional database. In various practical applications such as science, medical and accident data analysis, frequent patterns usually represent obvious and expected phenomena. Really interesting information might hide in obscure rarity. Existing rare pattern mining approaches are mainly adapted from frequent itemset mining algorithms, which either suffered from the expensive candidate generation step or need to traverse all frequent patterns first. In this paper, we propose an infrequent pattern mining algorithm using a top-down and depth-first traversing strategy to avoid the two obstacles above. A negative itemset tree is employed to accelerate the mining process with its dataset compressing and fast counting ability.

[1]  Tai-Wen Yue,et al.  A Q'tron Neural-Network Approach to Solve the Graph Coloring Problems , 2007 .

[2]  Yun Sing Koh,et al.  Finding Sporadic Rules Using Apriori-Inverse , 2005, PAKDD.

[3]  Antonio Gomariz,et al.  The SPMF Open-Source Data Mining Library Version 2 , 2016, ECML/PKDD.

[4]  Luigi Troiano,et al.  A time-efficient breadth-first level-wise lattice-traversal algorithm to discover rare itemsets , 2013, Data Mining and Knowledge Discovery.

[5]  Wynne Hsu,et al.  Mining association rules with multiple minimum supports , 1999, KDD '99.

[6]  Lei Wu,et al.  Rare Itemset Mining , 2007, ICMLA 2007.

[7]  Mohammed J. Zaki,et al.  Fast vertical mining using diffsets , 2003, KDD '03.

[8]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[9]  Arnab Bhattacharya,et al.  Minimally Infrequent Itemset Mining using Pattern-Growth Paradigm and Residual Trees , 2011, COMAD.

[10]  Hiroki Arimura,et al.  LCM ver. 2: Efficient Mining Algorithms for Frequent/Closed/Maximal Itemsets , 2004, FIMI.

[11]  B. Nath,et al.  An Efficient Approach on Rare Association Rule Mining , 2012, BIC-TA.

[12]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[13]  Wynne Hsu,et al.  Pruning and summarizing the discovered associations , 1999, KDD '99.

[14]  Gillian Dobbie,et al.  RP-Tree: Rare Pattern Tree Mining , 2011, DaWaK.

[15]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[16]  Yun Sing Koh,et al.  Unsupervised Rare Pattern Mining , 2016, ACM Trans. Knowl. Discov. Data.

[17]  Charu C. Aggarwal,et al.  A Tree Projection Algorithm for Generation of Frequent Item Sets , 2001, J. Parallel Distributed Comput..

[18]  Amedeo Napoli,et al.  Towards Rare Itemset Mining , 2007 .

[19]  Shamkant B. Navathe,et al.  Text Mining and Ontology Applications in Bioinformatics and GIS , 2007, International Conference on Machine Learning and Applications.

[20]  Thomas Seidl,et al.  Efficient Infrequent Itemset Mining Using Depth-First and Top-Down Lattice Traversal , 2018, DASFAA.

[21]  Manish Gupta,et al.  Mining Low-Support Discriminative Patterns from Dense and High-Dimensional Data , 2012, IEEE Transactions on Knowledge and Data Engineering.