Mining top-k frequent-regular closed patterns

Mining top-k frequent-regular closed patterns with minimal length is proposed.A new compact bit-vector representation is designed.An efficient single-pass algorithm is proposed. Frequent-regular pattern mining has attracted recently many works. Most of the approaches focus on discovering a complete set of patterns under the user-given support and regularity threshold constraints. This leads to several quantitative and qualitative drawbacks. First, it is often difficult to set appropriate support threshold. Second, algorithms produce a huge number of patterns, many of them being redundant. Third, most of the patterns are of very small size and it is arduous to extract interesting relationship among items. To reduce the number of patterns a common solution is to consider the desired number k of outputs and to mine the top-k patterns. In addition, this approach does not require to set a support threshold. To cope with redundancy and interestingness relationship among items, we suggest to focus on closed patterns and introduce a minimal length constraint. We thus propose to mine the top-k frequent-regular closed patterns with minimal length. An efficient single-pass algorithm, called TFRC-Mine, and a new compact bit-vector representation which allows to prune uninteresting candidate, are designed. Experiments show that the proposed algorithm is efficient to produce longer - non redundant - patterns, and that the new data representation is efficient for both computational time and memory usage.

[1]  Tzung-Pei Hong,et al.  DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets , 2012, Expert Syst. Appl..

[2]  Philippe Lenca,et al.  Efficient Mining Top-k Regular-Frequent Itemset Using Compressed Tidsets , 2011, PAKDD Workshops.

[3]  P. Krishna Reddy,et al.  An Efficient Approach to Mine Periodic-Frequent Patterns in Transactional Databases , 2011, PAKDD Workshops.

[4]  Masaru Kitsuregawa,et al.  Novel Techniques to Reduce Search Space in Periodic-Frequent Pattern Mining , 2014, DASFAA.

[5]  Ho-Jin Choi,et al.  Efficient Mining Regularly Frequent Patterns in Transactional Databases , 2012, DASFAA.

[6]  Devavrat Shah,et al.  Turbo-charging vertical mining of large databases , 2000, SIGMOD 2000.

[7]  Philippe Lenca,et al.  Monitoring the Habits of Elderly People through Data Mining from Home Automation Devices Data , 2013, EPIA.

[8]  Johannes Gehrke,et al.  MAFIA: a maximal frequent itemset algorithm for transactional databases , 2001, Proceedings 17th International Conference on Data Engineering.

[9]  Joong Hyuk Chang,et al.  Mining weighted sequential patterns in a sequence database with a time-interval weight , 2011, Knowl. Based Syst..

[10]  Mohammed Abdul Khaleel,et al.  Medical Data Mining for Discovering Periodically Frequent Diseases from Transactional Databases , 2015 .

[11]  Jie Dong,et al.  BitTableFI: An efficient mining frequent itemsets algorithm , 2007, Knowl. Based Syst..

[12]  Philippe Lenca,et al.  Periodic Episode Discovery Over Event Streams , 2015, EPIA.

[13]  Jie Chen,et al.  Bioinformatics Original Paper Detecting Periodic Patterns in Unevenly Spaced Gene Expression Time Series Using Lomb–scargle Periodograms , 2022 .

[14]  Hua Yuan,et al.  On Discovering Feasible Periodic Patterns in Large Database , 2013, 2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing.

[15]  Philippe Lenca,et al.  Mining top-k regular-frequent itemsets using database partitioning and support estimation , 2012, Expert Syst. Appl..

[16]  Masaru Kitsuregawa,et al.  Discovering Chronic-Frequent Patterns in Transactional Databases , 2015, DNIS.

[17]  Nicolas Pasquier,et al.  Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[18]  Philippe Lenca,et al.  Mining Periodic-Frequent Itemsets with Approximate Periodicity Using Interval Transaction-Ids List Tree , 2010, 2010 Third International Conference on Knowledge Discovery and Data Mining.

[19]  Stefan Lüth,et al.  Serologic Markers Compared With Liver Biopsy for Monitoring Disease Activity in Autoimmune Hepatitis , 2008, Journal of clinical gastroenterology.

[20]  Masaru Kitsuregawa,et al.  Discovering Quasi-Periodic-Frequent Patterns in Transactional Databases , 2013, BDA.

[21]  Philippe Lenca,et al.  Mining Top-K Periodic-Frequent Pattern from Transactional Databases without Support Threshold , 2009, IAIT.

[22]  Jian Tang,et al.  Mining N-most Interesting Itemsets , 2000, ISMIS.

[23]  Amihood Amir,et al.  Approximate Period Detection and Correction , 2012, SPIRE.

[24]  M. Sreedevi,et al.  Mining regular closed patterns in transactional databases , 2013, 2013 7th International Conference on Intelligent Systems and Control (ISCO).

[25]  Sorin Moga,et al.  Closeness Preference - A new interestingness measure for sequential rules mining , 2013, Knowl. Based Syst..

[26]  Byeong-Soo Jeong,et al.  Mining Regular Patterns in Incremental Transactional Databases , 2010, 2010 12th International Asia-Pacific Web Conference.

[27]  J. Engler Mining periodic patterns in manufacturing test data , 2008, IEEE SoutheastCon 2008.

[28]  V. Valli Kumari,et al.  Sliding window technique to mine regular frequent patterns in data streams using vertical format , 2012, 2012 IEEE International Conference on Computational Intelligence and Computing Research.

[29]  Young-Koo Lee,et al.  Discovering Periodic-Frequent Patterns in Transactional Databases , 2009, PAKDD.

[30]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[31]  Jiawei Han,et al.  Mining periodic behaviors for moving objects , 2010, KDD.

[32]  Byeong-Soo Jeong,et al.  Mining Regular Patterns in Data Streams , 2010, DASFAA.

[33]  Bingru Yang,et al.  Index-BitTableFI: An improved algorithm for mining frequent itemsets , 2008, Knowl. Based Syst..

[34]  Jiawei Han,et al.  Mining top-k frequent closed patterns without minimum support , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[35]  P. Krishna Reddy,et al.  Towards Efficient Mining of Periodic-Frequent Patterns in Transactional Databases , 2016, DEXA.