Mining Trending High Utility Itemsets from Temporal Transaction Databases

In this paper, we address a novel and important topic in High Utility Itemset (HUI) mining, named Trending High Utility Itemset (TrendHUI) mining, which promises to expand the applications of HUI mining with the power of trend analytics. We introduce formal definitions for TrendHUI mining and highlight the importance of the TrendHUI output. Moreover, we develop two algorithms: the Two-Phase Trending High Utility Itemset (TP-THUI) miner and the Two-Phase Trending High Utility Itemset Guided (TP-THUI-Guided) miner. Both are two-phase algorithms that mine the complete set of TrendHUIs. The TP-THUI-Guided miner uses a remainder utility when calculating the temporal trend of a given itemset, which effectively reduces the search space and substantially improves execution efficiency. A series of experiments on three different datasets shows that the proposed algorithms are both valid and efficient. To the best of our knowledge, this is the first work to address Trending High Utility Itemset mining, which is expected to facilitate numerous applications in data mining.
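
For readers unfamiliar with the two-phase paradigm mentioned above, the following is a minimal sketch of classic two-phase HUI mining using the transaction-weighted utility (TWU) upper bound. It is not the TP-THUI or TP-THUI-Guided miner: the trend computation and the remainder utility are omitted because they are not defined here. The toy database, the function names (`utility`, `twu`, `two_phase`), and the threshold are assumptions made purely for illustration.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps an item to its utility
# in that transaction (for example, quantity times unit profit).
transactions = [
    {"a": 5, "b": 2},
    {"a": 5, "c": 6},
    {"b": 4, "c": 3, "d": 1},
    {"a": 10, "b": 2, "c": 3},
]


def utility(itemset, db):
    """Exact utility: sum the itemset's item utilities over transactions containing it."""
    return sum(
        sum(t[i] for i in itemset)
        for t in db
        if all(i in t for i in itemset)
    )


def twu(itemset, db):
    """Transaction-weighted utility: total utility of every transaction containing the itemset.

    TWU never underestimates the exact utility and is anti-monotone,
    so it can safely prune candidates level by level.
    """
    return sum(sum(t.values()) for t in db if all(i in t for i in itemset))


def two_phase(db, min_util):
    # Phase 1: level-wise candidate generation, keeping itemsets with TWU >= min_util.
    items = sorted({i for t in db for i in t})
    level = [frozenset([i]) for i in items if twu([i], db) >= min_util]
    candidates = []
    while level:
        candidates.extend(level)
        size = len(level[0]) + 1
        level_set = set(level)
        next_level = set()
        for x, y in combinations(level, 2):
            union = x | y
            if len(union) != size:
                continue
            # Every (size - 1)-subset must itself have survived the TWU check.
            if all(frozenset(s) in level_set for s in combinations(union, size - 1)):
                if twu(union, db) >= min_util:
                    next_level.add(union)
        level = sorted(next_level, key=sorted)

    # Phase 2: compute exact utilities and discard overestimated candidates.
    result = {}
    for c in candidates:
        u = utility(c, db)
        if u >= min_util:
            result[c] = u
    return result


high_utility = two_phase(transactions, min_util=15)
```

With this toy database and a threshold of 15, phase 1 keeps seven candidates and phase 2 retains four of them: {a}, {a, b}, {a, c}, and {a, b, c}. A trend-aware miner would additionally need the temporal information attached to each transaction, which this sketch does not model.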
