Efficient Approach for Damped Window-Based High Utility Pattern Mining With List Structure

Traditional pattern mining is designed to handle binary database that assume all items in the database have same importance, there is a limitation to recognize accurate information from real-world databases using traditional method. To solve this problem, the high utility pattern mining approaches from non-binary database have been proposed and actively studied by many researchers. Lately, new data is progressively created with the passage of time in diverse area such as biometric data of a patient diagnosed in a medical device and log data of an internet user, and the volume of a database is gradually increasing. A database with these characteristics is called a dynamic database. Under these circumstances, high utility mining techniques suitable for analyzing dynamic databases have recently been extensively studied. In this paper, we propose a new list-based algorithm that mines high utility patterns considering the arrival time of each transaction in an incremental database environment. That is, our algorithm efficiently performs pattern pruning by using a damped window model that considers the importance of the previously inputted data lower than that of recently inserted data and identifies high utility patterns. Experimental results indicate that our proposed method has better performance than the state-of-the-art techniques in terms of runtime, memory, and scalability.

[1]  Unil Yun,et al.  Mining high utility itemsets based on the time decaying model , 2016, Intell. Data Anal..

[2]  Philip S. Yu,et al.  HUOPM: High-Utility Occupancy Pattern Mining , 2018, IEEE Transactions on Cybernetics.

[3]  Tzung-Pei Hong,et al.  Efficiently Updating the Discovered Multiple Fuzzy Frequent Itemsets with Transaction Insertion , 2018, Int. J. Fuzzy Syst..

[4]  Jieh-Shan Yeh,et al.  Efficient algorithms for incremental utility mining , 2008, ICUIMC '08.

[5]  Hamido Fujita,et al.  An efficient algorithm for mining high utility patterns from incremental databases with one database scan , 2017, Knowl. Based Syst..

[6]  Philip S. Yu,et al.  A Survey of Utility-Oriented Pattern Mining , 2018, IEEE Transactions on Knowledge and Data Engineering.

[7]  Philippe Fournier-Viger,et al.  Efficient Vertical Mining of High Average-Utility Itemsets Based on Novel Upper-Bounds , 2019, IEEE Transactions on Knowledge and Data Engineering.

[8]  Young-Koo Lee,et al.  Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases , 2009, IEEE Transactions on Knowledge and Data Engineering.

[9]  Benjamin C. M. Fung,et al.  Mining High Utility Patterns in One Phase without Generating Candidates , 2016, IEEE Transactions on Knowledge and Data Engineering.

[10]  Chun-Wei Lin,et al.  Efficient approach for incremental weighted erasable pattern mining with list structure , 2020, Expert Syst. Appl..

[11]  Hamido Fujita,et al.  An efficient method for mining high utility closed itemsets , 2019, Inf. Sci..

[12]  Jimmy Ming-Tai Wu,et al.  TUB-HAUPM: Tighter Upper Bound for Mining High Average-Utility Patterns , 2018, IEEE Access.

[13]  Philip S. Yu,et al.  A Survey of Parallel Sequential Pattern Mining , 2018, ACM Trans. Knowl. Discov. Data.

[14]  Hamido Fujita,et al.  Damped window based high average utility pattern mining over data streams , 2017, Knowl. Based Syst..

[15]  Mengchi Liu,et al.  Mining high utility itemsets without candidate generation , 2012, CIKM.

[16]  Gang Wu,et al.  Mining recent maximal frequent itemsets over data streams with sliding window , 2019, Int. Arab J. Inf. Technol..

[17]  Unil Yun,et al.  Efficient transaction deleting approach of pre-large based high utility pattern mining in dynamic databases , 2020, Future Gener. Comput. Syst..

[18]  Unil Yun,et al.  Efficient incremental high utility pattern mining based on pre-large concept , 2018, Eng. Appl. Artif. Intell..

[19]  Heungmo Ryang,et al.  Indexed list-based high utility pattern mining with utility upper-bound reduction and pattern combination techniques , 2017, Knowledge and Information Systems.

[20]  Wei Liu,et al.  A Self-Adaptive Process Mining Algorithm Based on Information Entropy to Deal With Uncertain Data , 2019, IEEE Access.

[21]  Hamido Fujita,et al.  A survey of incremental high‐utility itemset mining , 2018, WIREs Data Mining Knowl. Discov..

[22]  Philip S. Yu,et al.  Efficient Algorithms for Mining Top-K High Utility Itemsets , 2016, IEEE Transactions on Knowledge and Data Engineering.

[23]  Chin-Chen Chang,et al.  Isolated items discarding strategy for discovering high utility itemsets , 2008, Data Knowl. Eng..

[24]  Benjamin C. M. Fung,et al.  Incremental Mining of High Utility Patterns in One Phase by Absence and Legacy-Based Pruning , 2019, IEEE Access.

[25]  Jimmy Ming-Tai Wu,et al.  High-Utility Itemset Mining with Effective Pruning Strategies , 2019, ACM Trans. Knowl. Discov. Data.

[26]  Durga Toshniwal,et al.  Frequent Pattern Mining on Time and Location Aware Air Quality Data , 2019, IEEE Access.

[27]  Philip S. Yu,et al.  Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases , 2013, IEEE Transactions on Knowledge and Data Engineering.

[28]  Jerry Chun-Wei Lin,et al.  Efficient Incremental High Utility Itemset Mining , 2015, ASE BD&SI.

[29]  Tzung-Pei Hong,et al.  FDHUP: Fast algorithm for mining discriminative high utility patterns , 2017, Knowledge and Information Systems.

[30]  Srikumar Krishnamoorthy,et al.  Pruning strategies for mining high utility itemsets , 2015, Expert Syst. Appl..