A Sliding Window-Based Approach for Mining Frequent Weighted Patterns Over Data Streams

The mining of frequent weighted patterns (FWPs) that considers the different semantic significance (weight) of items is more suitable for practice than the mining of frequent patterns. Therefore, it plays a vital role in real-world scenarios. However, there exist several limitations when applying methods for mining FWPs designed for static data on growth datasets, especially data streams. Hence, this study proposes an algorithm for mining FWPs over data streams. First, we introduce the concept of mining FWPs over data streams via a sliding window model. Then, we introduce a modification of the weighted node tree (WN-tree) named SWN-tree that has the ability to maintain the information over data streams. Next, this study develops a method for mining FWPs over data streams employing a sliding window model based on SWN-tree. This method is called FWPODS (Frequent Weighted Patterns Over Data Stream) algorithm. Finally, we conduct empirical experiments to compare the performances of our approach and the state-of-the-art algorithm (NFWI) for mining FWPs over data streams. The results of experiment indicate that our approach outperforms the NFWI algorithm when running in batch mode in a sliding window.

[1]  Fionn Murtagh,et al.  Weighted Association Rule Mining using weighted support and significance framework , 2003, KDD '03.

[2]  Unil Yun,et al.  Erasable pattern mining based on tree structures with damped window over data streams , 2020, Eng. Appl. Artif. Intell..

[3]  Hamido Fujita,et al.  Efficiently mining erasable stream patterns for intelligent systems over uncertain data , 2020, Int. J. Intell. Syst..

[4]  Keun Ho Ryu,et al.  Mining Frequent Weighted Itemsets without Storing Transaction IDs and Generating Candidates , 2017, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[5]  Zhonghui Wang,et al.  A new algorithm for fast mining frequent itemsets using N-lists , 2012, Science China Information Sciences.

[6]  Unil Yun,et al.  Efficient Approach for Damped Window-Based High Utility Pattern Mining With List Structure , 2020, IEEE Access.

[7]  Witold Pedrycz,et al.  An efficient algorithm for mining frequent weighted itemsets using interval word segments , 2016, Applied Intelligence.

[8]  Hongjun Lu,et al.  A false negative approach to mining frequent itemsets from high speed transactional data streams , 2006, Inf. Sci..

[9]  Sung Wook Baik,et al.  SPPC: a new tree structure for mining erasable patterns in data streams , 2018, Applied Intelligence.

[10]  Mohammad Hadi Sadreddini,et al.  An Efficient Sliding Window Based Algorithm for Adaptive Frequent Itemset Mining over Data Streams , 2013, J. Inf. Sci. Eng..

[11]  Tu-Anh Nguyen-Hoang,et al.  A weighted N-list-based method for mining frequent weighted itemsets , 2018, Expert Syst. Appl..

[12]  Tuong Le,et al.  Mining top-rank-k frequent weighted itemsets using WN-list structures and an early pruning strategy , 2020, Knowl. Based Syst..

[13]  Bay Vo,et al.  An N-list-based algorithm for mining frequent closed patterns , 2015, Expert Syst. Appl..

[14]  Chun-Wei Lin,et al.  Efficient approach of recent high utility stream pattern mining with indexed list structure and pruning strategy considering arrival times of transactions , 2020, Inf. Sci..

[15]  Jerry Chun-Wei Lin,et al.  One scan based high average-utility pattern mining in static and dynamic databases , 2020, Future Gener. Comput. Syst..

[16]  Zhi-Hong Deng,et al.  PrePost+: An efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning , 2015, Expert Syst. Appl..

[17]  Hua-Fu Li,et al.  Incremental mining of closed inter-transaction itemsets over data stream sliding windows , 2011, J. Inf. Sci..

[18]  Tzung-Pei Hong,et al.  Mining frequent itemsets using the N-list and subsume concepts , 2014, Int. J. Mach. Learn. Cybern..

[19]  Hamido Fujita,et al.  Damped window based high average utility pattern mining over data streams , 2017, Knowl. Based Syst..

[20]  Bay Vo,et al.  A novel approach for mining maximal frequent patterns , 2017, Expert Syst. Appl..

[21]  Hui Chen,et al.  Mining frequent patterns in a varying-size sliding window of online transactional data streams , 2012, Inf. Sci..

[22]  Sung Wook Baik,et al.  Efficient algorithms for mining top-rank-k erasable patterns using pruning strategies and the subsume concept , 2018, Eng. Appl. Artif. Intell..

[23]  Unil Yun,et al.  Mining frequent weighted closed itemsets using the WN-list structure and an early pruning strategy , 2020, Applied Intelligence.

[24]  Gautam Srivastava,et al.  Fuzzy high-utility pattern mining in parallel and distributed Hadoop framework , 2021, Inf. Sci..

[25]  Mohammad Hadi Sadreddini,et al.  EclatDS: An efficient sliding window based frequent pattern mining method for data streams , 2011, Intell. Data Anal..

[26]  Frans Coenen,et al.  A new method for mining Frequent Weighted Itemsets based on WIT-trees , 2013, Expert Syst. Appl..

[27]  Yu Zhang,et al.  Mining Frequent Items Over the Distributed Hierarchical Continuous Weighted Data Streams in Internet of Things , 2019, IEEE Access.

[28]  Hong Chen,et al.  An Efficient Algorithm for Frequent Itemset Mining on Data Streams , 2006, Industrial Conference on Data Mining.

[29]  Philippe Fournier-Viger,et al.  Mining high utility itemsets using extended chain structure and utility machine , 2020, Knowl. Based Syst..

[30]  Won Suk Lee,et al.  estMax: Tracing Maximal Frequent Item Sets Instantly over Online Transactional Data Streams , 2009, IEEE Transactions on Knowledge and Data Engineering.

[31]  Won Suk Lee,et al.  Finding recently frequent itemsets adaptively over online transactional data streams, , 2006, Inf. Syst..