An Efficient Incremental Mining Algorithm for Discovering Sequential Pattern in Wireless Sensor Network Environments

Wireless sensor networks (WSNs) are an important type of network for sensing the environment and collecting information. It can be deployed in almost every type of environment in the real world, providing a reliable and low-cost solution for management. Huge amounts of data are produced from WSNs all the time, and it is significant to process and analyze data effectively to support intelligent decision and management. However, the new characteristics of sensor data, such as rapid growth and frequent updates, bring new challenges to the mining algorithms, especially given the time constraints for intelligent decision-making. In this work, an efficient incremental mining algorithm for discovering sequential pattern (novel incremental algorithm, NIA) is proposed, in order to enhance the efficiency of the whole mining process. First, a reasoned proof is given to demonstrate how to update the frequent sequences incrementally, and the mining space is greatly narrowed based on the proof. Second, an improvement is made on PrefixSpan, which is a classic sequential pattern mining algorithm with a high-complexity recursive process. The improved algorithm, named PrefixSpan+, utilizes a mapping structure to extend the prefixes to sequential patterns, making the mining step more efficient. Third, a fast support number-counting algorithm is presented to choose frequent sequences from the potential frequent sequences. A reticular tree is constructed to store all the potential frequent sequences according to subordinate relations between them, and then the support degree can be efficiently calculated without scanning the original database repeatedly. NIA is compared with various kinds of mining algorithms via intensive experiments on the real monitoring datasets, benchmarking datasets and synthetic datasets from aspects including time cost, sensitivity of factors, and space cost. The results show that NIA performs better than the existed methods.

[1]  Rajesh Kumar Boghey,et al.  A sequential tree approach for incremental sequential pattern mining , 2016 .

[2]  Longbing Cao,et al.  e-NSP: Efficient negative sequential pattern mining , 2016, Artif. Intell..

[3]  Bay Vo,et al.  Mining sequential patterns with itemset constraints , 2018, Knowledge and Information Systems.

[4]  Yanyan Wang,et al.  Incremental Mining Algorithm of Sequential Patterns Based on Sequence Tree , 2012 .

[5]  Dawei Liu,et al.  Incremental sequential pattern mining algorithms of Web site access in grid structure database , 2017, Neural Computing and Applications.

[6]  Xindong Wu,et al.  NOSEP: Nonoverlapping Sequence Pattern Mining With Gap Constraints , 2018, IEEE Transactions on Cybernetics.

[7]  Philippe Fournier-Viger,et al.  High-Utility Sequential Pattern Mining with Multiple Minimum Utility Thresholds , 2017, APWeb/WAIM.

[8]  Tzung-Pei Hong,et al.  Incrementally updating the discovered sequential patterns based on pre-large concept , 2015, Intell. Data Anal..

[9]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[10]  Jiun-Long Huang,et al.  Incremental Mining of High Utility Sequential Patterns in Incremental Databases , 2016, CIKM.

[11]  Abdul Razak Hamdan,et al.  IncSPADE: An Incremental Sequential Pattern Mining Algorithm Based on SPADE Property , 2016, ICML 2016.

[12]  Longbing Cao,et al.  Mining Partially-Ordered Sequential Rules Common to Multiple Sequences , 2015, IEEE Trans. Knowl. Data Eng..

[13]  Antonio Gomariz,et al.  SPMF: a Java open-source pattern mining library , 2014, J. Mach. Learn. Res..

[14]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[15]  Teresa A. P. Rocha-Santos,et al.  Recent Progress in Biosensors for Environmental Monitoring: A Review , 2017, Sensors.

[16]  Antonio Gomariz,et al.  The SPMF Open-Source Data Mining Library Version 2 , 2016, ECML/PKDD.

[17]  Susel Fernández,et al.  Optimized Sensor Network and Multi-Agent Decision Support for Smart Traffic Light Management , 2018, Sensors.

[18]  Longbing Cao,et al.  F-NSP+: A fast negative sequential patterns mining method with self-adaptive data storage , 2018, Pattern Recognit..

[19]  Tzung-Pei Hong,et al.  Maintaining the discovered sequential patterns for sequence insertion in dynamic databases , 2014, Eng. Appl. Artif. Intell..

[20]  Xiaolin Meng,et al.  Design and Implementation of a New System for Large Bridge Monitoring—GeoSHM , 2018, Sensors.

[21]  Maria E. Orlowska,et al.  Improvements of IncSpan: Incremental Mining of Sequential Patterns in Large Database , 2005, PAKDD.

[22]  Jiawei Han,et al.  IncSpan: incremental mining of sequential patterns in large database , 2004, KDD.

[23]  Ashley Collier-Oxandale,et al.  Comparing Building and Neighborhood-Scale Variability of CO2 and O3 to Inform Deployment Considerations for Low-Cost Sensor System Use , 2018, Sensors.

[24]  Václav Snásel,et al.  An efficient approach for mining sequential patterns using multiple threads on very large databases , 2018, Eng. Appl. Artif. Intell..

[25]  Manuel Campos,et al.  Fast Vertical Mining of Sequential Patterns Using Co-occurrence Information , 2014, PAKDD.

[26]  Roee Diamant,et al.  Observability Analysis of DVL/PS Aided INS for a Maneuvering AUV , 2015, Sensors.

[27]  Dan Yang,et al.  Passive Infrared (PIR)-Based Indoor Position Tracking for Smart Homes Using Accessibility Maps and A-Star Algorithm , 2018, Sensors.

[28]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[29]  Henrik Edman Sequential Pattern Mining on Electronic Medical Records for Finding Optimal Clinical Pathways , 2018 .

[30]  Kevin Weekly,et al.  Building-in-Briefcase: A Rapidly-Deployable Environmental Sensor Suite for the Smart Building , 2018, Sensors.

[31]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[32]  Li-Ta Hsu,et al.  Passive Sensor Integration for Vehicle Self-Localization in Urban Traffic Environment † , 2015, Sensors.

[33]  Ahmad Almogren,et al.  Scalable regular pattern mining in evolving body sensor data , 2017, Future Gener. Comput. Syst..

[34]  Unil Yun,et al.  Efficient incremental high utility pattern mining based on pre-large concept , 2018, Eng. Appl. Artif. Intell..

[35]  Suh-Yin Lee,et al.  Incremental update on sequential patterns in large databases by implicit merging and efficient counting , 2004, Inf. Syst..