Fast Wireless Sensor Anomaly Detection based on Data Stream in Edge Computing Enabled Smart Greenhouse

Edge computing enabled smart greenhouse is a representative application of Internet of Things technology, which can monitor the environmental information in real time and employ the information to contribute to intelligent decision-making. In the process, anomaly detection for wireless sensor data plays an important role. However, traditional anomaly detection algorithms originally designed for anomaly detection in static data have not properly considered the inherent characteristics of data stream produced by wireless sensor such as infiniteness, correlations and concept drift, which may pose a considerable challenge on anomaly detection based on data stream, and lead to low detection accuracy and efficiency. First, data stream usually generates quickly which means that it is infinite and enormous, so any traditional off-line anomaly detection algorithm that attempts to store the whole dataset or to scan the dataset multiple times for anomaly detection will run out of memory space. Second, there exist correlations among different data streams, which traditional algorithms hardly consider. Third, the underlying data generation process or data distribution may change over time. Thus, traditional anomaly detection algorithms with no model update will lose their effects. Considering these issues, a novel method (called DLSHiForest) on basis of Locality-Sensitive Hashing and time window technique in this paper is proposed to solve these problems while achieving accurate and efficient detection. Comprehensive experiments are executed using real-world agricultural greenhouse dataset to demonstrate the feasibility of our approach. Experimental results show that our proposal is practicable in addressing challenges of traditional anomaly detection while ensuring accuracy and efficiency. © 2015 Published by Elsevier Ltd.

[1]  Manjit Kaur,et al.  Effect of E-learning on public health and environment during COVID-19 lockdown , 2021, Big Data Min. Anal..

[2]  Anjali Sardana,et al.  Prediction of COVID-19 confirmed, death, and cured cases in India using random forest model , 2021, Big Data Min. Anal..

[3]  Xiaolong Xu,et al.  PDM: Privacy-Aware Deployment of Machine-Learning Applications for Industrial Cyber–Physical Cloud Systems , 2021, IEEE Transactions on Industrial Informatics.

[4]  Zhongdao Wang,et al.  Incremental face clustering with optimal summary learning via graph convolutional network , 2021 .

[5]  Gautam Srivastava,et al.  Robust Collaborative Filtering Recommendation With User-Item-Trust Records , 2021, IEEE Transactions on Computational Social Systems.

[6]  Lei Wang,et al.  Optimized Content Caching and User Association for Edge Computing in Densely Deployed Heterogeneous Networks , 2020, IEEE Transactions on Mobile Computing.

[7]  Xuyun Zhang,et al.  A balanced virtual machine scheduling method for energy-performance trade-offs in cyber-physical cloud systems , 2017, Future Gener. Comput. Syst..

[8]  Yun Li,et al.  Joint Optimization of Radio and Virtual Machine Resources With Uncertain User Demands in Mobile Cloud Computing , 2018, IEEE Transactions on Multimedia.

[9]  Wen-Liang Hwang,et al.  EMD Revisited: A New Understanding of the Envelope and Resolving the Mode-Mixing Problem in AM-FM Signals , 2012, IEEE Transactions on Signal Processing.

[10]  Kai Ming Ting,et al.  Fast Anomaly Detection for Streaming Data , 2011, IJCAI.

[11]  Miriam A. M. Capretz,et al.  An ensemble learning framework for anomaly detection in building energy consumption , 2017 .

[12]  K. Srinathan,et al.  LSH based outlier detection and its application in distributed setting , 2011, CIKM '11.

[13]  Xuyun Zhang,et al.  Privacy-Aware Data Fusion and Prediction With Spatial-Temporal Context for Smart City Industrial Environment , 2021, IEEE Transactions on Industrial Informatics.

[14]  Xiyuan Hu,et al.  PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution , 2020, ACM Multimedia.

[15]  Bin Cao,et al.  Lyapunov Optimization-Based Trade-Off Policy for Mobile Cloud Offloading in Heterogeneous Wireless Networks , 2019, IEEE Transactions on Cloud Computing.

[16]  Fan Yang,et al.  Streaming data anomaly detection method based on hyper-grid structure and online ensemble learning , 2017, Soft Comput..

[17]  Vijander Singh,et al.  Analysis and predictions of spread, recovery, and death caused by COVID-19 in India , 2021, Big Data Min. Anal..

[18]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[19]  Xiaomei Yu,et al.  Research cooperations of blockchain: toward the view of complexity network , 2020, Journal of Ambient Intelligence and Humanized Computing.

[20]  Yun Li,et al.  A Load-Balanced Re-Embedding Scheme for Wireless Network Virtualization , 2021, IEEE Transactions on Vehicular Technology.

[21]  Xuyun Zhang,et al.  An attention‐based category‐aware GRU model for the next POI recommendation , 2021, Int. J. Intell. Syst..

[22]  Gautam Srivastava,et al.  Diversified and Scalable Service Recommendation With Accuracy Guarantee , 2020, IEEE Transactions on Computational Social Systems.

[23]  Shirish Tatikonda,et al.  Locality Sensitive Outlier Detection: A ranking driven approach , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[24]  Xing Zhang,et al.  Adaptive Computation Offloading With Edge for 5G-Envisioned Internet of Connected Vehicles , 2020, IEEE Transactions on Intelligent Transportation Systems.

[25]  Qiang He,et al.  LSHiForest: A Generic Framework for Fast Tree Isolation Based Ensemble Anomaly Analysis , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[26]  Georges Kaddoum,et al.  Securing Fog-to-Things Environment Using Intrusion Detection System Based On Ensemble Learning , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[27]  Sencun Zhu,et al.  Preserving personalized location privacy in ride-hailing service , 2020 .

[28]  Guoyin Wang,et al.  Lifetime-Priority-Driven Resource Allocation for WNV-Based Internet of Things , 2021, IEEE Internet of Things Journal.

[29]  Gilberto Reynoso-Meza,et al.  Ensemble learning by means of a multi-objective optimization design approach for dealing with imbalanced data sets , 2020, Expert Syst. Appl..

[30]  Zhipeng Cai,et al.  Trading Private Range Counting over Big IoT Data , 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[31]  Xuyun Zhang,et al.  Diversified service recommendation with high accuracy and efficiency , 2020, Knowl. Based Syst..

[32]  Sudipto Guha,et al.  Robust Random Cut Forest Based Anomaly Detection on Streams , 2016, ICML.

[33]  Yingshu Li,et al.  Collective Data-Sanitization for Preventing Sensitive Information Inference Attacks in Social Networks , 2018, IEEE Transactions on Dependable and Secure Computing.

[34]  Joshua Zhexue Huang,et al.  A survey of data partitioning and sampling methods to support big data analysis , 2020, Big Data Min. Anal..

[35]  Zhenkun Wen,et al.  Machine learning-based multi-modal information perception for soft robotic hands , 2020 .

[36]  Cynthia Rudin,et al.  Online coordinate boosting , 2008, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[37]  Ying Zhong,et al.  HELAD: A novel network anomaly detection model based on heterogeneous ensemble learning , 2020, Comput. Networks.

[38]  Xiaolong Xu,et al.  Artificial intelligence for edge service optimization in Internet of Vehicles: A survey , 2022, Tsinghua Science and Technology.

[39]  Philip S. Yu,et al.  RS-Forest: A Rapid Density Estimator for Streaming Anomaly Detection , 2014, 2014 IEEE International Conference on Data Mining.

[40]  Zhipeng Cai,et al.  A Private and Efficient Mechanism for Data Uploading in Smart Cyber-Physical Systems , 2020, IEEE Transactions on Network Science and Engineering.

[41]  A. Madansky Identification of Outliers , 1988 .

[42]  Yong Wang,et al.  Energy-Efficient Optimal Relay Selection in Cooperative Cellular Networks Based on Double Auction , 2015, IEEE Transactions on Wireless Communications.

[43]  Jin Wang,et al.  Anomaly detection model based on data stream clustering , 2017, Cluster Computing.

[44]  Ahmed Sallam,et al.  DELR: A double-level ensemble learning method for unsupervised anomaly detection , 2019, Knowl. Based Syst..

[45]  Gautam Srivastava,et al.  Service Offloading With Deep Q-Network for Digital Twinning-Empowered Internet of Vehicles in Edge Computing , 2022, IEEE Transactions on Industrial Informatics.