Concept drift detection with False Positive rate for multi-label classification in IoT data stream

Machine learning, as a significant component of the Industrial Internet of Things (IIoT), has been widely applied in many fields. The continuously generated data from the various sensors are collected and stored, this is also known as a data stream. However, the non-stationary phenomenon in data stream, concept drift, is important to be detected immediately for the operation of the IoT system. Therefore, the detection method for concept drift is needed to alert the requirement to maintain or replace some components in advance, so as to avoid or mitigate the risk of malfunction of the IoT system. The majority of existing literature focuses on concept drift detection on binary classification. To fill this gap, here we propose an algorithm to detect multi-class. Moreover to improve the performance of detection, we also introduce an algorithm which integrates the existing error rate which is widely used with our newly proposed False Positive rate. The new method is called Drift Detection Method with False Positive rate for multi-label classification (DDM-FP-M). The DDM-FP-M firstly defines the false positive rate calculation method in multi-label classification, then integrates it with Drift Detection method with False positive rate (DDM-FP). The performance of the proposed method is evaluated through Intel Lab data and is found to outperform the Drift Detection method(DDM) over 50% cases.

[1]  A. Bifet,et al.  Early Drift Detection Method , 2005 .

[2]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[3]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[4]  Huiyu Zhou,et al.  Multi-Source Transfer Learning for Non-Stationary Environments , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[5]  Lei Du,et al.  Detecting concept drift: An information entropy based method using an adaptive sliding window , 2014, Intell. Data Anal..

[6]  Shujian Yu,et al.  Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing , 2017, J. Frankl. Inst..

[7]  A. Dawid,et al.  Prequential probability: principles and properties , 1999 .

[8]  Der-Jiunn Deng,et al.  Concept Drift Detection and Adaption in Big Imbalance Industrial IoT Data Using an Ensemble Learning Method of Offline Classifiers , 2019, IEEE Access.

[9]  Geoff Holmes,et al.  MOA: Massive Online Analysis , 2010, J. Mach. Learn. Res..

[10]  Yi Xu,et al.  Low-Energy Data Collection in Wireless Sensor Networks Based on Matrix Completion , 2019, Sensors.

[11]  Dimitris K. Tasoulis,et al.  Exponentially weighted moving average charts for detecting concept drift , 2012, Pattern Recognit. Lett..

[12]  Guangjie Han,et al.  Concept drift detection for data stream learning based on angle optimized global embedding and principal component analysis in sensor networks , 2017, Comput. Electr. Eng..

[13]  Peter A. Flach,et al.  Machine Learning - The Art and Science of Algorithms that Make Sense of Data , 2012 .